[WIP] Minor optimizations #8105

andrykonchin · 2020-06-06T11:10:06Z

Addresses the issue #8022

NOTE I didn't analyze the optimizations deeply so some of them can be occasional and will be dropped. Some code can be too quick and straightforward so will be polished and rewritten later.

Profiled Rubocop with stackprof and ran against following Ruby projects:

Rails
Chef
Puppet
aws-ruby-sdk
Rubocop

Rubocop was run with disabled cache and ignored rubocop.yml config of selected Ruby projects

exe/rubocop --force-default-config --cache false --out rubocop.out <DIR>

Current result

Despite the largest bottlenecks were optimized the general effect is not so big and total time of rubocop command is decreased only by 5-10%.

Before submitting the PR make sure the following are checked:

Wrote good commit messages.
Commit message starts with [Fix #issue-number] (if the related issue exists).
Feature branch is up-to-date with master (if not - rebase it).
Squashed related commits together.
Added tests.
Added an entry to the Changelog if the new code introduces user-observable changes. See changelog entry format.
The PR relates to only one subject with a clear title and description in grammatically correct, complete sentences.
Run bundle exec rake default. It executes all tests and RuboCop for itself, and generates the documentation.

marcandre · 2020-06-07T05:02:31Z

lib/rubocop/cop/mixin/classish_length.rb

+        line_numbers.uniq
+      end
+
+      def _each_descendant(node, *types, &block)


Are this and the next method used at all?

marcandre · 2020-06-07T05:06:04Z

lib/rubocop/cop/cop.rb

@@ -142,10 +148,15 @@ def find_location(node, loc)
        loc.is_a?(Symbol) ? node.loc.public_send(loc) : loc
      end

+      # TODO: remove - it's unused


Cop::Cop is being heavily refactored, all these changes will conflict, but thanks for reminding me to optimize duplicate_location? as Source::Range can now be used as hash keys / set elements

@marcandre This optimization can significantly decrease execution time in some cases.

There is a large rubocop.todo.yml in one of my projects. If run Rubocop without config (so there are thousands of errors) this optimization decrease the time from 20 min to 5.

I agree this was a bad implementation (O(n^2) where n is the number of offenses).
You can try with def duplicate_location?(range); !@current_offense_locations.add?(range); end and adding @current_offense_locations = Set.new in the constructor, if you want, you'll also be in the 5 minutes range... This is the implementation I'm using in my branch #7868

marcandre · 2020-06-07T05:06:55Z

lib/rubocop/cop/layout/line_length.rb

@@ -213,7 +213,7 @@ def excess_range(uri_range, line, line_index)
        end

        def max
-          cop_config['Max']
+          @max ||= cop_config['Max']


I highly doubt this makes much of a difference, as cop_config is already cached (as it should) and the rest is just a hash lookup with a frozen string literal which is highly optimized.

marcandre · 2020-06-07T05:08:10Z

lib/rubocop/cop/mixin/tokens_optimized.rb

@@ -0,0 +1,17 @@
+module RuboCop


Right, building a tokens index should make a good difference. I'm curious if we really need to use tokens though. If we do, the best would be to share the cache with a Force, even though they haven't really be designed for this.

bbatsov · 2020-06-07T06:01:01Z

Despite the largest bottlenecks were optimized the general effect is not so big and total time of rubocop command is decreased only by 5-10%.

5-10% are not little. :-)

andrykonchin · 2020-06-07T09:29:43Z

@marcandre Thank you for your review. Will come back to this PR soon.

andrykonchin added 15 commits May 31, 2020 03:59

@wip RuboCop::Cop::Layout::SpaceInsideArrayLiteralBrackets#on_array

001e733

@wip Metrics::BlockLength#on_block

a3079a2

@wip Style::WordArray#on_array

df88856

@wip Regexp TODOs

f2022f2

@wip Fix typos

fc3c6dd

@wip Layout::FirstArgumentIndentation#on_send

842146f

@wip remove todo

fa20fdf

@wip Metrics::ClassLength#on_class

162f35e

@wip RuboCop::Cop::StringHelp#on_str (RuboCop::Cop::Cop#add_offense)

d004db6

@wip Style::NumericPredicate#on_send

ac8310e

@wip RuboCop::Cop::Style::NumericPredicate#on_send

34283fd

@wip LineLength

b58f788

@wip @continue RuboCop::Cop::Style::NumericPredicate#on_send

923d457

@wip Naming::PredicateName#on_def

b52773e

@wip RuboCop::Cop::Layout::SpaceInsideArrayLiteralBrackets#on_array

7a4b978

andrykonchin marked this pull request as draft June 6, 2020 22:57

andrykonchin changed the title ~~[DRAFT][WIP] Minor optimizations~~ [WIP] Minor optimizations Jun 6, 2020

marcandre reviewed Jun 7, 2020

View reviewed changes

andrykonchin closed this Jul 24, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[WIP] Minor optimizations #8105

[WIP] Minor optimizations #8105

andrykonchin commented Jun 6, 2020

marcandre Jun 7, 2020

marcandre Jun 7, 2020

andrykonchin Jun 10, 2020

marcandre Jun 10, 2020

marcandre Jun 7, 2020

marcandre Jun 7, 2020

bbatsov commented Jun 7, 2020

andrykonchin commented Jun 7, 2020

[WIP] Minor optimizations #8105

[WIP] Minor optimizations #8105

Conversation

andrykonchin commented Jun 6, 2020

Current result

marcandre Jun 7, 2020

Choose a reason for hiding this comment

marcandre Jun 7, 2020

Choose a reason for hiding this comment

andrykonchin Jun 10, 2020

Choose a reason for hiding this comment

marcandre Jun 10, 2020

Choose a reason for hiding this comment

marcandre Jun 7, 2020

Choose a reason for hiding this comment

marcandre Jun 7, 2020

Choose a reason for hiding this comment

bbatsov commented Jun 7, 2020

andrykonchin commented Jun 7, 2020