[Fix #8708] Fix bad regexp recognition in `Lint/OutOfRangeRegexpRef` when there are multiple regexps #8844

dvandersluis · 2020-10-03T05:29:02Z

Lint/OutOfRangeRegexpRef was using the wrong regexp to determine how many capturing groups there are, in a few cases, that are now fixed. I also updated the error message to be a bit more straightforward because I found it hard to understand, but please let me know if you'd like me to reword or revert.

Also I have no idea what the plural of regexp should be. 😅

Fixes #8708.

Before submitting the PR make sure the following are checked:

Wrote good commit messages.
Commit message starts with [Fix #issue-number] (if the related issue exists).
Feature branch is up-to-date with master (if not - rebase it).
Squashed related commits together.
Added tests.
Added an entry to the Changelog if the new code introduces user-observable changes. See changelog entry format.
The PR relates to only one subject with a clear title and description in grammatically correct, complete sentences.
Run bundle exec rake default. It executes all tests and RuboCop for itself, and generates the documentation.

marcandre · 2020-10-08T08:38:07Z

Update: I started reviewing, love the message changes 👍
It would be quite nice to have a after_send callback of some kind, instead of messing around with ignore things... I'm checking a few things on the Commissionner side.

dvandersluis · 2020-10-08T15:04:27Z

@marcandre thanks! Let me know if you want me to change something.

marcandre · 2020-10-08T16:00:27Z

~~Could you check if it also has this issue? #8708~~

dvandersluis · 2020-10-08T16:07:02Z

I'm not sure what you mean? This PR is for that issue haha 😅

dvandersluis · 2020-10-08T16:20:11Z

Oh I assume you meant #8862 / #8863 ? That's a different cop so there shouldn't be a conflict.

dvandersluis · 2020-10-08T16:24:00Z

Updated to fix the changelog now that 0.93 is out.

marcandre

Oh I assume you meant #8862 / #8863 ? That's a different cop so there shouldn't be a conflict.

Ooops, yes, my double mistake 🤦‍♂️, need coffee apparently

lib/rubocop/cop/lint/out_of_range_regexp_ref.rb

marcandre · 2020-10-08T22:09:58Z

So I checked about adding on_after_xxx and it's sadly in a hotspot, it would cost a 5% slowdown which is pretty sad 😿
I could add it only just for send, wouldn't be so bad, or I could introduce some API to add a callback on the fly, but that feels a bit ugly...
So I'm tempted to just leave this as is (after the conflict is resolved...) even though the logic is quite convoluted just because we don't currently have a way do processing after a particular node's children are processed...

dvandersluis · 2020-10-08T23:54:07Z

@marcandre fixed the conflict, happy to make other changes if you want, just let me know!

…boCop, at around 8% of overall processing. This is because we currently build a new `Commissionner` for every file (could be optimized but not easy as config can change) and a commissioner must know which cops need responding to which node types. Previous method was thus `O(files * cops * types)` where `types` is the number of node types encountered in a typical file. Testing on `files == 200` typical RuboCop files, I calculated `types ~22.5`. Currently `cops == 422`. This new algorithm is the same `O` but `types` is instead the number of types that a cop responds to (on average). For the cops that we run that is ~1.8. The optimized cache building is almost 6x faster, for an *overall gain of ~6.6%* It also has the advantage that adding new callbacks has zero impact on the cache building. Note: I'd like to add `on_after_send` etc (see #8844). The only downside is that the list of callbacks is (by default) cached per Cop class. This means that any Cop that somehow relies on *adding* callbacks it responds to *at runtime* is incompatible. There's a single cop that does this (see #8881). Solutions include: not doing that (as in the PR), or only modifying the callbacks (in this case it could have been done by adding empty methods `on_send`, etc., even though they would be overriden by the extending modules), or overriding `Cop::Base#callbacks_needed` (although I marked the api as private for now). How I tested performance: ``` $ stackprof tmp/stackprof.dump --text --method 'RuboCop::Cop::Commissioner#cops_callbacks_for' samples: 1039 self (7.5%) / 1104 total (8.0%) $ stackprof tmp/stackprof.dump --text --method 'RuboCop::Cop::Commissioner#initialize_callbacks' samples: 25 self (0.2%) / 180 total (1.4%) ```

marcandre · 2020-10-13T01:25:27Z

Update: I opened #8889; I'm waiting a day or two to see if the latest bug fix release is stable and I'll merge that, which would give you the on_after_send that would simplify the processing I believe.

…boCop, at around 8% of overall processing. This is because we currently build a new `Commissionner` for every file (could be optimized but not easy as config can change) and a commissioner must know which cops need responding to which node types. Previous method was thus `O(files * cops * types)` where `types` is the number of node types encountered in a typical file. Testing on `files == 200` typical RuboCop files, I calculated `types ~22.5`. Currently `cops == 422`. This new algorithm is the same `O` but `types` is instead the number of types that a cop responds to (on average). For the cops that we run that is ~1.8. The optimized cache building is almost 6x faster, for an *overall gain of ~6.6%* It also has the advantage that adding new callbacks has zero impact on the cache building. Note: I'd like to add `on_after_send` etc (see #8844). The only downside is that the list of callbacks is (by default) cached per Cop class. This means that any Cop that somehow relies on *adding* callbacks it responds to *at runtime* is incompatible. There's a single cop that does this (see #8881). Solutions include: not doing that (as in the PR), or only modifying the callbacks (in this case it could have been done by adding empty methods `on_send`, etc., even though they would be overriden by the extending modules), or overriding `Cop::Base#callbacks_needed` (although I marked the api as private for now). How I tested performance: ``` $ stackprof tmp/stackprof.dump --text --method 'RuboCop::Cop::Commissioner#cops_callbacks_for' samples: 1039 self (7.5%) / 1104 total (8.0%) $ stackprof tmp/stackprof.dump --text --method 'RuboCop::Cop::Commissioner#initialize_callbacks' samples: 25 self (0.2%) / 180 total (1.4%) ```

marcandre · 2020-10-20T12:51:03Z

after_send is now available...

dvandersluis · 2020-10-20T14:07:17Z

@marcandre cool thanks, I’ll probably Be able to update this tomorrow!

marcandre · 2020-10-20T20:55:27Z

I believe this or can be simplified without using an ignore list...

…

On Thu, Oct 8, 2020, 11:04 Daniel Vandersluis ***@***.***> wrote: @marcandre <https://github.com/marcandre> thanks! Let me know if you want me to change something. — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub <#8844 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AAAIH2WQPHNZ3RGTPGDNXF3SJXII3ANCNFSM4SCNZDPA> .

dvandersluis · 2020-10-21T15:38:07Z

@marcandre updated to use after_send, thanks that really simplified it!

lib/rubocop/cop/lint/out_of_range_regexp_ref.rb

spec/rubocop/cop/lint/out_of_range_regexp_ref_spec.rb

CHANGELOG.md

dvandersluis · 2020-10-21T16:51:03Z

@bbatsov made all the requested changes, thanks!

bbatsov · 2020-10-25T06:54:39Z

Can you also address the changelog conflict?

marcandre · 2020-10-26T09:17:10Z

Can you also address the changelog conflict?

You can use rake changelog:fix and al. in the future 😄

…xpRef` when there are multiple regexps.

dvandersluis · 2020-10-26T15:48:37Z

@bbatsov @marcandre moved the changelog item in changelog/

marcandre · 2020-10-26T19:49:48Z

Outstanding work @dvandersluis 👍
Thanks!

dvandersluis force-pushed the issue/8708 branch from 206a75e to 657d7d8 Compare October 3, 2020 18:18

marcandre mentioned this pull request Oct 7, 2020

Default for Style/FormatStringToken #8827

Open

marcandre self-assigned this Oct 8, 2020

dvandersluis force-pushed the issue/8708 branch from 657d7d8 to fbc4b7a Compare October 8, 2020 16:22

marcandre reviewed Oct 8, 2020

View reviewed changes

lib/rubocop/cop/lint/out_of_range_regexp_ref.rb Outdated Show resolved Hide resolved

dvandersluis force-pushed the issue/8708 branch from fbc4b7a to 8497895 Compare October 8, 2020 18:40

dvandersluis force-pushed the issue/8708 branch from 8497895 to 7ff174f Compare October 8, 2020 23:53

dvandersluis force-pushed the issue/8708 branch 2 times, most recently from 15e88e3 to 243222b Compare October 9, 2020 17:14

marcandre mentioned this pull request Oct 10, 2020

Optimize Commissioner callbacks #8882

Merged

dvandersluis force-pushed the issue/8708 branch from 243222b to f380667 Compare October 21, 2020 15:37

bbatsov reviewed Oct 21, 2020

View reviewed changes

lib/rubocop/cop/lint/out_of_range_regexp_ref.rb Outdated Show resolved Hide resolved

bbatsov reviewed Oct 21, 2020

View reviewed changes

spec/rubocop/cop/lint/out_of_range_regexp_ref_spec.rb Outdated Show resolved Hide resolved

bbatsov reviewed Oct 21, 2020

View reviewed changes

CHANGELOG.md Outdated Show resolved Hide resolved

dvandersluis force-pushed the issue/8708 branch from f380667 to 58823be Compare October 21, 2020 16:50

dvandersluis force-pushed the issue/8708 branch from 58823be to 28c8713 Compare October 21, 2020 17:00

dvandersluis added 2 commits October 26, 2020 11:38

Improved offense message for Lint/OutOfRangeRegexpRef

dc7260b

[Fix rubocop#8708] Fix bad regexp recognition in `Lint/OutOfRangeRege…

d202e85

…xpRef` when there are multiple regexps.

dvandersluis force-pushed the issue/8708 branch from 28c8713 to d202e85 Compare October 26, 2020 15:47

marcandre merged commit 3ebc79f into rubocop:master Oct 26, 2020

dvandersluis deleted the issue/8708 branch January 18, 2021 20:42

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Fix #8708] Fix bad regexp recognition in `Lint/OutOfRangeRegexpRef` when there are multiple regexps #8844

[Fix #8708] Fix bad regexp recognition in `Lint/OutOfRangeRegexpRef` when there are multiple regexps #8844

dvandersluis commented Oct 3, 2020

marcandre commented Oct 8, 2020

dvandersluis commented Oct 8, 2020

marcandre commented Oct 8, 2020 •

edited

dvandersluis commented Oct 8, 2020

dvandersluis commented Oct 8, 2020

dvandersluis commented Oct 8, 2020

marcandre left a comment

marcandre commented Oct 8, 2020

dvandersluis commented Oct 8, 2020

marcandre commented Oct 13, 2020

marcandre commented Oct 20, 2020

dvandersluis commented Oct 20, 2020

marcandre commented Oct 20, 2020 via email

dvandersluis commented Oct 21, 2020

dvandersluis commented Oct 21, 2020

bbatsov commented Oct 25, 2020

marcandre commented Oct 26, 2020

dvandersluis commented Oct 26, 2020 •

edited

marcandre commented Oct 26, 2020

[Fix #8708] Fix bad regexp recognition in Lint/OutOfRangeRegexpRef when there are multiple regexps #8844

[Fix #8708] Fix bad regexp recognition in Lint/OutOfRangeRegexpRef when there are multiple regexps #8844

Conversation

dvandersluis commented Oct 3, 2020

marcandre commented Oct 8, 2020

dvandersluis commented Oct 8, 2020

marcandre commented Oct 8, 2020 • edited

dvandersluis commented Oct 8, 2020

dvandersluis commented Oct 8, 2020

dvandersluis commented Oct 8, 2020

marcandre left a comment

Choose a reason for hiding this comment

marcandre commented Oct 8, 2020

dvandersluis commented Oct 8, 2020

marcandre commented Oct 13, 2020

marcandre commented Oct 20, 2020

dvandersluis commented Oct 20, 2020

marcandre commented Oct 20, 2020 via email

dvandersluis commented Oct 21, 2020

dvandersluis commented Oct 21, 2020

bbatsov commented Oct 25, 2020

marcandre commented Oct 26, 2020

dvandersluis commented Oct 26, 2020 • edited

marcandre commented Oct 26, 2020

[Fix #8708] Fix bad regexp recognition in `Lint/OutOfRangeRegexpRef` when there are multiple regexps #8844

[Fix #8708] Fix bad regexp recognition in `Lint/OutOfRangeRegexpRef` when there are multiple regexps #8844

marcandre commented Oct 8, 2020 •

edited

dvandersluis commented Oct 26, 2020 •

edited