Support free-spacing mode `Regexp` in `Naming/InclusiveLanguage` #11832

sambostock · 2023-05-01T02:51:03Z

WIP – See #11831.

This changes how the Naming/InclusiveLanguage cop combines its configured terms into the Regexp it matches against. Previously, we converted any given Regexp to a String via Regexp#source, and then combined the patterns/strings using

Regexp.new(strings.join('|'), Regexp::IGNORECASE)

This discards any mode/options information about the Regexp. For a Regexp in "free-spacing"/"extended" mode, that corrupts the pattern: whitespace that the x flag would have ignored becomes part of the pattern.
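To make the failure concrete, here is a minimal sketch of the old combination step applied to a free-spacing pattern. The pattern and the 'blacklist' string are made-up examples, not taken from the cop's default configuration.

```ruby
# Free-spacing (x) pattern: the spaces are layout only and are ignored while x is set.
free_spacing = /white [-\s]? list/x

# Old approach: keep only the source String, then rebuild with IGNORECASE alone.
joined = Regexp.new([free_spacing.source, 'blacklist'].join('|'), Regexp::IGNORECASE)

puts free_spacing.match?('whitelist') # true: the original pattern matches
puts joined.match?('whitelist')       # false: the x flag was lost
puts joined.match?('white  list')     # true: the spaces are now literal
```

The rebuilt Regexp still compiles, so the corruption is silent: it simply stops matching the terms it was written for.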

This changes the approach to normalize patterns as Regexps instead of Strings, and then simply combine them using Regexp.union. To preserve the existing behaviour of always making the patterns case insensitive, we extract the source and options from any given Regexp, and construct a new one where we force case insensitivity using

Regexp.new(regexp.source, regexp.options | Regexp::IGNORECASE)

This way, we preserve all other flags (/.../x, but also others like /.../m).
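A rough sketch of the new approach, for illustration only. The helper name normalize_pattern is hypothetical, and treating a String as Regexp source (rather than escaping it) is an assumption made here to mirror the old join-based behaviour.

```ruby
# Hypothetical helper: turn a configured pattern into a case-insensitive Regexp
# while keeping every other flag it already had.
def normalize_pattern(pattern)
  case pattern
  when Regexp then Regexp.new(pattern.source, pattern.options | Regexp::IGNORECASE)
  else Regexp.new(pattern.to_s, Regexp::IGNORECASE) # assumption: Strings are pattern source
  end
end

patterns = [/white [-\s]? list/x, /a.b/m, 'blacklist']

# Regexp.union embeds each Regexp with its own flags, e.g. (?ix-m:...).
combined = Regexp.union(patterns.map { |pattern| normalize_pattern(pattern) })

puts combined.match?('WhiteList') # true: x preserved, i added
puts combined.match?("A\nB")      # true: m preserved, i added
puts combined.match?('BLACKLIST') # true
```

Because Regexp.union serializes each member with its flags inline, one pattern's x or m flag does not leak into its neighbours.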

Note that it was previously possible to work around this by applying options at the subexpression level, e.g. (?x: ... ). That remains possible, and the same mechanism can be used to opt out of the forced case insensitivity, e.g. (?-i: ... ).
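As an illustration of the subexpression-level workaround, the sketch below uses Ruby's inline option groups. The patterns are invented examples; only the (?x: ... ) and (?-i: ... ) group syntax is standard Ruby.

```ruby
# Inline (?x: ... ) keeps free-spacing local to the group, so it survives
# being joined as a plain String.
inline = '(?x: white [-\s]? list )'
combined = Regexp.new([inline, 'blacklist'].join('|'), Regexp::IGNORECASE)

puts combined.match?('White-List') # true

# Inline (?-i: ... ) switches case insensitivity back off inside the group,
# even though IGNORECASE is forced on the whole Regexp.
case_sensitive = Regexp.new(/(?-i:Master)/.source, Regexp::IGNORECASE)

puts case_sensitive.match?('Master') # true
puts case_sensitive.match?('master') # false
```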

TODO

Validate approach
Rename identifiers accordingly
Add more specs covering edge cases and existing behaviour, to ensure the latter is preserved.
Better document use of Regexp in cop documentation (unclear how AllowedRegex works; maybe separate PR?)

Before submitting the PR make sure the following are checked:

The PR relates to only one subject with a clear title and description in grammatically correct, complete sentences.
Wrote good commit messages.
Commit message starts with [Fix #issue-number] (if the related issue exists).
Feature branch is up-to-date with master (if not - rebase it).
Squashed related commits together.
Added tests.
Ran bundle exec rake default. It executes all tests and runs RuboCop on its own code.
Added an entry (file) to the changelog folder named {change_type}_{change_description}.md if the new code introduces user-observable changes. See changelog entry format for details.

sambostock · 2023-05-01T05:26:20Z

lib/rubocop/cop/naming/inclusive_language.rb

-          regex.is_a?(Regexp) ? regex.source : regex
+        def ensure_case_insensitive_regexp(object)
+          case object
+          when Regexp then Regexp.new(object.source, object.options | Regexp::IGNORECASE)


Could optimize to skip creating duplicates

Suggested change

-          when Regexp then Regexp.new(object.source, object.options | Regexp::IGNORECASE)
+          when Regexp
+            return object if object.options == Regexp::IGNORECASE
+
+            Regexp.new(object.source, object.options | Regexp::IGNORECASE)
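One thing to keep in mind with this guard: comparing options for equality only short-circuits when IGNORECASE is the sole flag, so a Regexp such as /.../ix would still be duplicated. A possible variant, sketched here as an untested idea rather than what the PR does, is to ask Regexp#casefold? instead. The method name case_insensitive is hypothetical.

```ruby
# Hypothetical variant of the guard: reuse the Regexp whenever it is already
# case insensitive, whatever other flags it carries.
def case_insensitive(regexp)
  return regexp if regexp.casefold?

  Regexp.new(regexp.source, regexp.options | Regexp::IGNORECASE)
end

already_insensitive = /white [-\s]? list/ix
needs_flag = /white [-\s]? list/x

puts case_insensitive(already_insensitive).equal?(already_insensitive) # true: same object reused
puts case_insensitive(needs_flag).casefold?                            # true: new Regexp with i added
puts already_insensitive.options == Regexp::IGNORECASE                 # false: equality check would miss it
```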

sambostock added 2 commits April 30, 2023 22:06

Add Naming/InclusiveLanguage AllowedRegex specs

814dfd9

There appears not to be coverage for this.

[WIP] Support free-spacing Regexp in Naming/InclusiveLanguage

0ada013

sambostock mentioned this pull request May 1, 2023

Naming/InclusiveLanguage's AllowedRegex does not support free-spacing/extended mode Regexp #11831

Open
