Regression fix: leave R prefixes capitalization alone #2285

ichard26 · 2021-05-30T20:20:56Z

black.strings.get_string_prefix used to lowercase the extracted
prefix before returning it. This is wrong because 1) it ignores the
fact we should leave R prefixes alone because of MagicPython, and 2)
there is dedicated prefix casing handling code that fixes issue 1.
.lower is too naive.

This was originally fixed in 20.8b0, but was reintroduced in 21.4b0.

(Re)fixes GH-1244.

JelleZijlstra · 2021-05-30T20:24:25Z

I think there's code in trans.py that relies on the result of this function being lowercased. See trans.py line 438 for example, where we check for "f" in prefix.

felix-hilden · 2021-06-02T11:23:57Z

Yeah, after #2297 I was wondering why there are essentially two functions for getting docstring prefixes. Looking at their usage, it seems that get_prefix is used more widely just to get information about the prefix, while normalize is used in one place to actually transform the lines. Maybe they should be combined in some way, or separated entirely into a strict getter and a strict transformer. The dependencies on lowercase could very easily be implemented in the calling code.

But there is at least one other place which transforms prefixes, in string merging and splitting.
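To make the "lowercase in the calling code" idea concrete, here is a minimal sketch. It assumes `STRING_PREFIX_CHARS` is `"furbFURB"`, and `is_fstring` is a hypothetical caller invented for illustration, not a function from trans.py:

```python
# Assumption: the set of characters that may appear in a string prefix.
STRING_PREFIX_CHARS = "furbFURB"


def get_string_prefix(string: str) -> str:
    """Return the prefix exactly as written, without changing its case."""
    prefix = ""
    prefix_idx = 0
    # Assumes a valid string literal, so a quote always ends the loop.
    while string[prefix_idx] in STRING_PREFIX_CHARS:
        prefix += string[prefix_idx]
        prefix_idx += 1
    return prefix


def is_fstring(string: str) -> bool:
    """Hypothetical caller: lowercase locally before the membership check."""
    return "f" in get_string_prefix(string).lower()
```

With this split the getter stays a pure getter, and any check like `"f" in prefix` keeps working for `F"..."` because the caller does its own case folding.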

`black.strings.get_string_prefix` used to lowercase the extracted prefix before returning it. This is wrong because 1) it ignores the fact we should leave R prefixes alone because of MagicPython, and 2) there is dedicated prefix casing handling code that fixes issue 1. `.lower` is too naive. This was originally fixed in 20.8b0, but was reintroduced in 21.4b0. I also added proper prefix normalization for docstrings by using the `black.strings.normalize_string_prefix` helper. Some more test strings were added to make sure strings with capitalized prefixes aren't treated differently (this actually happened with my original patch; Jelle had to point it out to me).
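As a rough illustration of why a plain `.lower` is too naive, the sketch below lowercases every prefix character except `R`/`r`, which it leaves as written. `normalize_prefix_sketch` and `PREFIX_RE` are hypothetical names for this example only; this is not the body of `black.strings.normalize_string_prefix`, and it ignores anything else that helper may do:

```python
import re

# Assumption: a prefix is a run of these letters at the start of the literal.
PREFIX_RE = re.compile(r"^([furbFURB]*)(.*)$", re.DOTALL)


def normalize_prefix_sketch(s: str) -> str:
    """Lowercase prefix letters, but keep the case of any R/r untouched."""
    match = PREFIX_RE.match(s)
    assert match is not None, f"failed to match string {s!r}"
    prefix, rest = match.group(1), match.group(2)
    new_prefix = "".join(c if c in "rR" else c.lower() for c in prefix)
    return new_prefix + rest
```

Compare `'RB"x"'.lower()`, which would also turn `R` into `r`, with the sketch, which only touches the `B`.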

felix-hilden · 2021-06-08T13:24:51Z

src/black/strings.py

@@ -87,7 +87,7 @@ def get_string_prefix(string: str) -> str:
    prefix = ""
    prefix_idx = 0
    while string[prefix_idx] in STRING_PREFIX_CHARS:
-        prefix += string[prefix_idx].lower()
+        prefix += string[prefix_idx]


I wonder if this could avoid repeatedly appending to the string, and instead just calculate the index and then slice the whole prefix at once 🤔 Not really a comment on the PR, but it came to my mind anyway.

Sadly min(s.find("'"), s.find('"')) doesn't quite work because no match is returned as -1 😄 and it could be way less efficient
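For reference, the index-then-slice idea could look roughly like the sketch below. It assumes `STRING_PREFIX_CHARS` is `"furbFURB"`; `get_string_prefix_sliced` and `first_quote_index` are hypothetical names, not part of the PR:

```python
# Assumption: the set of characters that may appear in a string prefix.
STRING_PREFIX_CHARS = "furbFURB"


def get_string_prefix_sliced(string: str) -> str:
    """Advance an index past the prefix characters, then slice once."""
    prefix_idx = 0
    while prefix_idx < len(string) and string[prefix_idx] in STRING_PREFIX_CHARS:
        prefix_idx += 1
    return string[:prefix_idx]


def first_quote_index(s: str) -> int:
    """Index of the first quote character, or -1 if there is none.

    Filters out the -1 "not found" results before taking the minimum,
    which is the step the bare min(...) expression is missing.
    """
    hits = [i for i in (s.find("'"), s.find('"')) if i != -1]
    return min(hits) if hits else -1
```

The `find`-based variant still scans the whole string twice, so the simple loop is likely the cheaper option for the short prefixes involved here.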

I'm taking this to mean that you have no nits or objections for this PR :)

Thanks for the review though! You're doing great adjusting to your new role!

felix-hilden · 2021-06-08T13:27:48Z

Apart from the comment above, the changes seem good to me at least. Leaving all processing to calling code seems appropriate in a function called get.

JelleZijlstra · 2021-06-09T00:45:35Z

tests/data/long_strings__regression.py

+
+fstring = (
+    f"f-strings definitely make things more {difficult} than they need to be for"
+    " {black}. But boy they sure are handy. The problem is that some lines will need"


I like this style of test

ichard26 added the F: strings Related to our handling of strings label May 30, 2021

ichard26 marked this pull request as draft May 30, 2021 20:56

ichard26 force-pushed the fix-string-prefix-regression branch from ca7a34d to c2ce5cf Compare June 4, 2021 20:38

ichard26 marked this pull request as ready for review June 4, 2021 20:39

Merge branch 'main' into fix-string-prefix-regression

04e3741

felix-hilden reviewed Jun 8, 2021

JelleZijlstra approved these changes Jun 9, 2021

JelleZijlstra merged commit 00e7e12 into main Jun 9, 2021

ichard26 deleted the fix-string-prefix-regression branch June 9, 2021 00:47
