Fix type preprocessor #8021

keewis · 2020-07-31T15:54:07Z

Follow-up to #7690. While trying to prepare a project I'm working on for the type preprocessor, I noticed a few issues (which was to be expected, I guess):

Ellipsis and ... are not treated like singletons
the Returns, Yields and Raises (and possibly more) fields ignore the napoleon_type_aliases setting (Edit: see Preprocess other sections #8050)
there are more delimiters in use, for example ", or ", ", and ", " and " and " to " (for describing the structure of a dictionary)

and possibly more. I'll find and fix those while working on that project.
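As a rough illustration of the first and third points, the sketch below splits a type description on a handful of delimiters and checks tokens against a list of singleton names. The pattern, the names `DELIMITERS`, `SINGLETONS`, `split_type_spec` and `is_singleton`, and the exact delimiter set are assumptions made for this example; they are not napoleon's actual implementation.

```python
import re

# Hypothetical delimiter pattern, for illustration only. Longer
# alternatives come first so ", or " is not split as ", " plus "or ".
DELIMITERS = re.compile(r"(,\sor\s|,\sand\s|\sor\s|\sand\s|\sof\s|\sto\s|,\s)")

# Names treated as singletons in this sketch, including both spellings
# of the ellipsis object.
SINGLETONS = ("None", "True", "False", "Ellipsis", "...")


def split_type_spec(spec):
    """Split a type description into tokens, keeping the delimiters."""
    # The capturing group makes re.split return the delimiters as well.
    return [token for token in DELIMITERS.split(spec) if token]


def is_singleton(token):
    """Return True if the token names one of the singleton objects."""
    return token in SINGLETONS


print(split_type_spec("dict of str to int"))
print(split_type_spec("int, float, or None"))
print(is_singleton("..."))
```

With " to " among the delimiters, a mapping description such as "dict of str to int" breaks into separate type tokens instead of staying one opaque string.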

tk0miya

LGTM with nits.

BTW, could you merge the HEAD of 3.x into your branch please? It is needed to fix the CI error.

sphinx/ext/napoleon/docstring.py

tk0miya · 2020-08-01T04:28:27Z

@@ -846,8 +848,13 @@ def combine_set(tokens):

 def _tokenize_type_spec(spec: str) -> List[str]:
    def postprocess(item):
-        if item.startswith("default"):
-            return [item[:7], item[7:]]
+        if item.startswith("default") and item != "default":


I noticed this would match "defaultdict" unexpectedly. How about using item.startswith("default ") (append a space after "default") instead?
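The concern can be reproduced in isolation. The two helpers below are hypothetical stand-ins: the first mirrors the condition from the diff and assumes the split itself stays `[item[:7], item[7:]]` as in the removed lines, the second uses the stricter `"default "` prefix proposed in the comment.

```python
def postprocess_prefix(item):
    # Condition taken from the diff; the return value is assumed to be
    # the same split as in the removed lines.
    if item.startswith("default") and item != "default":
        return [item[:7], item[7:]]
    return [item]


def postprocess_with_space(item):
    # Reviewer's suggestion: only split when "default" is followed by a space.
    if item.startswith("default "):
        return [item[:7], item[7:]]
    return [item]


print(postprocess_prefix("defaultdict"))      # splits the type name apart
print(postprocess_with_space("defaultdict"))  # leaves the type name intact
print(postprocess_with_space("default 1"))    # still splits a default clause
```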

oh, yes, I didn't consider that. However, when attempting to add the default syntax to the numpydoc format guide someone reminded me that there are lots of projects with different formats, so right now we're trying to come up with a way to resolve this. I think for now it would be good not to restrict the syntax too much. How about something like

Suggested change

-        if item.startswith("default") and item != "default":
+        if re.match(r"^default[^_0-9a-zA-Z].*$", item):
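A quick way to see what the suggested regex accepts is to run it against a few tokens. The helper name `starts_default_clause` is made up for this sketch; only the pattern itself comes from the suggestion.

```python
import re

# Pattern copied from the suggested change: "default" followed by at
# least one character that cannot continue an identifier.
DEFAULT_PREFIX = re.compile(r"^default[^_0-9a-zA-Z].*$")


def starts_default_clause(item):
    """Return True if the token looks like 'default<delimiter>...'."""
    return DEFAULT_PREFIX.match(item) is not None


for token in ["default 1", "default: 1", "default=1",
              "defaultdict", "default_factory", "default"]:
    print(repr(token), starts_default_clause(token))
```

The first three tokens match, while "defaultdict", "default_factory" and a bare "default" do not, because the character after "default" must be something other than a letter, digit or underscore.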

As I commented on the last PR, we should not enhance numpydoc itself, and we don't need to support dialects of numpydoc in napoleon. I don't object to choosing an unrestricted approach, but that does not mean supporting them all.

if you look at numpy/numpydoc#289, it seems the format for default is not standardized at all, so we're considering mentioning three of them (default x, default: x and default=x) and stating that there might be more. I was thinking of explicitly supporting those three.

There is no reason to support non-standardized notations. Let's see what happens in numpy/numpydoc#289.
It would be better to split this PR into one about "default" and one for the other changes.

fair enough. This might be something that should be discussed as part of that "default" PR, but what do you think about removing : from _token_regex (which I think we shouldn't support if not directly after default) and using something like:

    pattern = re.compile(
        r"^"
        r"(default)"
        r"([^_0-9A-Za-z]+)"
        rf"({_xref_regex}|(?:[_A-Za-z][_A-Za-z0-9]*))?"
        r"$"
    )
    match = pattern.match(item)
    if match is not None:
        return [_ for _ in match.groups() if _ is not None]
    else:
        return [item]

in the postprocessing step? That way we "accidentally" support all the default<delimiter><obj> notations but can still decide to officially support / test only a limited set of notations.
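To make the proposal concrete, here is a self-contained sketch of that postprocessing step. It assumes `_xref_regex` is a plain pattern string (a compiled pattern could not be interpolated into the f-string this way), and the value used below is a simplified stand-in chosen for this example rather than napoleon's own definition.

```python
import re

# Assumption: simplified stand-in for a cross-reference pattern such as
# :class:`Foo`, kept as a string so it can be embedded below.
_xref_regex = r":(?:[a-zA-Z0-9]+[\-_+:.])*[a-zA-Z0-9]+:`.+?`"

pattern = re.compile(
    r"^"
    r"(default)"                      # the keyword itself
    r"([^_0-9A-Za-z]+)"               # any non-identifier delimiter
    rf"({_xref_regex}|(?:[_A-Za-z][_A-Za-z0-9]*))?"  # xref or plain name
    r"$"
)


def postprocess(item):
    """Split 'default<delimiter><obj>' into its parts, else return as is."""
    match = pattern.match(item)
    if match is not None:
        return [group for group in match.groups() if group is not None]
    return [item]


print(postprocess("default: None"))
print(postprocess("default=x"))
print(postprocess("default :class:`Foo`"))
print(postprocess("defaultdict"))
print(postprocess("default 1"))
```

With this stand-in, "default: None" and "default=x" are split into keyword, delimiter and object, and "defaultdict" is left alone. A numeric default such as "default 1" is also left unsplit, since the object part must be a cross-reference or an identifier.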

In my experience, users who depend on such "accidental" features sometimes report a bug when the feature is lost in an unexpected change.

agreed. I guess we should continue this discussion once the numpydoc format guide PR has been merged.

I'll accept the change with pleasure after the numpydoc guide is updated :-)

tk0miya · 2020-08-04T15:19:42Z

Merged. Thank you!

keewis added 6 commits July 29, 2020 02:34

move the misplaced GoogleDocstring test to the appropriate test class

0e5964d

add ... and Ellipsis to the singletons referenced by ":obj:"

56666e4

make the postprocessing a bit more robust and add tests

05bf00b

add more delimiters so describing mappings becomes possible

e39c1a8

add tests for referencing ellipsis objects

31809b3

properly link ... to Ellipsis

b353dfe

tk0miya requested changes Aug 1, 2020

tk0miya added type:bug extensions:napoleon labels Aug 1, 2020

tk0miya added this to the 3.2.0 milestone Aug 1, 2020

keewis added 3 commits August 1, 2020 13:21

skip whitespace only tokens

a09c170

detect ... as a link

5ee6a03

use complex to check for numerical values

ccd24aa

keewis force-pushed the fix-type-preprocessor branch from d03c580 to 74f2028 Compare August 1, 2020 11:21

keewis added 2 commits August 1, 2020 13:25

use a upper-case name for the list of singleton names

02ff1cc

Merge branch '3.x' into fix-type-preprocessor

3ff956c

keewis force-pushed the fix-type-preprocessor branch from 74f2028 to 3ff956c Compare August 1, 2020 11:26

keewis added 5 commits August 1, 2020 13:38

use a regex to decide whether to postprocess a token starting with default

92e9cd4

check that floats and complex numbers are detected as literals

6be806b

only allow "default <obj>" and "default: <obj>" for now

af7d6a5

check that a "default <obj>" notation works with xrefs

fbad78d

make sure strings are not split using other delimiters

47da37e

tk0miya merged commit fcf63a2 into sphinx-doc:3.x Aug 4, 2020

keewis deleted the fix-type-preprocessor branch August 4, 2020 15:20

github-actions bot locked as resolved and limited conversation to collaborators Jul 25, 2021

keewis commented Jul 31, 2020 (edited)

tk0miya left a comment

tk0miya Aug 1, 2020

keewis Aug 1, 2020 (edited)

tk0miya Aug 1, 2020

keewis Aug 1, 2020

tk0miya Aug 2, 2020

keewis Aug 4, 2020 (edited)

tk0miya Aug 4, 2020

keewis Aug 4, 2020

tk0miya Aug 4, 2020

tk0miya commented Aug 4, 2020
