False positive in redos detection when nested quantifier mutually exclusive #13

mschwager · 2020-01-10T22:37:44Z

"A group that contains a token with a quantifier must not have a quantifier of its own unless the quantified token inside the group can only be matched with something else that is mutually exclusive with it." (Nested Quantifiers)

Dlint does not currently exclude regular expressions whose nested quantifiers are safe because the quantified tokens are mutually exclusive. Consider the example from the page quoted above:

$ python -m dlint.redos -p '(x\w{1,10})+y'
('(x\\w{1,10})+y', True)

Dlint finds the nested quantifier. But it flags the corrected code as well:

$ python -m dlint.redos -p '(x[a-wyz0-9_]{1,10})+y'
('(x[a-wyz0-9_]{1,10})+y', True)

This example is safe because the character class [a-wyz0-9_] excludes x, so the inner quantifier can never consume the x that starts the next repetition of the group. We should fix this false positive.
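
To make the difference concrete, here is a rough sketch that counts how many ways a string can be split into repetitions of the group body. It is only an illustration using the standard `re` module, not how Dlint works internally, and the helper name `count_parses` is made up for this example. More than one way to split means the engine has alternatives to backtrack through.

```python
import re
from functools import lru_cache


def count_parses(group_pattern, text):
    """Count the ways `text` splits into consecutive chunks that each
    fully match `group_pattern` (hypothetical helper, illustration only)."""
    chunk = re.compile(group_pattern)

    @lru_cache(maxsize=None)
    def ways(start):
        # Reaching the end of the text completes one valid split.
        if start == len(text):
            return 1
        # Try every possible end position for the next chunk.
        return sum(
            ways(end)
            for end in range(start + 1, len(text) + 1)
            if chunk.fullmatch(text, start, end)
        )

    return ways(0)


# Overlapping body: \w can also match the leading x of the next repetition.
print(count_parses(r"x\w{1,10}", "xaxaxa"))           # 4
# Mutually exclusive body: the class cannot match x.
print(count_parses(r"x[a-wyz0-9_]{1,10}", "xaxaxa"))  # 1
```

With the overlapping body the count grows quickly as the input gets longer, while the mutually exclusive body never has more than one split.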

remram44 · 2020-11-19T01:40:22Z

I ran into (ab+)+c which I think falls into this category:

$ python -m dlint.redos -p '(ab+)+c'
('(ab+)+c', True)
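
A quick way to sanity-check the "mutually exclusive" claim for a pattern like this is to ask whether the literal that starts the group could also be consumed by the quantified token inside it. The snippet below is a minimal sketch under that assumption; `leading_literal_overlaps` is a hypothetical helper, it only compares one literal with one token, and it is not a general ReDoS analysis.

```python
import re


def leading_literal_overlaps(literal, inner_token):
    """Return True if `literal` could also be matched by `inner_token`
    (hypothetical helper, illustration only)."""
    return re.fullmatch(inner_token, literal) is not None


# (ab+)+c: the inner b+ repeats b, which can never match the leading a.
print(leading_literal_overlaps("a", r"b"))    # False
# (x\w{1,10})+y: \w can match the leading x, so the repetitions overlap.
print(leading_literal_overlaps("x", r"\w"))   # True
```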

My actual regex is ^POLYGON ?\(\([0-9 .]+\)(, ?\([0-9 .]+\))*\)$ (for WKT) which I couldn't find any way to fix.

gsnedders mentioned this issue May 25, 2020

Patch for sanitizer.py needs to also be applied to _vendor/html5lib/filters/sanitizer.py mozilla/bleach#534

Closed
