Describe the bug
A recent change in the cudf regex parser, which returns an error for invalid quantifiers (commit 575e944f777df0b904be0a8f1a9722ded0c045a6), now rejects some patterns that passed earlier and that seem reasonable to accept in the first place.
Steps/Code to reproduce bug
The following tests fail:
C++ exception with description "cuDF failure at: /home/kuhus/Reps/cudf/cpp/src/strings/regex/regcomp.cpp:504: invalid regex pattern: nothing to repeat at position 6" thrown in the test body.
and,
C++ exception with description "cuDF failure at: /home/kuhus/Reps/cudf/cpp/src/strings/regex/regcomp.cpp:504: invalid regex pattern: nothing to repeat at position 3" thrown in the test body.
respectively.
Expected behavior
The tests above passed before the change.
Environment overview (please complete the following information)
cudf 0.14 on Ubuntu 18.04
Another example with the same error and conditions:
>>> s = cudf.Series(["0.0.0.0", "5.79.97.178"])
>>> reg=r"^(100\.(6[4-9]|[7-9][0-9]|1([0-1][0-9]|2[0-7]))\.([0-9]|[1-9][0-9]|1([0-9][0-9])|2([0-4][0-9]|5[0-5]))\.([0-9]|[1-9][0-9]|1([0-9][0-9])|2([0-4][0-9]|5[0-5]))$)"
>>> s.str.match(reg)
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "/conda/envs/cudf/lib/python3.7/site-packages/cudf/core/column/string.py", line 1736, in match
    cpp_match_re(self._column, pat), **kwargs
  File "cudf/_libxx/strings/contains.pyx", line 67, in cudf._libxx.strings.contains.match_re
RuntimeError: cuDF failure at: /cudf/cpp/src/strings/regex/regcomp.cpp:504: invalid regex pattern: nothing to repeat at position 80
Also, I was able to reproduce with the minimal pattern `r"(9)|2"`, so it looks like the ")" plus quantifier is causing the problem.
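As a sanity check that these patterns are well-formed, here is a minimal sketch that compiles them with Python's built-in `re` module. This only shows what a reference regex engine accepts; it does not exercise cudf, and the helper name `compiles` is made up for illustration.

```python
import re

# Patterns taken from the report above.
MINIMAL = r"(9)|2"
IP_RANGE = (
    r"^(100\.(6[4-9]|[7-9][0-9]|1([0-1][0-9]|2[0-7]))"
    r"\.([0-9]|[1-9][0-9]|1([0-9][0-9])|2([0-4][0-9]|5[0-5]))"
    r"\.([0-9]|[1-9][0-9]|1([0-9][0-9])|2([0-4][0-9]|5[0-5]))$)"
)


def compiles(pattern):
    """Return True if Python's `re` accepts the pattern (hypothetical helper)."""
    try:
        re.compile(pattern)
        return True
    except re.error:
        return False


# Both reported patterns are accepted by Python's engine.
print(compiles(MINIMAL))
print(compiles(IP_RANGE))

# For contrast, a pattern that really has nothing to repeat is rejected.
print(compiles(r"*a"))
```

If the patterns compile here but fail in cudf with "nothing to repeat", that points at the new quantifier check rather than at the patterns themselves.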