Describe the bug
The regular expression [\a-z] is not parsed correctly because we currently only support character ranges with non-escaped characters. The transpiler therefore converts it to [\a\-z], with the hyphen escaped as a literal, when passing it to cuDF, which produces incorrect results.
Steps/Code to reproduce bug
This was found by the new test code in #4504
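As a rough illustration of why the two patterns differ, here is a minimal sketch that uses Python's `re` module as a stand-in. This is an assumption made for illustration only: the issue concerns Java regex semantics on CPU and cuDF on GPU, and neither is exercised here. In Python, as in Java, `\a` inside a character class is the bell character (U+0007), so `[\a-z]` is a range from U+0007 to `z`, while `[\a\-z]` matches only three characters: bell, `-`, and `z`.

```python
import re

# Stand-ins for the two patterns discussed in the issue.
# Assumption: Python's `re` treats \a in a class as the bell character (0x07),
# which mirrors the intended range semantics of the original pattern.
original = re.compile(r"[\a-z]")      # range: 0x07 through 'z'
transpiled = re.compile(r"[\a\-z]")   # three literals: bell, '-', 'z'


def matches(pattern: re.Pattern, text: str) -> bool:
    """Return True if the whole of `text` matches `pattern`."""
    return pattern.fullmatch(text) is not None


# Characters that fall inside the range but are not one of the three literals
# are where the two patterns disagree.
for ch in ["A", "m", "-", "z", "\a"]:
    print(repr(ch), matches(original, ch), matches(transpiled, ch))
```

Any character between U+0007 and `z` other than bell, `-`, and `z` (for example `A` or `m`) matches the original pattern but not the transpiled one.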
Expected behavior
This expression should produce the same results on CPU and GPU.
Additional context
None
Related to this issue: with the newly added support for predefined character classes (e.g. \s, \h, \v), we need to handle the cases where these are nested inside another character class (e.g. [abc\s]).
cuDF doesn't support nested character classes, but we should be able to expand and flatten the nested predefined classes, i.e. [abc\s] becomes [abc \t\n\x0B\f\r].
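The flattening idea could be sketched roughly as follows. This is a hypothetical, simplified Python illustration, not the transpiler's actual code: the function name `flatten_char_class` and the `PREDEFINED` table are made up for this example, only the `\s` expansion quoted above is included, and edge cases such as negated classes or a literal `]` at the start of a class are not handled.

```python
# Hypothetical expansion table. Only \s is listed, using the expansion
# given in the comment above; other classes would need their own entries.
PREDEFINED = {
    "s": r" \t\n\x0B\f\r",
}


def flatten_char_class(pattern: str) -> str:
    """Expand predefined classes that appear inside [...] into their members.

    Escapes outside a character class are left untouched.
    """
    out = []
    in_class = False
    i = 0
    while i < len(pattern):
        ch = pattern[i]
        if ch == "\\" and i + 1 < len(pattern):
            nxt = pattern[i + 1]
            if in_class and nxt in PREDEFINED:
                # Nested predefined class: replace it with its members.
                out.append(PREDEFINED[nxt])
            else:
                # Any other escape is copied through unchanged.
                out.append(ch + nxt)
            i += 2
            continue
        if ch == "[" and not in_class:
            in_class = True
        elif ch == "]" and in_class:
            in_class = False
        out.append(ch)
        i += 1
    return "".join(out)


print(flatten_char_class(r"[abc\s]"))
```

With this sketch, `\s` inside a class is replaced by its members, while a top-level `\s` is passed through as-is.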