You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Is your feature request related to a problem? Please describe.
When replacing the pattern \\b (word boundary) with X for the string a\nb, Python and Java produce XaX\nXbX and cuDF produces XXXa\nb.
andygrove
changed the title
[BUG] Support replacing word boundaries in regexp replace in way that is compatible with Python/Java
[FEA] Support replacing word boundaries in regexp replace in way that is compatible with Python/Java
Dec 22, 2021
Closes#9950
Fixes matching a single word-boundary (BOW) regex pattern. This pattern will match word boundaries and not any actual characters. This means the `(begin,end)` position values will be equal. The replace code was always expecting `begin < end` character range to replace. The logic has been updated to allow for this case.
Additional gtests have been added that include a single `\b` pattern character.
Authors:
- David Wendt (https://github.com/davidwendt)
Approvers:
- Bradley Dice (https://github.com/bdice)
- MithunR (https://github.com/mythrocks)
URL: #9997
Is your feature request related to a problem? Please describe.
When replacing the pattern
\\b
(word boundary) withX
for the stringa\nb
, Python and Java produceXaX\nXbX
and cuDF producesXXXa\nb
.Describe the solution you'd like
I would like the ability to match Python/Java behavior in this case.
Describe alternatives you've considered
None
Additional context
None
The text was updated successfully, but these errors were encountered: