You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
In Spark, the pattern of regexp_extract isn't required to fully match the input string. Just as what extract means, regexp_extract finds any substring which matches the pattern and extracts them. However, GpuRegExExtract requires a full match on pattern.
Describe the bug
regexp_extract
isn't required to fully match the input string. Just as what extract means, regexp_extract finds any substring which matches the pattern and extracts them. However, GpuRegExExtract requires a full match on pattern.GPU result:
Array([], [], [], [], [], [], [], [], [], [])
CPU result:
Array([a], [a], [a], [a], [a], [a], [a], [a], [a], [a])
CPU run:
regexp_extract('123abcEfg', '([0-9]+)[a-z]+([A-Z])', 0)
=>123abcE
CPU run:
regexp_extract('123abcEfg', '([0-9]+)[a-z]+([A-Z])', 0)
=>123abcEfg
BTW, the issue is originated from #5088.
The text was updated successfully, but these errors were encountered: