You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Describe the bug
We raise Total number of concatenated rows exceeds size_type range instead of total size of output strings is too large for a cudf column when concatenating two large string columns.
We should catch this on the python layer to better inform the user about the error details than just throwing the generic row error.
Steps/Code to reproduce bug
importcudfnum_strings=1_000_000string_scale_f=100s_1=cudf.Series(['very long string '*string_scale_f]*num_strings)
s_2=cudf.Series(['very long string '*string_scale_f]*num_strings)
s_3=cudf.concat([s_1,s_2])
Expected behavior
I expect that concatenating string columns will raise the appropriate error.
Environment overview (please complete the following information)
Describe the bug
We raise
Total number of concatenated rows exceeds size_type range
instead oftotal size of output strings is too large for a cudf column
when concatenating two large string columns.We should catch this on the python layer to better inform the user about the error details than just throwing the generic row error.
Steps/Code to reproduce bug
Expected behavior
I expect that concatenating string columns will raise the appropriate error.
Environment overview (please complete the following information)
Environment details
The text was updated successfully, but these errors were encountered: