[BUG] cudf::repeat crashes with SIGFPE when count == 0 #13458

jlowe · 2023-05-26T15:02:31Z

Describe the bug
After #13323, calling cudf::repeat with a scalar count of 0 crashes the process with SIGFPE due to a divide by zero. That change added an assertion that divides by count, but count can be zero at that point, so the division faults.

Steps/Code to reproduce bug
Call cudf::repeat with a scalar count of zero.

Expected behavior
The process should not crash and instead an empty table should be returned when count == 0.
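
For illustration, here is a minimal standalone sketch of this failure mode and one way to guard against it. This is not the libcudf code: `repeated_row_count` is a hypothetical helper, and `size_type` is assumed to be a 32-bit signed integer. The idea is to handle the empty cases before any overflow check that divides by count.

```cpp
#include <cstdint>
#include <limits>
#include <stdexcept>

// Assumption: a 32-bit signed row-count type, defined locally for this sketch.
using size_type = std::int32_t;

// Hypothetical helper (not part of libcudf): computes how many rows result
// from repeating `num_rows` rows `count` times.
size_type repeated_row_count(size_type num_rows, size_type count)
{
  if (num_rows < 0 || count < 0) {
    throw std::invalid_argument("num_rows and count must be non-negative");
  }

  // Return early for empty results. Without this guard, the division in the
  // overflow check below would be an integer divide by zero when count == 0,
  // which typically raises SIGFPE on x86.
  if (num_rows == 0 || count == 0) { return 0; }

  // Overflow check: safe to divide because count > 0 here.
  if (num_rows > std::numeric_limits<size_type>::max() / count) {
    throw std::overflow_error("result would exceed the size_type limit");
  }

  return num_rows * count;
}
```

With this ordering, a count of zero yields an empty result instead of reaching the division.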

jlowe · 2023-05-26T15:11:13Z

Looks like subword_tokenize can also do this if max_rows_tensor == 0 which is probably nonsensical but still nice to avoid a nasty SIGFPE. Side note, doesn't look like the max_rows_tensor value is otherwise used? It's passed to the wordpiece_tokenizer constructor which seems to just ignore it.

Removes the `max_rows_tensor` parameter from the `nvtext::subword_tokenize` API since it is no longer required. The parameter was intended to size the temporary working memory for the internal functions. After some general rework it was no longer used, but it was never removed from the API. Also updates the Python/Cython calls, which had been hard-coding a default value anyway. Issue #13458 surfaced this problem.

Authors:
- David Wendt (https://github.com/davidwendt)

Approvers:
- Bradley Dice (https://github.com/bdice)
- Divye Gala (https://github.com/divyegala)
- Vyas Ramasubramani (https://github.com/vyasr)
- Matthew Roeschke (https://github.com/mroeschke)

URL: #13463

jlowe added the bug, Needs Triage, libcudf, and Spark labels May 26, 2023

jlowe mentioned this issue May 26, 2023

[BUG] JVM agent crashed SIGFPE cudf::detail::repeat in integration tests NVIDIA/spark-rapids#8409

Closed

davidwendt self-assigned this May 26, 2023

This was referenced May 26, 2023

Fix cudf::repeat logic when count is zero #13459

Merged

Remove unused max_rows_tensor parameter from subword tokenizer #13463

Merged

GPUtester closed this as completed in 5541e64 May 30, 2023

bdice removed the Needs Triage label Mar 4, 2024
