[BUG] text benchmark fails with thrust::system::system_error #10043

randerzander · 2022-01-13T19:51:57Z

Describe the bug
Running the libcudf TEXT_BENCH benchmark results in a thrust system error:

terminate called after throwing an instance of 'thrust::system::system_error'
  what():  copy_if failed to synchronize: cudaErrorIllegalAddress: an illegal memory access was encountered

Steps/Code to reproduce bug
Run the libcudf text benchmark

Expected behavior
The benchmark completes without error.

Environment overview

Environment location: Docker (FROM 22.02-cuda11.5-devel-ubuntu20.04-py3.7)
Method of cuDF install: Docker
After pulling the above nightly Docker image, I built and ran the benchmark.

Environment details
Output of the cudf/print_env.sh script: https://gist.github.com/82542cb3e6c04b7a6b42d482f874400b


Closes #10043

A metadata singleton was allocated from the current rmm memory resource inside the nvtext normalizer functions. If the memory resource is later changed, the metadata pointer may become invalid. This PR removes the singleton pattern.

The normalizer is used by the subword tokenizer, which is passed a vocabulary structure that is built only once and is maintained by the caller. The metadata has been added to this structure so its lifetime shares the same scope.

The normalizer can also be called directly through the `nvtext::normalize_characters` API. Here the metadata table (about 1MB in size) is now created on each call. This showed a significant performance impact only on benchmarks testing a small number (<50K) of shorter strings.

Authors:
- David Wendt (https://github.com/davidwendt)

Approvers:
- Nghia Truong (https://github.com/ttnghia)
- Robert Maynard (https://github.com/robertmaynard)

URL: #10090
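
The lifetime problem described above can be illustrated with a small host-only sketch. This is not the libcudf or rmm code: `toy_resource`, `singleton_metadata`, and `per_call_metadata` are hypothetical names invented for illustration, and a `std::weak_ptr` that expires stands in for a device pointer that would dangle once its memory resource goes away.

```cpp
#include <cstddef>
#include <memory>
#include <vector>

using buffer = std::vector<unsigned char>;

// Hypothetical stand-in for a memory resource: every buffer it hands out
// is released when the resource itself is destroyed.
class toy_resource {
 public:
  std::weak_ptr<buffer> allocate(std::size_t bytes) {
    owned_.push_back(std::make_shared<buffer>(bytes));
    return owned_.back();  // non-owning handle; expires with the resource
  }

 private:
  std::vector<std::shared_ptr<buffer>> owned_;
};

// Singleton pattern (the shape of the bug): the table is allocated from
// whichever resource is passed on the FIRST call and cached forever.
std::weak_ptr<buffer> singleton_metadata(toy_resource& mr) {
  static std::weak_ptr<buffer> cached = mr.allocate(1 << 20);  // ~1MB table
  return cached;
}

// Per-call pattern (the shape of the fix for direct calls): the table is
// always allocated from the resource that is current for this call.
std::weak_ptr<buffer> per_call_metadata(toy_resource& mr) {
  return mr.allocate(1 << 20);
}

// Meaningful only on its first invocation in a process, because the
// function-local static above is initialized exactly once.
bool singleton_goes_stale() {
  auto first = std::make_unique<toy_resource>();
  bool valid_before = !singleton_metadata(*first).expired();
  first.reset();        // the "current" resource is replaced
  toy_resource second;  // new resource; the cache still refers to the old one
  bool stale_after = singleton_metadata(second).expired();
  return valid_before && stale_after;
}

bool per_call_stays_valid() {
  auto first = std::make_unique<toy_resource>();
  per_call_metadata(*first);
  first.reset();
  toy_resource second;
  return !per_call_metadata(second).expired();
}
```

In the sketch, the cached handle outlives the resource it came from, while the per-call version always allocates from a live resource at the cost of rebuilding the table on each call. That is the same trade-off the PR description reports for small inputs.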

randerzander added the bug and Needs Triage labels Jan 13, 2022

randerzander assigned davidwendt Jan 13, 2022

davidwendt mentioned this issue Jan 20, 2022

Remove metadata singleton from nvtext normalizer #10090

Merged

rapids-bot bot closed this as completed in #10090 Jan 26, 2022

bdice removed the Needs Triage label Mar 4, 2024
