You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Describe the bug
Running the libcudf TEXT_BENCH benchmark results in a thrust system error:
terminate called after throwing an instance of 'thrust::system::system_error'
what(): copy_if failed to synchronize: cudaErrorIllegalAddress: an illegal memory access was encountered
Steps/Code to reproduce bug
Run the libcudf text benchmark
Expected behavior
The benchmark completes without error condition
Environment overview (please complete the following information)
Environment location: [Bare-metal, Docker, Cloud(specify cloud provider)]
Docker FROM 22.02-cuda11.5-devel-ubuntu20.04-py3.7
Method of cuDF install: Docker
If method of install is [Docker], provide docker pull & docker run commands used
After pulling the above nightly Docker image, I build & run the benchmark
Closes#10043
A metadata singleton was allocated from the current rmm memory-resource inside the nvtext normalizer functions. If the memory resource is later changed, the metadata pointer may become invalidated. This PR removes the singleton pattern.
The normalizer is used by the subword-tokenizer which is passed a vocabulary structure that is built only once and is maintained by the caller. The metadata has been added to this structure so it's lifetime can share the same scope.
The normalizer can also be called directly through the `nvtext::normalize_characters` API. Here the metadata table (size about 1MB) is now created on each call. This showed only significant performance impact on benchmarks testing a small number (<50K) of shorter strings.
Authors:
- David Wendt (https://github.com/davidwendt)
Approvers:
- Nghia Truong (https://github.com/ttnghia)
- Robert Maynard (https://github.com/robertmaynard)
URL: #10090
Describe the bug
Running the libcudf TEXT_BENCH benchmark results in a thrust system error:
Steps/Code to reproduce bug
Run the libcudf text benchmark
Expected behavior
The benchmark completes without error condition
Environment overview (please complete the following information)
Docker FROM 22.02-cuda11.5-devel-ubuntu20.04-py3.7
docker pull
&docker run
commands usedAfter pulling the above nightly Docker image, I build & run the benchmark
Environment details
Please run and paste the output of the
cudf/print_env.sh
script here, to gather any other relevant environment detailshttps://gist.github.com/82542cb3e6c04b7a6b42d482f874400b
Additional context
Add any other context about the problem here.
The text was updated successfully, but these errors were encountered: