Put the GPU data back on host before processing cache on CPU #2970

Merged: 3 commits, Jul 22, 2021

@razajafri (Collaborator) commented Jul 20, 2021

If a DataType isn't supported by ParquetCachedBatchSerializer on the GPU but the data is already on the GPU, we should copy it back to the host before writing it to the cache; otherwise the plugin will throw an exception.

Fixes #2942

Signed-off-by: Raza Jafri [email protected]
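
For readers following along, the fallback described above can be sketched as a small toy model. This is only an illustration of the control flow, not the plugin's actual code: `Batch`, `GPU_CACHE_SUPPORTED`, `cache_batch`, and the encode helpers are all hypothetical names, and the set of supported types is made up. The only detail taken from the linked issue is the error text.

```python
from dataclasses import dataclass
from typing import List

# Hypothetical: types the cache serializer can encode directly on the GPU.
# The real list lives in the plugin; this set is invented for illustration.
GPU_CACHE_SUPPORTED = {"int", "long", "double", "string"}


@dataclass(frozen=True)
class Batch:
    """Toy stand-in for a columnar batch; tracks only types and location."""
    dtypes: List[str]
    on_gpu: bool

    def copy_to_host(self) -> "Batch":
        # Models a device-to-host copy by returning a host-resident batch.
        return Batch(self.dtypes, on_gpu=False)


def serializer_supports_on_gpu(dtypes: List[str]) -> bool:
    return all(t in GPU_CACHE_SUPPORTED for t in dtypes)


def gpu_encode(batch: Batch) -> str:
    return "gpu:" + ",".join(batch.dtypes)


def cpu_encode(batch: Batch) -> str:
    # The CPU path reads values row by row, so it needs host memory.
    if batch.on_gpu:
        raise RuntimeError("DATA ACCESS MUST BE ON A HOST VECTOR")
    return "cpu:" + ",".join(batch.dtypes)


def cache_batch(batch: Batch) -> str:
    """Pick the encode path, moving data to the host when falling back."""
    if batch.on_gpu and serializer_supports_on_gpu(batch.dtypes):
        return gpu_encode(batch)
    if batch.on_gpu:
        # Other operators produced this batch on the GPU, but the cache
        # serializer cannot handle the type there: copy back to host first.
        batch = batch.copy_to_host()
    return cpu_encode(batch)
```

In this model, `cache_batch(Batch(["array<int>"], on_gpu=True))` takes the CPU path after the host copy, whereas calling `cpu_encode` on the same GPU-resident batch directly raises the error from the linked issue.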

@razajafri (Collaborator, Author) commented

build

@jlowe (Contributor) left a comment

"If DataType isn't supported on the GPU and the data is already on the GPU"

This sentence is confusing. If the data type isn't supported by the GPU, how did it get on the GPU? I assume this is a case where the cache serializer doesn't support the type on the GPU but other operators do.

Adding a test would help explain what's going on and also help avoid regressions.

@razajafri razajafri requested a review from jlowe July 21, 2021 18:19
@jlowe jlowe changed the title Put the gpu data back on host before processing cache on CPU Put the GPU data back on host before processing cache on CPU Jul 21, 2021
@razajafri (Collaborator, Author) commented

build

@sameerz sameerz added the bug Something isn't working label Jul 21, 2021
@sameerz sameerz added this to the July 19 - July 30 milestone Jul 21, 2021
@razajafri razajafri merged commit 8cdf2d9 into NVIDIA:branch-21.08 Jul 22, 2021
@razajafri razajafri deleted the cache-array-crash branch July 22, 2021 00:41

Successfully merging this pull request may close these issues.

[BUG] Cache of Array using ParquetCachedBatchSerializer failed with "DATA ACCESS MUST BE ON A HOST VECTOR"