Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[FEA] Investigate fast-path for hash joins that bypasses row operators #16026

Open
GregoryKimball opened this issue Jun 13, 2024 · 1 comment
Labels
feature request New feature or request libcudf Affects libcudf (C++/CUDA) code.

Comments

@GregoryKimball
Copy link
Contributor

Is your feature request related to a problem? Please describe.
For join inputs that use a single integer keying column, it's likely we could realize a performance benefit from replacing the hashing row operator with a cuco hasher and replacing the equality row operator with a bitwise comparator.

Since single integer keying columns are common in hash joins, let's test this approach and measure the performance impact.

Describe the solution you'd like
Just testing for now.

Describe alternatives you've considered
Continue to use the libcudf row operators as designed.

Additional context
This idea originally came up during distinct-key join development (#14948), and should apply to all hash-based join algorithms in libcudf.

@GregoryKimball GregoryKimball added feature request New feature or request libcudf Affects libcudf (C++/CUDA) code. labels Jun 13, 2024
@GregoryKimball
Copy link
Contributor Author

#15700 could solve this issue

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
feature request New feature or request libcudf Affects libcudf (C++/CUDA) code.
Projects
None yet
Development

No branches or pull requests

1 participant