[FEA] Decimal 128 Support: MurMur3 and HashPartitioning support. #3878
Labels
cudf_dependency
An issue or PR with this label depends on a new feature in cudf
P1
Nice to have for release
task
Work required that improves the product but is not user facing
Spark hashes decimal 128 values very differently from decimal 64 and smaller decimals, so the existing Murmur3 hashing in cuDF does not produce results that match Spark for them. Once cuDF is fixed, we need to update the tests and verify that both Murmur3 hashing and hash partitioning work for decimal 128. Note that a number of integration tests in hash aggregate, around distinct counts and similar operations, cannot currently include decimal 128 values because this functionality is missing and the shuffle falls back to the CPU. As part of this work it would be nice to enable those tests, or to file a follow-on issue to enable them.
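For reference, here is a rough sketch of the difference described above, based on my understanding of Spark's CPU behavior rather than copied from its source. It assumes decimals with precision up to 18 are hashed as a single long, while wider decimals are hashed over the minimal big-endian two's-complement bytes of the unscaled value, with trailing bytes mixed in one at a time. The class and method names (`SparkDecimalHashSketch`, `hashDecimal`, `hashBytes`) are hypothetical and exist only for illustration; the seed of 42 is assumed to be Spark's default for Murmur3.

```java
import java.math.BigDecimal;
import java.math.BigInteger;

// Illustrative sketch only: names are hypothetical, not Spark or cuDF APIs.
final class SparkDecimalHashSketch {
    // Assumption: largest precision whose unscaled value fits in a long.
    static final int MAX_LONG_DIGITS = 18;

    static int mixK1(int k1) {
        k1 *= 0xcc9e2d51;
        k1 = Integer.rotateLeft(k1, 15);
        k1 *= 0x1b873593;
        return k1;
    }

    static int mixH1(int h1, int k1) {
        h1 ^= k1;
        h1 = Integer.rotateLeft(h1, 13);
        return h1 * 5 + 0xe6546b64;
    }

    static int fmix(int h1, int length) {
        h1 ^= length;
        h1 ^= h1 >>> 16;
        h1 *= 0x85ebca6b;
        h1 ^= h1 >>> 13;
        h1 *= 0xc2b2ae35;
        h1 ^= h1 >>> 16;
        return h1;
    }

    // Hash a long as two 32-bit words, low word first.
    static int hashLong(long v, int seed) {
        int h1 = mixH1(seed, mixK1((int) v));
        h1 = mixH1(h1, mixK1((int) (v >>> 32)));
        return fmix(h1, 8);
    }

    // Hash a byte array: 4-byte little-endian words first, then each
    // leftover byte on its own (sign-extended), which differs from the
    // standard Murmur3 tail handling.
    static int hashBytes(byte[] b, int seed) {
        int aligned = b.length - b.length % 4;
        int h1 = seed;
        for (int i = 0; i < aligned; i += 4) {
            int word = (b[i] & 0xff) | ((b[i + 1] & 0xff) << 8)
                     | ((b[i + 2] & 0xff) << 16) | ((b[i + 3] & 0xff) << 24);
            h1 = mixH1(h1, mixK1(word));
        }
        for (int i = aligned; i < b.length; i++) {
            h1 = mixH1(h1, mixK1(b[i]));
        }
        return fmix(h1, b.length);
    }

    // Small decimals take the long path; wide ones hash a variable-length
    // big-endian byte array, not a fixed 16-byte value.
    static int hashDecimal(BigDecimal d, int precision, int seed) {
        BigInteger unscaled = d.unscaledValue();
        if (precision <= MAX_LONG_DIGITS) {
            return hashLong(unscaled.longValueExact(), seed);
        }
        return hashBytes(unscaled.toByteArray(), seed);
    }

    public static void main(String[] args) {
        int seed = 42; // assumed default Murmur3 seed in Spark
        BigDecimal small = new BigDecimal("1.00");
        BigDecimal wide = new BigDecimal("12345678901234567890.12");
        System.out.println(hashDecimal(small, 10, seed));
        System.out.println(hashDecimal(small, 38, seed));
        System.out.println(hashDecimal(wide, 38, seed));
    }
}
```

If this matches what Spark does, the same numeric value can hash differently depending on the declared precision, and a decimal 128 hash has to run over a value-dependent number of bytes. A GPU implementation that hashes the fixed 16-byte representation would therefore not line up with the CPU result, which is consistent with the mismatch this issue describes.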