[FEA] Java bindings for drop_duplicates #9115
Labels
feature request
New feature or request
Java
Affects Java cuDF API.
Spark
Functionality that helps Spark RAPIDS
Is your feature request related to a problem? Please describe.
As part of implementing batched full joins in Spark, we need the ability to concatenate join gather maps together and remove any duplicate entries. libcudf has a
drop_duplicates
function but this has not been exposed in the Java APIs.Describe the solution you'd like
The cudf Java bindings should add a
dropDuplicates
method toTable
with the corresponding options for what to keep, null ordering, and null equality.The text was updated successfully, but these errors were encountered: