[FEA] Execute UDFs that provide a RAPIDS execution path #1351

Closed
3 tasks done
jlowe opened this issue Dec 9, 2020 · 1 comment
Labels: epic (Issue that encompasses a significant feature or body of work), feature request (New feature or request), P0 (Must have for release), SQL (part of the SQL/Dataframe plugin)

Comments

jlowe commented Dec 9, 2020

Is your feature request related to a problem? Please describe.
Some users may wish to provide a UDF that has both a CPU and a GPU implementation, with the GPU implementation operating directly on RAPIDS cudf ColumnVector instances just as the plugin's operators do. The significant drawback is that the user has to implement the UDF twice, but this approach should provide the best performance when running with the RAPIDS Accelerator.

Describe the solution you'd like
Ideally, the user could simply add an interface to their existing UDF class that provides one or more methods implementing a RAPIDS execution path for the UDF. The plugin would automatically detect that the user's UDF implements the required RAPIDS execution interfaces and run it on the GPU rather than requiring a transition back to the CPU.
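
To make the idea concrete, here is a minimal sketch of the opt-in pattern. It is only an illustration under stated assumptions, not the plugin's API: `Column` is a hypothetical stand-in for a cudf ColumnVector, `ColumnarUdf` is a hypothetical opt-in interface, and `choosePath`/`run` are hypothetical names for the detection the plugin would perform. Everything runs on the CPU here; the point is only the interface check.

```java
import java.util.Arrays;
import java.util.function.Function;

// Sketch only: every type and method name below is hypothetical.
class UdfDispatchDemo {

    // Hypothetical stand-in for a columnar type such as a cudf ColumnVector.
    static final class Column {
        final long[] values;
        Column(long[] values) { this.values = values; }
    }

    // Hypothetical opt-in interface: a UDF adds it to advertise a columnar path.
    interface ColumnarUdf {
        Column evaluateColumnar(Column input);
    }

    // An existing row-at-a-time UDF that opts in by implementing the extra interface.
    static final class PlusOne implements Function<Long, Long>, ColumnarUdf {
        @Override public Long apply(Long x) { return x + 1; }

        @Override public Column evaluateColumnar(Column input) {
            long[] out = new long[input.values.length];
            for (int i = 0; i < out.length; i++) {
                out[i] = input.values[i] + 1;
            }
            return new Column(out);
        }
    }

    // A UDF that only has the row-at-a-time path.
    static final class TimesTwo implements Function<Long, Long> {
        @Override public Long apply(Long x) { return x * 2; }
    }

    // Detection step: pick the columnar path only if the UDF opted in.
    static String choosePath(Function<Long, Long> udf) {
        return (udf instanceof ColumnarUdf) ? "columnar" : "row";
    }

    // Run the UDF over a whole column, falling back to per-row calls.
    static long[] run(Function<Long, Long> udf, long[] input) {
        if (udf instanceof ColumnarUdf) {
            return ((ColumnarUdf) udf).evaluateColumnar(new Column(input)).values;
        }
        long[] out = new long[input.length];
        for (int i = 0; i < input.length; i++) {
            out[i] = udf.apply(input[i]);
        }
        return out;
    }

    public static void main(String[] args) {
        long[] data = {1L, 2L, 3L};
        PlusOne plusOne = new PlusOne();
        TimesTwo timesTwo = new TimesTwo();
        System.out.println(choosePath(plusOne) + " " + Arrays.toString(run(plusOne, data)));
        System.out.println(choosePath(timesTwo) + " " + Arrays.toString(run(timesTwo, data)));
    }
}
```

The existing `apply` method is untouched, so the same class keeps working without the plugin; only UDFs that add the extra interface are routed to the columnar path.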

Ideally the implementation should provide:

Describe alternatives you've considered
We could implement a completely separate, plugin-specific interface for registering UDFs. This is less ideal since it is not as seamless for the user.

jlowe added the feature request, SQL, and P0 labels on Dec 9, 2020
jlowe self-assigned this on Dec 9, 2020
sameerz added the epic label on Dec 17, 2020
sameerz added this to the Jan 4 - Jan 15 milestone on Dec 21, 2020

jlowe commented Jan 13, 2021

Features have been completed, closing this epic.

jlowe closed this as completed on Jan 13, 2021
tgravescs pushed a commit to tgravescs/spark-rapids that referenced this issue Nov 30, 2023