[FEA] Add Pandas 2.0 support #5466
Comments
We haven't tested things with Pandas 2.0 yet, but in general most things in cuML should work, since all we do with Pandas inputs is transfer them to the GPU. We will do some testing and debug any issues for the current development version (23.08).
@dantegd I'm mostly interested in PyArrow data types support. Employing metadata such as this makes imputation and data handling much easier and more robust.
Hi everyone, I also tried using cuml with pandas 2.0.3. I already get an error when trying to import cuml:
Seems like this is related to the deprecated 'is_extension_type' in pandas. Unfortunately, I'm limited to the (stable) pip version of cuml (23.06), since I only have CUDA 12 available on my server, so I can't tell whether it works with the nightly 23.08a.
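For anyone who wants to confirm whether their installed pandas still exposes that symbol, here is a minimal sketch. It assumes the function lived at `pandas.api.types.is_extension_type`; the helper name `has_symbol` is hypothetical and not part of pandas or cuml. It only checks for the attribute and does not import cuml.

```python
import importlib


def has_symbol(module_name, attr):
    """Return True/False if the module imports, or None if it is not installed."""
    try:
        module = importlib.import_module(module_name)
    except ImportError:
        # The module (for example pandas itself) is not available here.
        return None
    return hasattr(module, attr)


if __name__ == "__main__":
    # Assumed location of the symbol named in the comment above.
    present = has_symbol("pandas.api.types", "is_extension_type")
    print("pandas.api.types.is_extension_type present:", present)
```

If this reports that the symbol is absent, a library that still imports it would be expected to fail at import time, which matches the behaviour described above.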
I ran into this problem again.
@dantegd Any progress in this direction?
@dantegd Hi, I have now hit this problem independently in two separate environments, which suggests it should be a very high priority for you guys. It is likely the primary bottleneck for adoption: most people are on pandas >= 2.0, and most will not want to use the extension if the first thing that happens when they try it is an error.
The primary limitation for cuml supporting pandas 2 is a transitive dependency from cudf. cudf 24.04 will support pandas 2, so cuml should work with pandas 2 as well in the next release. @dantegd, not sure if you want to close this issue now that rapidsai/cudf#14916 is merged or if you want to wait until the 24.04 release.
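As a rough way to act on this in an environment check, the sketch below compares a release string against 24.04. The cutoff is an assumption taken only from the comment above, and the names `release_tuple` and `expects_pandas2_support` are hypothetical helpers, not part of cudf or cuml.

```python
import re

# Assumption from the comment above: 24.04 is the first release line
# expected to work with pandas 2.
FIRST_PANDAS2_RELEASE = (24, 4)


def release_tuple(version):
    """Parse a version such as '23.06' or '23.08a' into (year, month)."""
    match = re.match(r"(\d+)\.(\d+)", version)
    if match is None:
        raise ValueError(f"unrecognized version string: {version!r}")
    return int(match.group(1)), int(match.group(2))


def expects_pandas2_support(version):
    """True if the release is at or after the assumed 24.04 cutoff."""
    return release_tuple(version) >= FIRST_PANDAS2_RELEASE
```

For example, `expects_pandas2_support("23.06")` is false under this assumption, so an environment on that release would need pandas below 2.0.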
Was there any progress? I am experiencing the same issue.
Much appreciated.
Is your feature request related to a problem? Please describe.
A clear and concise description of what the problem is. Ex. I wish I could use cuML to do [...]
Describe the solution you'd like
Pandas v2.0 has been officially released, and it should be supported as well.
Describe alternatives you've considered
cudf is based on PyArrow too, but it doesn't fit the bill, I'm afraid.
Additional context
Add any other context, code examples, or references to existing implementations about the feature request here.