-
Notifications
You must be signed in to change notification settings - Fork 917
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[BUG] cuDF str.split performance appears to be orders of magnitude slower than Pandas. #4885
Comments
It seems libcudf is taking most of the time
|
Can you try with the pool allocator? This looks like a situation where allocating 400 columns is likely the bottleneck, but I could be mistaken. |
cc @davidwendt |
Tried with pool allocator, and problem still persists
|
Also, this is a single string (one row) and therefore only a single GPU thread is used to split it. There is no parallelism used here and so this will run much slower than on the CPU. The GPU would likely have an advantage over the CPU here only when 1000s or 10000s strings are being processed in parallel. |
Yea I understand that, but ~47s seems crazy excessive, no? |
My observations are that this is not the case. You can check by simply setting the range argument from 1 to to however many N strings you want. For example, with N=1000, and reducing the number of linear steps to 200 (with 400 it runs much longer): N=10,000, steps=200 |
I see that! Wow. Thanks for the code snippet here Devin. This will be very helpful. I should be able to profile this to see what is going on. |
Describe the bug
A clear and concise description of what the bug is.
cuDF Series str.split performance appears to be orders of magnitude slower than Pandas.
Steps/Code to reproduce bug
Follow this guide http://matthewrocklin.com/blog/work/2018/02/28/minimal-bug-reports to craft a minimal bug report. This helps us reproduce the issue you're having and resolve the issue more quickly.
Expected behavior
A clear and concise description of what you expected to happen.
Performance should be more closely aligned with Pandas.
Environment overview (please complete the following information)
docker pull
&docker run
commands usedBare-metal with Conda environment + Conda based cuDF installation
Environment details
Please run and paste the output of the
cudf/print_env.sh
script here, to gather any other relevant environment detailsAdditional context
Add any other context about the problem here.
The text was updated successfully, but these errors were encountered: