-
Notifications
You must be signed in to change notification settings - Fork 370
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Significant regression of groupby when threading #2735
Comments
same on
|
I have first benchmarked
So we could leave things as is or add something like |
for |
First - this is a general problem Second - it seems that we do not distribute work among threads correctly. Here is an example on four threads (I have enabled printing which thread was spawned):
as you can see one thread (in this case 2) is overloaded and the distribution of work among threads is not correct. Here is another example (now printing is disabled) - I want to show a skew in the load distribution. Four threads:
the same with 2 threads:
|
One more example showing super strange behavior:
|
Here are timings:
We have a problem with your new macro I think @nalimilan. Maybe it splits the data into too small chunks? This is hard 😞. Can you please look at it as you have implemented this part? Otherwise I can check - please let me know.
The text was updated successfully, but these errors were encountered: