You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This is because we currently have our small str optimization set to kick in at 16 bytes (which is how long the strings are in the multi-col str groupby) but this is the max number of bits we can have before we hit the totalDigits > 8 case, where we hash everything anyway. So these are all resulting in extra processing just to do what we were already doing
The text was updated successfully, but these errors were encountered:
This PR (closesBears-R-Us#2886) caps strs for the small str optimization at 8 bytes since there was a drop off in our str groupby benchmarks
This is because we currently have our small str optimization set to kick in at 16 bytes (which is how long the strings are in the multi-col str groupby benchmark) but this is the max number of bits we can have before we hit the totalDigits > 8 case, where we hash everything anyway. So these are all resulting in extra processing just to do what we were already doing
* Closes#2886: Cap small strs groupby optimization
This PR (closes#2886) caps strs for the small str optimization at 8 bytes since there was a drop off in our str groupby benchmarks
This is because we currently have our small str optimization set to kick in at 16 bytes (which is how long the strings are in the multi-col str groupby benchmark) but this is the max number of bits we can have before we hit the totalDigits > 8 case, where we hash everything anyway. So these are all resulting in extra processing just to do what we were already doing
* putting back in 1 col shortcut
---------
Co-authored-by: Pierce Hayes <[email protected]>
cap strs for the small str optimization at 8 bytes. There was a drop off in our str groupby benchmarks
This is because we currently have our small str optimization set to kick in at 16 bytes (which is how long the strings are in the multi-col str groupby) but this is the max number of bits we can have before we hit the
totalDigits > 8
case, where we hash everything anyway. So these are all resulting in extra processing just to do what we were already doingThe text was updated successfully, but these errors were encountered: