-
Notifications
You must be signed in to change notification settings - Fork 5
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Feedback on tidyup 006: Ordering of dplyr::group_by()
#21
Comments
Some thoughts:
Overall, I think this makes |
Overall, I think this seems like a great improvement, with minor drawbacks as @mgirlich mentioned. My two reactions are as follows:
|
@mine-cetinkaya-rundel yea I'm definitely thinking it will either go under Details or get it's own sub-section header like "Row Ordering", depending on how much we'd like to say about it. I agree that we wouldn't focus too much on it to avoid potentially confusing first time users, and I probably would not even include an example in the Examples section of the help page. |
Thanks all for your feedback! We will move forward with the proposed changes, and I will finalize the tidyup shortly. |
We’d love to get your thoughts on https://github.com/tidyverse/tidyups/blob/main/006-dplyr-group-by-ordering.md, a proposal to swap out the internal algorithm used by
group_by()
with one that is often more performant.The main potential issue with this adjustment is that we would be switching to using the C locale when ordering character grouping columns, where previously we used the system locale (through
order()
). As a reminder,group_by()
internally orders the group keys, so that the nextsummarize()
returns sorted group keys along with the computed summary columns. A result of the change proposed in this tidyup is that the order of those group keys would be different than before, since they would no longer respect the system locale.Please feel free to contribute however you feel comfortable — you're welcome to make small fixes via PR, comment here, or open bigger discussion topics in an new issue. If there are things you’d prefer to discuss in private, please feel free to email me. I’ll plan to close the discussion on September 15, so we can review and make adjustments as needed.
@markfairbanks, @mgirlich, @eutwt, @dgrtwo, @mine-cetinkaya-rundel
The text was updated successfully, but these errors were encountered: