-
Notifications
You must be signed in to change notification settings - Fork 917
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[FEA] .describe()
after DataFrameGroupBy
#7990
Comments
Looks like this is the equivalent of running |
Thank you for your reply! And I think these could satisfy my needs. I will close this issue. |
Reopening this as we should still support this feature. Glad to hear there's a nice workaround though! |
This issue has been labeled |
This is being actively worked on here: #8179 |
This PR implements functionality to generate summary statistics for ` Dataframe.groupby() ` operation via `.describe() ` method, similar to Pandas. ``` >>> import pandas as pd >>> pdf = pd.DataFrame({"Speed": [380.0, 370.0, 24.0, 26.0], "Score": [50, 30, 90, 80]}) >>> pdf Speed Score 0 380.0 50 1 370.0 30 2 24.0 90 3 26.0 80 >>> pdf.groupby('Score').describe() Speed count mean std min 25% 50% 75% max Score 30 1.0 370.0 NaN 370.0 370.0 370.0 370.0 370.0 50 1.0 380.0 NaN 380.0 380.0 380.0 380.0 380.0 80 1.0 26.0 NaN 26.0 26.0 26.0 26.0 26.0 90 1.0 24.0 NaN 24.0 24.0 24.0 24.0 24.0 >>> import cudf >>> gdf = cudf.from_pandas(pdf) >>> gdf.groupby('Score').describe() count mean std min 25% 50% 75% max Score 30 1 370.0 <NA> 370.0 370.0 370.0 370.0 370.0 50 1 380.0 <NA> 380.0 380.0 380.0 380.0 380.0 80 1 26.0 <NA> 26.0 26.0 26.0 26.0 26.0 90 1 24.0 <NA> 24.0 24.0 24.0 24.0 24.0 ``` Fixes: #7990 Authors: - Sheilah Kirui (https://github.com/skirui-source) Approvers: - GALI PREM SAGAR (https://github.com/galipremsagar) - Ashwin Srinath (https://github.com/shwina) - Michael Wang (https://github.com/isVoid) - Christopher Harris (https://github.com/cwharris) URL: #8179
Is your feature request related to a problem? Please describe.
if I want to get
describe()
information after a.groupby()
operation. But I got:Describe the solution you'd like
When I execute this:
I would like to get results the same as
pandas
.Thank you very much! Feel free to make any comment about this issue.
The text was updated successfully, but these errors were encountered: