-
Notifications
You must be signed in to change notification settings - Fork 919
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[FEA] need to support cudf.DataFrame([[‘a’,‘b’]]).corr() when set ‘method=spearman’, we only support ‘method=pearson’ so far #6804
Comments
This would be a welcome addition. This could be implemented (in cuDF Python) by combining Would you be interested in contributing this? |
love to .... but need your guidance ;) @beckernick |
Pearson correlation is currently the implementation of correlation in cuDF Python currently. Spearman correlation is defined mathematically as the pearson correlation of the ranks of the columns. Rank is implemented currently. This means we have everything needed in cuDF Python. There are several ways to do this. One way would be to use a branching statement inside The dataframe implementation would need to be tweaked less, as we use CuPy's pearson correlation for efficiency. So you'd need to pass cudf/python/cudf/cudf/core/column/numerical.py Lines 330 to 339 in de5577c
cudf/python/cudf/cudf/core/dataframe.py Lines 6789 to 6795 in de5577c
|
…rame-add-spearman [REVIEW]Add 'spearman' correlation matrix in dataframe.py rapidsai#6804
This issue has been marked stale due to no recent activity in the past 30d. Please close this issue if no further response or action is needed. Otherwise, please respond with a comment indicating any updates or changes to the original issue and/or confirm this issue still needs to be addressed. This issue will be marked rotten if there is no activity in the next 60d. |
need more time to finish, holiday season in China ;) |
Closes #6804 Adds 'spearman' correlation method for `dataframe.corr` Authors: - https://github.com/dominicshanshan - Vyas Ramasubramani (https://github.com/vyasr) - Michael Wang (https://github.com/isVoid) Approvers: - Michael Wang (https://github.com/isVoid) - Nick Becker (https://github.com/beckernick) - https://github.com/brandon-b-miller URL: #7141
Is your feature request related to a problem? Please describe.
from time series analysis, need to support spearman correlation matrix calculation in cuDF
Describe the solution you'd like
similar like pandas.DataFrame().corr(method='spearman')
Additional context
is it possible to let me know this feature adding roadmap? big thanks !
The text was updated successfully, but these errors were encountered: