Enable codes dtype parity in pandas-compatibility mode for factorize API #13982

Merged: 1 commit, Aug 28, 2023

Conversation

galipremsagar
Contributor

Description

closes #13981

This PR brings the `factorize` API in line with pandas by returning codes with `int64` dtype, but only when pandas-compatibility mode is enabled. When pandas-compatibility mode is turned off, cudf instead computes an appropriate dtype for the returned codes in order to reduce memory usage.
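To make the two behaviours concrete, here is a minimal pure-Python sketch. It is not cudf's implementation: the helper names `codes_dtype` and `factorize` are hypothetical, and the narrowing rule (pick the narrowest signed integer width that can hold the largest code) is an assumption, since the description only says cudf computes an "appropriate" dtype. NA handling is omitted.

```python
def codes_dtype(num_uniques: int, pandas_compatible: bool) -> str:
    """Return the name of an integer dtype for factorize codes (illustrative only)."""
    if num_uniques < 0:
        raise ValueError("num_uniques must be non-negative")
    if pandas_compatible:
        # pandas parity: codes are always int64, regardless of cardinality.
        return "int64"
    # Assumed rule: narrowest signed width whose maximum can hold the
    # largest code (num_uniques - 1). A signed type leaves -1 free as a sentinel.
    largest_code = max(num_uniques - 1, 0)
    for bits in (8, 16, 32, 64):
        if largest_code <= 2 ** (bits - 1) - 1:
            return f"int{bits}"
    raise OverflowError("too many unique values for a 64-bit code")


def factorize(values, pandas_compatible=False):
    """Toy factorize: codes in order of first appearance, the uniques, and a dtype name."""
    index = {}
    codes = []
    for value in values:
        # Assign the next free code the first time a value is seen.
        codes.append(index.setdefault(value, len(index)))
    uniques = list(index)
    return codes, uniques, codes_dtype(len(uniques), pandas_compatible)


if __name__ == "__main__":
    data = ["b", "a", "b", "c"]
    print(factorize(data, pandas_compatible=True))
    print(factorize(data, pandas_compatible=False))
```

With only three distinct values, the sketch reports a narrow dtype when compatibility mode is off and `int64` when it is on, which is the trade-off the PR describes: pandas parity on one side, lower memory use on the other.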

Checklist

  • I am familiar with the Contributing Guidelines.
  • New or existing tests cover these changes.
  • The documentation is up to date with these changes.

@galipremsagar added the labels 3 - Ready for Review, Python, 4 - Needs cuDF (Python) Reviewer, improvement, and non-breaking on Aug 28, 2023
@galipremsagar self-assigned this on Aug 28, 2023
@galipremsagar requested a review from a team as a code owner on August 28, 2023 06:10
@galipremsagar added the label 5 - Ready to Merge and removed the labels 3 - Ready for Review and 4 - Needs cuDF (Python) Reviewer on Aug 28, 2023
@galipremsagar
Contributor Author

/merge

@rapids-bot merged commit f9e35c7 into rapidsai:branch-23.10 on Aug 28, 2023
Labels
5 - Ready to Merge, improvement, non-breaking, Python
Development

Successfully merging this pull request may close these issues.

[FEA] Parity between codes returned by factorize API with pandas
2 participants