You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Is your feature request related to a problem? Please describe. pd.Series.factorize returns codes of int64 dtype's always. cudf.Series.factorize returns code with int8 or any other appropriate type that saves memory usage. This is a feature request to return int64 always only in pandas-compatibility mode to match pandas API behavior.
The text was updated successfully, but these errors were encountered:
…ze` API (#13982)
closes#13981
This PR enables parity with pandas `factorize` API by returning `codes` with `int64` dtype only in pandas-compatibility mode. When the pandas-compatibility mode is turned off, `cudf` will calculate the appropriate dtype that needs to be returned to save memory usage.
Authors:
- GALI PREM SAGAR (https://github.com/galipremsagar)
Approvers:
- Matthew Roeschke (https://github.com/mroeschke)
URL: #13982
Is your feature request related to a problem? Please describe.
pd.Series.factorize
returns codes ofint64
dtype's always.cudf.Series.factorize
returns code withint8
or any other appropriate type that saves memory usage. This is a feature request to returnint64
always only in pandas-compatibility mode to match pandas API behavior.The text was updated successfully, but these errors were encountered: