-
Notifications
You must be signed in to change notification settings - Fork 917
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
describe
casts all values to str
for several types
#9900
Comments
Quoting @vyasr from #9867 (comment):
|
This issue has been labeled |
This issue has been labeled |
The
describe
implementations, especially for datetime and timedelta types, appear to be casting all the values tostr
(aside from the numeric implementation). This does not align with Pandas behavior:I recognize there is an issue here with different types, namely that
count
andfreq
are not of the same type asmean
,min
, percentiles, ormax
. This also affects numerical columns which will upcast integer values likecount
to floating types.Some options to resolve this (and their downsides):
str
(results are on GPU ...but data is not usable asstr
type).pd.DataFrame
ordict
that can have multiple types (not a GPU DataFrame).I propose changing behavior to adopt option (2), and return a
pd.DataFrame
. The summary doesn't really need to be a GPU DataFrame since it contains so few values. (Do we have precedent for this kind of behavior returning a CPU (Pandas) DataFrame?)Originally posted by @bdice in #9867 (comment)
The text was updated successfully, but these errors were encountered: