Remove workarounds from Catboost incompatibility with string categories #4051
Comments
The string categories problem seems to have been fixed. The fix was added to catboost via catboost/catboost#2096, and we're already on catboost 1.1.1, so we should confirm that the string categories problem we ran into no longer exists and remove the workarounds put in place to avoid it. We should also check whether support for float categories was added, and if so, remove all of the workarounds. Otherwise, just use the string conversion. We should consider #3973 when implementing this, as |
I realized we were on catboost 1.1.1 at the time I made this ticket, so I looked into more specifics. The bugfix seems to have covered data with the Things I've learned
Catboost 1.2.0 was just released with fixes to the string categories bug |
Catboost currently has an incompatibility with columns that have the `category` dtype and `string` categories (catboost/catboost#1965). Because of this, we have two workarounds in place that we should remove whenever this issue is resolved:
- … `_ExtractFeaturesWithTransformPrimitives`, so we convert to `object` dtype (here) and reinitialize woodwork to change the string categories produced by the primitives to `object` categories that catboost can handle.
- … `apply` to do that change, as noted in "Use `.apply` to change categories' dtype in `handle_float_categories_for_catboost`" #3973, which will also be a nicer way to do this.
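For readers unfamiliar with the dtype distinction behind these workarounds, here is a minimal pandas-only sketch of turning `category` columns with `string` categories into `category` columns with `object` categories. It is not the project's actual code: the helper name `string_categories_to_object` is hypothetical, and it rebuilds the categorical directly instead of reinitializing woodwork as described above.

```python
import pandas as pd


def string_categories_to_object(df):
    """Hypothetical helper: give `category` columns plain `object` categories.

    Only columns whose categories use the pandas `string` extension dtype are
    touched; every other column is left unchanged.
    """
    out = df.copy()
    for col in out.columns:
        dtype = out[col].dtype
        if not isinstance(dtype, pd.CategoricalDtype):
            continue
        if not isinstance(dtype.categories.dtype, pd.StringDtype):
            continue
        # Same category labels, but stored in an object-dtype Index.
        object_categories = dtype.categories.astype(object)
        # Reuse the existing integer codes so values and missing entries
        # (code -1) are preserved exactly.
        out[col] = pd.Categorical.from_codes(
            out[col].cat.codes.to_numpy(),
            categories=object_categories,
            ordered=dtype.ordered,
        )
    return out


# Example data: a category column built from the `string` dtype.
df = pd.DataFrame(
    {
        "color": pd.Series(["red", "blue", "red"], dtype="string").astype("category"),
        "n": [1, 2, 3],
    }
)
fixed = string_categories_to_object(df)
print(df["color"].cat.categories.dtype, "->", fixed["color"].cat.categories.dtype)
```

If the upstream fix holds, a helper like this would become unnecessary, which is what this issue asks us to verify before deleting the workarounds.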