- Implement `OrdinalEncoder`.
- Implement dask version.
- Fix dask transformers with DataFrame input by using `dask_cudf` to construct return df.
Some other scikit-learn features are not available yet, for instance, `encoded_missing_value`, `min_frequency`, and `max_categories`.
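For readers unfamiliar with the transform itself, here is a minimal pure-Python sketch of what ordinal encoding does: each column's categories are sorted and replaced by their integer position. `ToyOrdinalEncoder` is a hypothetical name used only for illustration. It is not the cuML implementation and it omits the scikit-learn options listed above.

```python
class ToyOrdinalEncoder:
    """Illustrative ordinal encoder (hypothetical, not the cuML class)."""

    def fit(self, rows):
        # Collect the sorted unique categories of every column.
        n_features = len(rows[0])
        self.categories_ = [
            sorted({row[j] for row in rows}) for j in range(n_features)
        ]
        # Map each category to its position in the sorted list.
        self._lookup = [
            {cat: code for code, cat in enumerate(cats)}
            for cats in self.categories_
        ]
        return self

    def transform(self, rows):
        # Categories not seen during fit raise KeyError in this sketch.
        return [
            [float(self._lookup[j][value]) for j, value in enumerate(row)]
            for row in rows
        ]


rows = [["red", "S"], ["green", "M"], ["red", "L"]]
enc = ToyOrdinalEncoder().fit(rows)
encoded = enc.transform(rows)
print(enc.categories_)  # [['green', 'red'], ['L', 'M', 'S']]
print(encoded)          # [[1.0, 2.0], [0.0, 1.0], [1.0, 0.0]]
```

Options such as `min_frequency` and `max_categories` would change how the per-column category list is built, which is why they are separate features on top of this basic mapping.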
The implementation is mostly based on the existing one hot encoder and label encoder.
I'm a bit confused by the `output_type` parameter and not sure how strictly it's enforced. From looking around, it seems some estimators ignore this parameter in their return values. It would be great to have a guideline on how to handle this parameter, along with #5645.
Close #4456.
Authors:
- Jiaming Yuan (https://github.com/trivialfis)
- Simon Adorf (https://github.com/csadorf)
Approvers:
- Simon Adorf (https://github.com/csadorf)
URL: #5646
scikit-learn has a method for transformers to set the output type, similar to what `cuml` does with the `output_type` parameter. See https://scikit-learn.org/stable/auto_examples/miscellaneous/plot_set_output.html for details.