[FEA] Support `set_output` for transformers. #5645

trivialfis · 2023-11-06T21:09:02Z

scikit-learn has a method for transformers to set the output type, similar to what cuml does with the output_type parameter. See https://scikit-learn.org/stable/auto_examples/miscellaneous/plot_set_output.html for details.

The text was updated successfully, but these errors were encountered:

- Implement `OrdinalEncoder`. - Implement dask version. - Fix dask transformers with DataFrame input by using `dask_cudf` to construct return df. Some other scikit-learn features are not available yet, for instance, `encoded_missing_value`, `min_frequency`, and `max_categories`. The implementation is mostly based on the existing one hot encoder and label encoder. I'm a bit confused by the `output_type` parameter and not sure how strictly it's enforced. I looked around, it seems some estimators can ignore this parameter in their returns. Would be great if there's a guideline on how to handle this parameter, along with #5645 . Close #4456 . Authors: - Jiaming Yuan (https://github.com/trivialfis) - Simon Adorf (https://github.com/csadorf) Approvers: - Simon Adorf (https://github.com/csadorf) URL: #5646

trivialfis added feature request New feature or request ? - Needs Triage Need team to review and classify labels Nov 6, 2023

trivialfis mentioned this issue Nov 7, 2023

Basic implementation of OrdinalEncoder. #5646

Merged

beckernick added the cuml-cpu label Nov 19, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[FEA] Support `set_output` for transformers. #5645

[FEA] Support `set_output` for transformers. #5645

trivialfis commented Nov 6, 2023

[FEA] Support set_output for transformers. #5645

[FEA] Support set_output for transformers. #5645

Comments

trivialfis commented Nov 6, 2023

[FEA] Support `set_output` for transformers. #5645

[FEA] Support `set_output` for transformers. #5645