Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[FEA] Support set_output for transformers. #5645

Open
trivialfis opened this issue Nov 6, 2023 · 0 comments
Open

[FEA] Support set_output for transformers. #5645

trivialfis opened this issue Nov 6, 2023 · 0 comments
Labels
? - Needs Triage Need team to review and classify cuml-cpu feature request New feature or request

Comments

@trivialfis
Copy link
Member

scikit-learn has a method for transformers to set the output type, similar to what cuml does with the output_type parameter. See https://scikit-learn.org/stable/auto_examples/miscellaneous/plot_set_output.html for details.

@trivialfis trivialfis added feature request New feature or request ? - Needs Triage Need team to review and classify labels Nov 6, 2023
rapids-bot bot pushed a commit that referenced this issue Nov 21, 2023
- Implement `OrdinalEncoder`.
- Implement dask version.
- Fix dask transformers with DataFrame input by using `dask_cudf` to construct return df.

Some other scikit-learn features are not available yet, for instance, `encoded_missing_value`, `min_frequency`, and `max_categories`.

The implementation is mostly based on the existing one hot encoder and label encoder.

I'm a bit confused by the `output_type` parameter and not sure how strictly it's enforced. I looked around, it seems some estimators can ignore this parameter in their returns. Would be great if there's a guideline on how to handle this parameter, along with #5645 .

Close #4456 .

Authors:
  - Jiaming Yuan (https://github.com/trivialfis)
  - Simon Adorf (https://github.com/csadorf)

Approvers:
  - Simon Adorf (https://github.com/csadorf)

URL: #5646
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
? - Needs Triage Need team to review and classify cuml-cpu feature request New feature or request
Projects
None yet
Development

No branches or pull requests

2 participants