-
Notifications
You must be signed in to change notification settings - Fork 540
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Support CPU object for
train_test_split
(#5873)
This PR adds support to CPU objects for `train_test_split`, leveraging the input conversion tools defined in `input_utils.py`. This PR also adds `output_to_df_obj_like` API that converts CumlArray back to a series/dataframe, matching metadata from input. In the meantime, this PR reimplements majority of `train_test_split` by centralizing indices compute and gather. This reduces the number of kernels launched, especially in the cases where stratify keys are provided. Closes #5619 Authors: - Michael Wang (https://github.com/isVoid) Approvers: - Dante Gama Dessavre (https://github.com/dantegd) URL: #5873
- Loading branch information
Showing
3 changed files
with
227 additions
and
254 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.