Commit
Change treatment of generic column type `object` (#1415)
* Rename `auto-sklearn/autosklearn/pipeline/components/data_preprocessing/feature_reduction` to `auto-sklearn/autosklearn/pipeline/components/data_preprocessing/text_feature_reduction`. Also rename the corresponding feature reduction class `FeatureReduction` to `TextFeatureReduction` (`auto-sklearn/autosklearn/pipeline/components/data_preprocessing/text_feature_reduction/truncated_svd.py:TextFeatureReduction`). This includes adapting all `*csv` and `*json` files participating in metalearning. The "real" changes are limited to:
  1. truncated_svd.py
  2. feature_type_text.py
* Change treatment of generic column dtype `object` for pandas dataframes. The `object` type will be treated as `string` in the future.
* Add new test case to `test_feature_validator.py`.
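
For illustration only, here is a minimal sketch of what treating `object` columns as `string` could look like in plain pandas. The helper name `object_columns_as_string` and the example column names are hypothetical; this is not the feature validator code touched by this commit, only an assumption about the general idea.

```python
import pandas as pd


def object_columns_as_string(df: pd.DataFrame) -> pd.DataFrame:
    """Hypothetical helper: return a copy of `df` in which every column
    with the generic `object` dtype is cast to the pandas `string` dtype."""
    out = df.copy()
    for col in out.columns:
        # Only the generic object dtype is converted; numeric, categorical
        # and already-typed string columns are left untouched.
        if pd.api.types.is_object_dtype(out[col]):
            out[col] = out[col].astype("string")
    return out


# Example frame: one explicitly object-typed text column, one float column.
df = pd.DataFrame(
    {
        "review": pd.Series(["good", "bad", None], dtype=object),
        "score": [1.0, 0.5, 0.2],
    }
)

converted = object_columns_as_string(df)
print(converted.dtypes)
```

The input frame is copied rather than modified in place, so the caller's `object` columns stay as they were; missing values in the converted column become `pd.NA`.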