Release 0.14.0 #1247

mfeurer · 2021-09-14T12:52:40Z

No description provided.

* Fix determinism for few preprocessing techniques * Code formatting * Fix: model deserialization when mutual_info score is used for select_percentile/rates Co-authored-by: Rohit Agarwal <[email protected]>

…ocessing and feature type split (#977) * making data preprocessing step configurable with two options no preprocessing and feature type split * Fix: execution fails when data_preprocessor is no_preprocesing * Incorporating review comments * Fixing test cases; updating metalearning with updated hyperparameters * Fixing examples * Updating portfolios with new config * Incorporated review comments and fix test case * Test fixes * Test fixes * Fix metalearning config * Remove unused imports * Fix test cases * Fix test cases and examples * Adding more checks for include and exclude params * Fix flake error * Fix flake error * Handling target_type in datatset_properties * Fixes * Fixes * Fix error * Fix test cases * Adding datatype annotations * Fix test cases * Fix build * Fix test case' * Update stale.yaml * Fix annotation type * Update portfolios with new config Co-authored-by: Rohit Agarwal <[email protected]>

The function `def get_runhistory_models_performance(automl):` didn't use the parameter `automl` and instead used the global variable `cls`. This PR fixes that.

* Make backend deletion robust after fit * [FIX] smbo unittest * Output folder is needed for ensemble output * fit_ensemble also runs in manual spawning * fit_ensemble also runs in manual spawning * Rebase artifacts * Fix rebase * Missed arg in examples * Remove rebase artifacts Co-authored-by: Eddie Bergman <[email protected]>

…18.04 to 20.04 (#1198) * Updated docker distribution environment * Upgraded pytest workflow to use 20.04 * Updated values for 20.04

* Fix determinism for few preprocessing techniques * Code formatting * Fix: model deserialization when mutual_info score is used for select_percentile/rates Co-authored-by: Rohit Agarwal <[email protected]>

…ocessing and feature type split (#977) * making data preprocessing step configurable with two options no preprocessing and feature type split * Fix: execution fails when data_preprocessor is no_preprocesing * Incorporating review comments * Fixing test cases; updating metalearning with updated hyperparameters * Fixing examples * Updating portfolios with new config * Incorporated review comments and fix test case * Test fixes * Test fixes * Fix metalearning config * Remove unused imports * Fix test cases * Fix test cases and examples * Adding more checks for include and exclude params * Fix flake error * Fix flake error * Handling target_type in datatset_properties * Fixes * Fixes * Fix error * Fix test cases * Adding datatype annotations * Fix test cases * Fix build * Fix test case' * Update stale.yaml * Fix annotation type * Update portfolios with new config Co-authored-by: Rohit Agarwal <[email protected]>

The function `def get_runhistory_models_performance(automl):` didn't use the parameter `automl` and instead used the global variable `cls`. This PR fixes that.

* Make backend deletion robust after fit * [FIX] smbo unittest * Output folder is needed for ensemble output * fit_ensemble also runs in manual spawning * fit_ensemble also runs in manual spawning * Rebase artifacts * Fix rebase * Missed arg in examples * Remove rebase artifacts Co-authored-by: Eddie Bergman <[email protected]>

…18.04 to 20.04 (#1198) * Updated docker distribution environment * Upgraded pytest workflow to use 20.04 * Updated values for 20.04

…elopment

* Added doc and made function more legible * Removed a rogue 'self' in abstract method * Cleaned up _model_predict, now gives errors instead of warning * Added test to catch issue 1169 and possibly others * doc fix * typo fix in docs * Added expected output shape to the test * flake8'd * Requested changes * Readded logger with simplified context manager * Mypy and flake8 fixes

* Moved fit validation to AutoML fit and fit_pipeline * Added test for target and feature types * Added convert if sparse throughout * type fixes, func doc * flake8'd * Added docstring to AutoML.fit * flake8'd

* Fix and test for stratification of unique labels * Made sure number of samples if respected * Flake8'd * review changes, TODO -> Note, remove warning.call_count assertion * Fix sparse y (#1213) * Moved fit validation to AutoML fit and fit_pipeline * Added test for target and feature types * Added convert if sparse throughout * type fixes, func doc * flake8'd * Added docstring to AutoML.fit * flake8'd * flake8'd after merge * Fix and test for stratification of unique labels * Made sure number of samples if respected * Flake8'd * review changes, TODO -> Note, remove warning.call_count assertion * flake8'd after merge * removed dupe test

…1229) * Cleaned up some code in the documentation

* added performance over time plot as attribute

* m * csvs * util file * Added .gitattributes: * Added generate-baselines * update * Fixed branch envvar * Removed excess path part * Fix branch extract * Typo fix * Typo fixes * branch extract? * filename fix * path fix * path fix * fix tadodedoo * 1 step closer to going home * sigh...typo * regression workflow stuff * Updated to new flow * switched to baseline off development * Fix yaml * fix again ... * first sigh * Event fixes, second sigh * third sigh * Finding issues * message * narrow down? * maybe multiple jobs? * create-comment issue? * Now? * Fixed maybe * UPdated to seek from development branch * Yah * Fix branch extraction * fix branch extraction * m * Fixed branch extraction * Someday * Moved back to 1 dataset for testing * updated path * sigh typo fix * Typo fix * typo typo typo * reorganized faster steps to top * new branch9 * Dependancies and update util file * ... * soon * new branch12 * new branch13 * *sad song lyrics* * new branch14 * sigh * . * reorder * filenaming fix * . * Fixed extract branch, changed to https clone * -- * typo fix * fixes * Fixes * Fxi * update * Update pip * soon * more fixes * Fix 678 * Almost there * new branch33 * Some fixes and convience in artifacts * view content changes * Fix path names * . * ... * fix paths * path fixes * More fixes * Fix output * Re-included full tests * Cleanup * cleanup old test file * Deleted old test files * Changed param types * Updates * Update to use system python * test method of passing python version to setup python * Update all of them * Grab just numeric part of version

* Fixed module idempotent tests to match sklearns descriptions * Added pytest.ini so examples with "test" in name don't run * random_state now documented and used properly * flake8'd * Removed stray print statement * Fixed failing tests * Added testing for pipeline steps random_states * Added random_state to Pipelines that have fit called * flake8'd * Made mlp tests less sensitive to platform differences * review changes

…from 45m) (#1239) * Update total test time out to be 60min * Update total test time out to be 60min

* update * Update doc/installation.rst Co-authored-by: Matthias Feurer <[email protected]> * Update doc/installation.rst Co-authored-by: Matthias Feurer <[email protected]> Co-authored-by: Matthias Feurer <[email protected]>

to show that we're on the development branch.

* Fix rare edge case with extremely inbalanced data For dataset 360112 Auto-sklearn would fail because the data would first be sub-sampled and then contain some classes only once. In the internal splitting, the StratifiedShuffleSplit would not be able to split the dataset into train and valid, and would resort to only a ShuffleSplit. This could put the single sample for a class into the test set. At predict time we would then miss one class. This commit creates two new splitters which move a sample from the test split to the training split if a class does not exist in the train split. * fix unit test

codecov · 2021-09-14T13:33:08Z

Codecov Report

Merging #1247 (fb2ceb8) into master (4597152) will decrease coverage by 0.12%.
The diff coverage is 81.81%.

@@            Coverage Diff             @@
##           master    #1247      +/-   ##
==========================================
- Coverage   88.19%   88.06%   -0.13%     
==========================================
  Files         138      140       +2     
  Lines       10866    11144     +278     
==========================================
+ Hits         9583     9814     +231     
- Misses       1283     1330      +47

Impacted Files	Coverage Δ
autosklearn/data/feature_validator.py	`97.50% <ø> (ø)`
autosklearn/data/target_validator.py	`97.08% <ø> (ø)`
autosklearn/ensembles/abstract_ensemble.py	`88.88% <0.00%> (ø)`
autosklearn/evaluation/abstract_evaluator.py	`92.91% <ø> (ø)`
...osklearn/metalearning/metafeatures/metafeatures.py	`94.59% <ø> (ø)`
autosklearn/metrics/__init__.py	`91.27% <ø> (+0.05%)`	⬆️
autosklearn/pipeline/base.py	`87.67% <ø> (+1.10%)`	⬆️
autosklearn/pipeline/classification.py	`86.61% <ø> (-0.28%)`	⬇️
autosklearn/pipeline/components/base.py	`78.78% <ø> (-0.38%)`	⬇️
...arn/pipeline/components/classification/__init__.py	`84.78% <ø> (ø)`
... and 85 more

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 4597152...fb2ceb8. Read the comment docs.

PGijsbers and others added 29 commits July 30, 2021 16:34

Fix typo and link formatting (#1195)

832b412

Fix determinism for few preprocessing techniques (#1172)

712b1af

* Fix determinism for few preprocessing techniques * Code formatting * Fix: model deserialization when mutual_info score is used for select_percentile/rates Co-authored-by: Rohit Agarwal <[email protected]>

Doc: Fix variable name in example (#1200)

d761cb3

The function `def get_runhistory_models_performance(automl):` didn't use the parameter `automl` and instead used the global variable `cls`. This PR fixes that.

Update pytest workflow and docker dist environements from Ubunut …

ca97162

…18.04 to 20.04 (#1198) * Updated docker distribution environment * Upgraded pytest workflow to use 20.04 * Updated values for 20.04

Fix typo and link formatting (#1195)

34534c7

Fix determinism for few preprocessing techniques (#1172)

90658e5

* Fix determinism for few preprocessing techniques * Code formatting * Fix: model deserialization when mutual_info score is used for select_percentile/rates Co-authored-by: Rohit Agarwal <[email protected]>

Doc: Fix variable name in example (#1200)

51f7eee

The function `def get_runhistory_models_performance(automl):` didn't use the parameter `automl` and instead used the global variable `cls`. This PR fixes that.

Update pytest workflow and docker dist environements from Ubunut …

39a5b29

…18.04 to 20.04 (#1198) * Updated docker distribution environment * Upgraded pytest workflow to use 20.04 * Updated values for 20.04

Merge branch 'development' of github.com:automl/auto-sklearn into dev…

c83171d

…elopment

Fix sparse y (#1213)

a38a5c3

* Moved fit validation to AutoML fit and fit_pipeline * Added test for target and feature types * Added convert if sparse throughout * type fixes, func doc * flake8'd * Added docstring to AutoML.fit * flake8'd

Update the code in manual and examples to be formatted more nicely (#…

2263e63

…1229) * Cleaned up some code in the documentation

Makes linkcheck not build the examples (#1231)

8d93737

New feature: performance over time curve (#1219)

f32d0c7

* added performance over time plot as attribute

Updated to use latest version of swig (#1242)

f859820

Updated docker PWD part of the command (#1235)

2fa3b32

Extends github action pytest to allow for tests to run for 60m (up …

f40beda

…from 45m) (#1239) * Update total test time out to be 60min * Update total test time out to be 60min

Updates pip install instructions (#1237)

ec0842a

* update * Update doc/installation.rst Co-authored-by: Matthias Feurer <[email protected]> * Update doc/installation.rst Co-authored-by: Matthias Feurer <[email protected]> Co-authored-by: Matthias Feurer <[email protected]>

Bump version number to 0.14.0dev

63808ef

to show that we're on the development branch.

Add sparse support for Auto-sklearn 2.0 (#1245)

a8effd8

Prepare new release 0.14.0 (#1246)

fb2ceb8

mfeurer merged commit 57463cf into master Sep 14, 2021

github-actions bot pushed a commit that referenced this pull request Sep 14, 2021

Matthias Feurer: Merge pull request #1247 from automl/development

e0c4ce9

github-actions bot pushed a commit to dumpmemory/auto-sklearn that referenced this pull request Sep 14, 2021

Matthias Feurer: Merge pull request automl#1247 from automl/development

0bc58b7

github-actions bot pushed a commit to iryna-savchuk/auto-sklearn that referenced this pull request Sep 28, 2021

Matthias Feurer: Merge pull request automl#1247 from automl/development

b56695b

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Release 0.14.0 #1247

Release 0.14.0 #1247

mfeurer commented Sep 14, 2021

codecov bot commented Sep 14, 2021 •

edited

Loading

Release 0.14.0 #1247

Release 0.14.0 #1247

Conversation

mfeurer commented Sep 14, 2021

codecov bot commented Sep 14, 2021 • edited Loading

Codecov Report

codecov bot commented Sep 14, 2021 •

edited

Loading