Release candidate v0.1.17rc1 #663

bouthilx · 2021-09-14T17:05:54Z

🏗 Enhancements

Revert to default inf max_trials and pool-size of 1 @bouthilx (Revert to default inf max_trials and pool-size of 1 #659)
Support to configure executor for benchmark @donglinjy (Support to configure executor for benchmark #634)

🐛 Bug Fixes

Fix grouping of plots in legend @lebrice (Fix grouping of plots in legend #662)
Support linearize log integer properly @bouthilx (Support linearize log integer properly #658)
Fix shape argument in categorical dims @bouthilx (Fix shape argument in categorical dims #654)
Use user_script_config from parser in EVC @bouthilx (Use user_script_config from parser in EVC #655)
Fix TPE sampling in narrow spaces @bouthilx (Fix TPE sampling in narrow spaces #650)
Handle repo with invalid HEAD state @bouthilx (Handle repo with invalid HEAD state #645)
List EVC tree once only when querying with name @bouthilx (List EVC tree once only when querying with name #642)
Handle properly all types in config during branch @bouthilx (Handle properly all types in config during branch #641)

📜 Documentation

Clear error message for dup branching error @bouthilx (Clear error message for dup branching error #652)
Add missing documentation for EVC enable option @bouthilx (Add missing documentation for EVC enable option #651)

Merge back master in develop after release

Why: During resolution of conflicts, the user's script configuration file is parsed for any marked resolution. This parsing is not handling properly values that may not be string and causes Oríon to crash. How: The values should only be handled if they are string and contain the value '~' as this is the only way to mark a resolution. They can be safely ignored otherwise.

Why: Starting with v0.1.16 the EVC is disabled by default. It must be activated for the tests starting from v0.1.16.

Handle properly all types in config during branch

Why: The children should not be listed multiple times. If they appear in the EVC tree of some parent experiment then they should not be listed as a root as well. How: Problem was that all experiments matching the name would be fetched and printed with their tree. When name is specified, version queried should be 1 by default unless specified by the user.

Why: If the HEAD of the repo is in an invalid state (no branch, no commit), gitpython will crash when attempting to fetch the information required for the EVC. How: First check if HEAD state is valid, if not raise warning and ignore repo.

List EVC tree once only when querying with name

Handle repo with invalid HEAD state

Support to configure executor for benchmark

Why: The GMM for real values could not sample efficiently in narrow spaces. It would often lead to RuntimeError because the number of attempts allowed would be exhausted. We could increase the default number of attempts allowed but that would increase the computational cost for any space, even those easy to sample. How: Use numpy array to avoid playing with a list. It is more efficient. Also, increase the number of attempts as needed until it reaches a max value of attempts. This way easy samples do not take more time while difficult ones are allowed more.

Fix TPE sampling in narrow spaces

Add missing documentation for EVC enable option

Why: When trying to branch from an experiment that already has a child with the same name, Oríon will crash with a RaceCondition error. The problem is that this issue and a real race-condition are indiscernible as they lead to the same state. The only thing we can do is clarify the error message and warn that this error can also be caused by branching from a parent experiment that already has a child with the same name. Note: This issue only arises if the user specifies the version of the parent experiment, otherwise the EVC will use the child or branch from the child without any issue.

Why: The shape of categorical dimensions was not included in `get_prior_string`, causing the lost of the shape during branching. How: Add the shape to `get_prior_string` of Categorical dimensions and add tests to catch this issue. Also add tests for Conflict and Resolutions of priors with different shape. The adaptor for a change of prior does not raise an issue anymore when shapes are different and rather logs a warning. The trials are all ignored is this case.

Why: The option `user_script_config` is part of the worker configuration, not the EVC. As such, this option is not part of `branching` group of option and does not find its way to the Conflict objects of the EVC. The value of user_script_config is already available anyway inside the experiment configuration, so there is no need to pass it. Furthermore, it may differ from past experiments to the new one, and both experiments should be handled based on their respective `user_script_config`, not the new one. For these reasons, it is better to use the information available in the experiment configurations.

Use user_script_config from parser in EVC

…_dup Clear error message for dup branching error

Fix shape argument in categorical dims

Why: A log integer dimension would be casted to real, linearized, then casted back to integer. This reduces dramatically the number of possible values that can be sampled as many exp(int(log(x))) will result in the same integer for many different values of x. An algorithm that needs linearization should be able to handle real space or otherwise state a requirement for integers. This should be handled separately and quantization of linearized log integer should not be applied by default. Note: Due to the use of floor instead of rounding in Quantize, the values of int(exp(log(x))) would still clash for close values of x. Using rounding instead solves the issue. Rounding may be problematic however for algorithms that require integer type, as the rounding may cast real integers to values that are out-of-bound. For now there are no foreseeable algorithms that may require integer type so I avoid fixing the issue and leave it for later if the need even arises (which I highly doubt).

Support linearize log integer properly

Why: The new default behavior is confusing for users. It is also difficult to determine a good default max_trials, so having not enough or to many trials sampled by default at the start of HPO can be annoying for many users. Using inf by default and iterating with pool-size may be the best alternative. Now that we have a support for n-workers, the argument pool-size we previously deprecated actually make sense. By default, pool-size should be equal to number of workers. We have n-workers set to 1 by default, so by default we are back to previous behavior; sampling 1 trial at a time, until max_trials. How: The producer now takes a pool size as argument when producing. The same applies to ExperimentClient.suggest() and ExperimentClient.workon(). The pool size is used to sample multiple trials at a time and increase I/O efficiency. The producer now keeps track of number of new trials so that if multiple workers are producing new trials with a non-seed algorithm (hence they produce different trials and there are no conflicts leading to backoff) they will stop if they generated together up to `pool_size` trials. Note: Pool-size is moved to to worker configuration instead. Since pool-size relates to n-workers, which is part of worker configuration, having pool-size in worker configuration makes more sense.

Why: There was a bug in the tests. The functions to generate trials would generate more than requested because of the new behavior of producer attempting to produce all trials at once, once the value of `max_trials` was conflicting with the number of trials requested to the trial generating function for tests (`orion.testing.evc.generate_trials`). Fortunately the bug in the tests did not seem to miss any bugs in the code they were testing. How: Adjust the expected numbers based on the corrected behiavor. The numbers make indeed more sense now.

Signed-off-by: Fabrice Normandin <[email protected]>

Co-authored-by: Lin Dong <[email protected]>

… into feature/back_to_pool_size

Revert to default inf max_trials and pool-size of 1

Fix grouping of plots in legend

donglinjy and others added 30 commits August 17, 2021 17:15

Support to configure executor for benchmark

cb4edf3

consolidate parameter n_workers and testing code

5032cc1

Update backward comp test versions

801339c

Merge pull request #640 from Epistimio/ci/sync_master_back_to_dev

e3ea69a

Merge back master in develop after release

isort

4cedc07

Add pytest lazy fixture dependency

c1cc15c

Fix backward compatibility tests for EVC

f2ba6c6

Why: Starting with v0.1.16 the EVC is disabled by default. It must be activated for the tests starting from v0.1.16.

Fix backward test version for EVC

6c69adf

Avoid side effects in fixtures

13052a0

With conflict with new_config fixture

e6c8544

black

01bb166

Merge pull request #641 from bouthilx/hotfix/config_conflict_any_type

5099fe9

Handle properly all types in config during branch

Handle repo with invalid HEAD state

96c4c80

Why: If the HEAD of the repo is in an invalid state (no branch, no commit), gitpython will crash when attempting to fetch the information required for the EVC. How: First check if HEAD state is valid, if not raise warning and ignore repo.

Merge pull request #642 from bouthilx/hotfix/list_same_names

5a5fbcb

List EVC tree once only when querying with name

Merge pull request #645 from bouthilx/hotfix/handle_invalid_repo

266081f

Handle repo with invalid HEAD state

adjust test cases

116a509

fix executor number warning message

4c291bf

Merge pull request #634 from donglinjy/configure-benchmark-executor

08a90f0

Support to configure executor for benchmark

Using single value, not list when recursively sampling

80e4cd2

Add missing documentation for EVC enable option

74b2bc6

Merge pull request #650 from bouthilx/hotfix/tpe_narrow_sampling

9a7a6e4

Fix TPE sampling in narrow spaces

Merge pull request #651 from bouthilx/doc/enable_evc

9e9b7dd

Add missing documentation for EVC enable option

isort

7ec67e1

Add missing fixture

8792976

bouthilx and others added 17 commits September 7, 2021 13:57

Blackify

f168573

Merge pull request #655 from bouthilx/hotfix/log_user_script_config

95fcf3c

Use user_script_config from parser in EVC

Merge pull request #652 from bouthilx/hotfix/branch_parent_child_name…

77ea6cc

…_dup Clear error message for dup branching error

Merge pull request #654 from bouthilx/hotfix/branching_keep_dim_shape

d46c8c7

Fix shape argument in categorical dims

Merge pull request #658 from bouthilx/hotfix/out_of_bound_log_int

5de01a9

Support linearize log integer properly

Revert max trials to 10e8 (like inf)

35ce5c4

Accept many arguments for workon

c7d3e0c

Fix grouping of plots in legend

6fd00a7

Signed-off-by: Fabrice Normandin <[email protected]>

Apply suggestions from code review

db385f2

Co-authored-by: Lin Dong <[email protected]>

Increase threshold of algo test...

72698c0

Merge branch 'feature/back_to_pool_size' of github.com:bouthilx/orion…

32c184c

… into feature/back_to_pool_size

Merge pull request #659 from bouthilx/feature/back_to_pool_size

10383b2

Revert to default inf max_trials and pool-size of 1

Merge pull request #662 from lebrice/fix_plotting_groups

0683944

Fix grouping of plots in legend

Doc updates for release

bd33c32

bouthilx added the release label Sep 14, 2021

bouthilx added this to the v0.1.17 milestone Sep 14, 2021

bouthilx changed the base branch from develop to master September 14, 2021 17:06

bouthilx merged commit 0ef3eea into master Sep 14, 2021

bouthilx deleted the release-v0.1.17rc1 branch September 14, 2021 18:52

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Release candidate v0.1.17rc1 #663

Release candidate v0.1.17rc1 #663

bouthilx commented Sep 14, 2021

Release candidate v0.1.17rc1 #663

Release candidate v0.1.17rc1 #663

Conversation

bouthilx commented Sep 14, 2021

🏗 Enhancements

🐛 Bug Fixes

📜 Documentation