Campaign class to vary multiple inputs #25

bendudson · 2023-09-08T01:29:24Z

Campaign tries to modify multiple parameters to reach a given target. For example if we have a case object:

campaign = uetools.Campaign(case, {'pcoree': 1e6, 'ncore': 6e19}, ".")

will create a campaign to start from the given case, modifying pcoree and ncore parameters to reach the given values. The final input is the path to store simulation states in.

To run the campaign in serial:

campaign.run()

Then details of the converged state are in

state = campaign.closest_state()

Campaign can also run in parallel, using multiple processes to explore paths to the target parameters e.g

campaign.run_async(processes=2)

See docstring for details and examples.

Update main branch with interpolation and input-tracking from develop

Merge dump-routines from develop

Consolidating additional input read/write routines from develop

Fix for standalone import and use of UETOOLS

Merge bugfixes from develop

Intended to automate varying multiple parameters using multiprocessing.

- If using an old version of UEDGE then isgridhdf5 will not be available. In that case make the error message more useful. - Add example and modify docstrings Some inputs can be either YAML or HDF5 files, and the code figures it out.

Takes a random walk towards the target input values, trying to converge at each step.

Uses multiple workers to find a path to the given target parameters. Includes a Parallel.QuietPool to suppress output from multiple UEDGE simulations running at the same time. Instructions in the docstrings.

Document how to run jobs asyncronously step-by-step. Need to keep calculating distance from start to target, so calculate it once and re-use.

holm10

This is great! I amended the description for the default variables that are saved to the HDF5 file.

holm10 · 2023-09-08T19:35:59Z

One thing that struck me just now: how does your algorithm handle indices, e.g. setting elements of arrays? In your example you set pcoree and ncore: pcoree is a float an unambiguous, but ncore is an array that controls the ion species separately. Commonly we are interested in varying the first index only, but for some instances and variables we are interested in varying all, a subset, or one element somewhere in the array.

bendudson · 2023-09-08T19:49:59Z

@holm10 For an array, this algorithm will change all of the elements together, for example if recycp is [0.8, 0, 0, 0, 0] and you want to change it to [0.9, 0, 0.1, 0, 0] then the algorithm should cope with that. The array is treated as one variable, so all elements of an array are interpolated between initial and final state together.

Discrete switches are more difficult, and currently all target settings are assumed to be floats or arrays of floats. In principle it should be possible to make the code decide whether or not to enable the switch. Similarly there are decisions like changing grid resolution that the algorithm might be extended to make.

The "brains" of the algorithm is the next_action() function here:
https://github.com/LLNL/UETOOLS/pull/25/files#diff-a1aa56063bcc52322aaf63f5f2640d1eb29dee8daeee2504d72e497126b075ddR339
It takes the currently known list of states, and tries to decide which parameters to vary. Currently what it does is:

Choose a starting state randomly, weighted to prefer states closer to the target
Choose randomly whether to go directly to the target, with a probability that increases closer to the target
If not going to the target value: For each parameter (an array is treated as a single parameter), choose a random point in between its current value and the target value.

This can definitely be improved. I tried to design something that has sensible preferences, but has a large element of chance so that it will (hopefully) eventually stumble on a path that works. The priority is to be robust without human intervention, not efficient in CPU time.

If an exception is thrown, then distance should be stored in the result. To ensure that it isn't used again, distance is set to a large number.

holm10 · 2023-09-08T20:07:46Z

Thanks for the clarification, @bendudson . I can think of three instances one might want to do in UEDGE for variables that are arrays:

Change the whole array

For afracs (impurity concentration), etc

Change individual indices

E.g. ncore, recyrb_use, albrb, etc.

Modify a subarray of a UEDGE variable array

E.g. modify the divertor diffusivities only using kye_use, dif_use, etc.

1 Is already taken care of by the default behavior. 2 I think could be quite easily implemented by passing a nested dict, where indices could be set individually: e.g. {'pcoree': 1e6, 'ncore': {0:6e19, 3:1e20}}. This would be more user-friendly than copying the original array and changing the individual indices before passing them. 3 is definitely the most challenging, and I am looking into some sort of solution for the continuation solve. I a settling for
try: var[indices] = target except: var = target
or something similar, where indices is a tuple of slices, e.g. (slice(0, com.ixpt1[0]+1), slice(None), 0) for setting the inner leg diffusivity for hydrogenic ions. I think the most important thing is to make it unambiguous how the setting of the variable works.

As for the next_action()-function, I understand that a CPU-intensive and human-independent approach may be better off biasing the new states towards the target value, but in my experience it is often more effective (also wall-clock) to bias towards the starting state, e.g. progressing in many small steps rather than one big step. I guess the dynamics change as the Campaign really unleashes a large number of cases, and it is enough that only one of them converges in somewhat reasonable time for it to be worthwhile. However, it's probably worth looking into how editing the bias will change the performance.

If an attempt fails more than once, set their distance to a large number so that they are not chosen again.

When new results are obtained, pickle the campaign so it can be restored later if needed.

Check whether a state is converged before including it

Adds three parameters that control how cautious the solver is. Currently fixed, but could be made adaptive in future.

If a case is not converged, when it is retried mark the original as large distance so it won't be chosen again.

Not included in the uetools package, so causes error on install.

Mypy still not happy, but improving

A uedgerc configuration is not always needed, so continue to create a Case object even if configuration is not available.

Updates UETOOLS to v1.1.0

- Update to v1.1.1 - Updates documentation - Adds catch for when running UETOOLS without UEDGE.

- Fixes backwards-compatibility issues with older UEDGE versions - Adds grip morphing routines - Updates Jupyter notebooks - Interactive plots use separate class - Interactive plots can consider different grids in same database

Updates master to 1.2.0

Conflicts: setup.py src/uetools/__init__.py uetools/UeCase/Case.py

holm10 and others added 9 commits August 18, 2023 10:44

Merge pull request LLNL#19 from LLNL/develop

6819051

Update main branch with interpolation and input-tracking from develop

Merge pull request LLNL#20 from LLNL/develop

3e5061e

Merge dump-routines from develop

Merge pull request LLNL#21 from LLNL/develop

f276150

Consolidating additional input read/write routines from develop

Merge pull request LLNL#22 from LLNL/develop

78e985e

Fix for standalone import and use of UETOOLS

Merge pull request LLNL#23 from LLNL/develop

60ef06e

Merge bugfixes from develop

Starting UeCampaign module

eada83c

Intended to automate varying multiple parameters using multiprocessing.

Case: Catch old UEDGE versions, add documentation

98f9bea

- If using an old version of UEDGE then isgridhdf5 will not be available. In that case make the error message more useful. - Add example and modify docstrings Some inputs can be either YAML or HDF5 files, and the code figures it out.

Campaign running in serial

1699282

Takes a random walk towards the target input values, trying to converge at each step.

Campaign.run_async for parallel solves

06fea3b

Uses multiple workers to find a path to the given target parameters. Includes a Parallel.QuietPool to suppress output from multiple UEDGE simulations running at the same time. Instructions in the docstrings.

bendudson changed the title ~~WIP: Campaign class to vary multiple inputs~~ Campaign class to vary multiple inputs Sep 8, 2023

bendudson and others added 2 commits September 8, 2023 10:02

Add some documentation, add total_distance member

2a7a767

Document how to run jobs asyncronously step-by-step. Need to keep calculating distance from start to target, so calculate it once and re-use.

Added description of default stored variables

186b643

holm10 approved these changes Sep 8, 2023

View reviewed changes

Campaign: Add fields on job failure

3aa07a7

If an exception is thrown, then distance should be stored in the result. To ensure that it isn't used again, distance is set to a large number.

bendudson and others added 13 commits September 8, 2023 13:30

Campaign: Discard states that fail multiple times

978d676

If an attempt fails more than once, set their distance to a large number so that they are not chosen again.

Add pickle saves to run_async

1c510e7

When new results are obtained, pickle the campaign so it can be restored later if needed.

Merge branch 'campaign' of github.com:bendudson/UETOOLS into campaign

575888c

Campaign: Don't include unconverged states in closest_state

c2e79de

Check whether a state is converged before including it

Campaign.next_action add parameters

9bc06ed

Adds three parameters that control how cautious the solver is. Currently fixed, but could be made adaptive in future.

Campaign: Prevent an unconverged case being retried multiple times

3f58e97

If a case is not converged, when it is retried mark the original as large distance so it won't be chosen again.

setup: Removing uetools.UeGui from packages list

ec2de55

Not included in the uetools package, so causes error on install.

UeCampaign type fixes

eacd81a

Mypy still not happy, but improving

Fix handling of missing casename and inputs

a301fb8

Merge branch 'main' into campaign

1d12069

Case: Keep going if configuration is missing

532c414

A uedgerc configuration is not always needed, so continue to create a Case object even if configuration is not available.

Merge pull request LLNL#28 from LLNL/develop

c28208a

Updates UETOOLS to v1.1.0

Merge pull request LLNL#29 from LLNL/develop

1b95334

- Update to v1.1.1 - Updates documentation - Adds catch for when running UETOOLS without UEDGE.

holm10 and others added 5 commits February 20, 2024 16:59

Merge pull request LLNL#30 from LLNL/develop

28b61ba

- Fixes backwards-compatibility issues with older UEDGE versions - Adds grip morphing routines - Updates Jupyter notebooks - Interactive plots use separate class - Interactive plots can consider different grids in same database

Merge pull request LLNL#31 from LLNL/develop

f004ff9

Updates master to 1.2.0

Update README.md

0ddb2ee

Update VERSION

1ba87c1

Merge branch 'main' into campaign

00b744d

Conflicts: setup.py src/uetools/__init__.py uetools/UeCase/Case.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Campaign class to vary multiple inputs #25

Campaign class to vary multiple inputs #25

bendudson commented Sep 8, 2023 •

edited

Loading

holm10 left a comment

holm10 commented Sep 8, 2023

bendudson commented Sep 8, 2023

holm10 commented Sep 8, 2023 •

edited

Loading

Campaign class to vary multiple inputs #25

Are you sure you want to change the base?

Campaign class to vary multiple inputs #25

Conversation

bendudson commented Sep 8, 2023 • edited Loading

holm10 left a comment

Choose a reason for hiding this comment

holm10 commented Sep 8, 2023

bendudson commented Sep 8, 2023

holm10 commented Sep 8, 2023 • edited Loading

bendudson commented Sep 8, 2023 •

edited

Loading

holm10 commented Sep 8, 2023 •

edited

Loading