Sample: Using Dask with ESPResSo #4781
Conversation
I think it looks really nice and is a great addition to the samples, as it will help with high-throughput simulation studies. I have some questions scattered throughout the review, but I also wanted to ask one here:
How does it deal with having more jobs than can be run at one time? For example, if I open 5 workers on a Slurm cluster, can I keep passing jobs to these 5 workers, or do they close after a simulation is finished? I didn't see any closing in the script, so I assume it is the former, but how does that work exactly?
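To illustrate what I mean, here is a rough sketch of the usage I have in mind (the scheduler address and the toy function are placeholders, not taken from the sample):

```python
import dask.distributed

def simulate(volume_fraction):
    # stand-in for a full ESPResSo run
    return volume_fraction ** 2

# placeholder scheduler address
client = dask.distributed.Client("tcp://192.0.2.1:8786")

# with 5 workers, these 42 tasks would be queued; does each worker
# pick up the next task as soon as it finishes the previous one?
futures = [client.submit(simulate, 0.1 + 0.01 * i) for i in range(42)]
results = client.gather(futures)
```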
```python
VOLUME_FRACTIONS = np.arange(0.1, 0.52, 0.01)
...
client = dask.distributed.Client(sys.argv[1])
```
Is the argument theoretically supposed to be either a `Cluster` instance or `None`, or is it something different altogether?
I made it clear that this is a scheduler address, that a LocalCluster does not work, and that clusters with remote workers probably will.
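To illustrate the distinction (the address is a placeholder, not from the sample):

```python
import dask.distributed

# works: connect to a separately started scheduler by address;
# its workers may run on remote nodes
client = dask.distributed.Client("tcp://192.0.2.1:8786")

# not suitable for this sample: an in-process LocalCluster
# client = dask.distributed.Client(dask.distributed.LocalCluster())
```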
Answering the general question: the workers stay alive and can be re-used until they are explicitly shut down. ESPResSo globals are kept out of the worker by running ESPResSo in a sub-process, i.e., in an independent Python instance. This makes the serialization of input and output via pickle and base64 necessary, so they can be safely passed via stdin and stdout.
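A minimal sketch of that round trip, with placeholder names for the helpers and the worker script (the sample's actual identifiers may differ):

```python
import base64
import pickle
import subprocess
import sys

def encode(obj):
    """Pickle an object and wrap it in base64 so it survives text I/O."""
    return base64.b64encode(pickle.dumps(obj)).decode("ascii")

def decode(data):
    """Inverse of encode()."""
    return pickle.loads(base64.b64decode(data))

def run_simulation(params):
    """Run one simulation in an independent Python instance.

    'worker_script.py' is a placeholder: it would decode sys.argv[1],
    run the ESPResSo simulation, and print the encoded result to stdout.
    """
    completed = subprocess.run(
        [sys.executable, "worker_script.py", encode(params)],
        capture_output=True, text=True, check=True)
    return decode(completed.stdout.strip())
```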
I also added some docstrings and comments throughout the sample.
Anything still open here?
I was asked by @jngrad to run this solution inside of our RL workflow in order to correctly assess whether it resolves the issues raised during our meetings. This will, however, take a little bit of time, as we need to restructure the SwarmRL code so that it fits this structure. I think the code here works and is well written, but whether it will solve the issues with our distributed deployment is still an open question.
Co-authored-by: Rudolf Weeber <[email protected]>
LGTM
This was produced as a side project while learning Dask, but it might be useful for others.