
Multiprocessing #486 (Draft)

wants to merge 169 commits into base: master
Conversation

@nkrah (Collaborator) commented Oct 9, 2024

Enable GATE 10 to split a simulation into multiple parallel processes.
THIS IS WORK IN PROGRESS

First implemented items:

  • split run timing intervals
  • adapt dynamic objects (run-based)
  • spawn processes via Pool
  • write output into a separate subfolder per process

Still missing:

  • merge actor output from different processes
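The split-and-spawn steps listed above can be sketched roughly as follows. All names here are hypothetical, not the actual GATE 10 API: each process receives one sub-interval of the run timing and its own output subfolder, and the processes are dispatched via a multiprocessing Pool.

```python
import multiprocessing
from pathlib import Path


def split_interval(start, end, n):
    """Split one timing interval [start, end) into n equal sub-intervals."""
    step = (end - start) / n
    return [(start + i * step, start + (i + 1) * step) for i in range(n)]


def run_one_process(args):
    """Stand-in for running one simulation over one sub-interval; a real
    implementation would create the subfolder and run the engine there."""
    index, (t_start, t_stop), output_dir = args
    subfolder = Path(output_dir) / f"process_{index}"
    return {"process": index, "interval": (t_start, t_stop), "output": str(subfolder)}


if __name__ == "__main__":
    n_processes = 4
    tasks = [
        (i, iv, "output")
        for i, iv in enumerate(split_interval(0.0, 1.0, n_processes))
    ]
    # each worker gets an independent sub-interval and output subfolder
    with multiprocessing.Pool(n_processes) as pool:
        results = pool.map(run_one_process, tasks)
```

Note that Geant4-based workers would typically require the "spawn" start method rather than "fork"; that detail is omitted here for brevity.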

Code under review:

    output = se.run_engine()
    return output

    def run(self, start_new_process=False):
    def generate_run_timing_interval_map(self, number_of_processes):
    if number_of_processes % len(self.run_timing_intervals) != 0:
Contributor commented:
Why? I thought we just divide all time intervals by number_of_processes.

@nkrah (Collaborator, Author) replied:

Yes, but letting the user define the total number of processes, rather than the number of processes per run, is more intuitive and will not require an API change if we implement a more advanced splitting scheme in the future. So I think it's better this way.
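A minimal standalone sketch of the splitting logic under discussion, assuming the user passes the total process count. The real method lives on the Simulation class; the function signature and return shape here are guesses for illustration:

```python
def generate_run_timing_interval_map(run_timing_intervals, number_of_processes):
    """Map each process index to its share of the run timing intervals.

    The user specifies the *total* number of processes; each original
    interval is split into number_of_processes / len(run_timing_intervals)
    sub-intervals, so the total must be a multiple of the number of runs.
    """
    if number_of_processes % len(run_timing_intervals) != 0:
        raise ValueError(
            "number_of_processes must be a multiple of the number of run timing intervals"
        )
    per_run = number_of_processes // len(run_timing_intervals)
    interval_map = {}
    p = 0
    for start, end in run_timing_intervals:
        step = (end - start) / per_run
        for i in range(per_run):
            # each process handles one sub-interval of one run
            interval_map[p] = [(start + i * step, start + (i + 1) * step)]
            p += 1
    return interval_map
```

For example, two runs split across four processes assigns each process half of one run's interval.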

@nkrah (Collaborator, Author) commented Oct 11, 2024 via email

@nkrah (Collaborator, Author) commented Oct 11, 2024

I figured out a flexible mechanism to merge data back into a single actor output (provided the data is mergeable: true for images, not yet for ROOT).
We will need a new type of method, common to all actors, namely FinalizeSimulation(), to be triggered from the Simulation after all processes have finished. Writing the combined output (from the processes) to disk will be done in FinalizeSimulation(). EndOfSimulation(), where writing currently takes place, is called inside the process and therefore before the output is combined. We can also add an option to not store intermediate, i.e. per-process, output on disk when it is not needed. For example, images are accessible directly in memory and can be merged that way; there is no need to read the data back from disk.

Note: FinalizeSimulation() will not have access to engines because they do not exist any more outside of the subprocess.
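A rough sketch of the proposed pattern. Only the FinalizeSimulation()/EndOfSimulation() split comes from the comment above; ImageActor and its merge logic are invented for illustration:

```python
class ActorBase:
    """Sketch of the proposed actor interface (hypothetical)."""

    def EndOfSimulation(self):
        # runs inside each subprocess, i.e. before outputs are combined
        pass

    def FinalizeSimulation(self, per_process_outputs):
        # runs once in the main process after all subprocesses have finished;
        # no engine access here: engines only existed inside the subprocesses
        raise NotImplementedError


class ImageActor(ActorBase):
    """Image-like output is mergeable: sum the per-process voxel values."""

    def FinalizeSimulation(self, per_process_outputs):
        merged = list(per_process_outputs[0])
        for other in per_process_outputs[1:]:
            merged = [a + b for a, b in zip(merged, other)]
        # writing the combined image to disk would happen here
        return merged
```

For example, ImageActor().FinalizeSimulation([[1, 2], [3, 4]]) yields [4, 6] entirely in memory, without any per-process files on disk.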

@nkrah (Collaborator, Author) commented Oct 28, 2024

New:
The following actors now work in multiprocessing (local machine):

  • SimulationStatisticsActor: data is merged in memory and accessible after the simulation; written to disk if requested

  • Actors with ROOT output: ROOT files (from the per-process subdirectories) are merged into a new ROOT file in the main output folder structure. Event IDs are automatically incremented. Run IDs are recreated as in the original simulation.

Works with test019_phsp_actor -> created a new variant of the test.

Still need to create variants of other tests that use ROOT output to check.
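The event-ID renumbering described above can be illustrated with a pure-Python stand-in. The real merge operates on ROOT files; this dict-based version only sketches the offset logic:

```python
def merge_event_ids(per_process_events):
    """Concatenate per-process event lists, offsetting each process's
    EventIDs so they stay unique and monotonic in the merged output."""
    merged = []
    offset = 0
    for events in per_process_events:
        for e in events:
            merged.append({**e, "EventID": e["EventID"] + offset})
        if events:
            # next process continues numbering after the last merged event
            offset = merged[-1]["EventID"] + 1
    return merged
```

Two processes that each numbered their events from 0 thus end up with one continuous ID sequence in the merged file.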

@BishopWolf commented Nov 13, 2024

@nkrah I think all actors should have atomic variables; that way, all actors will be thread-safe by default. See this library: https://pypi.org/project/atomicx/ . It already implements atomic doubles, added at my suggestion.

from atomicx import AtomicFloat

# Create an atomic float with an initial value of 0.0
atom = AtomicFloat()
print(f"Initial Value: {atom.load()}")

# Perform atomic operations
atom.store(3.14)
value = atom.load()
print(f"Value: {value}")

# See docs for more operations

@nkrah (Collaborator, Author) commented Nov 13, 2024

@BishopWolf Thanks for the suggestion. I think atomic doubles will be useful for certain parts of the actors.

Bear in mind that this PR is about multiprocessing, i.e. running an (independent) simulation in a newly spawned process. There is no issue with shared-memory handling in this case.

Concerning multithreading: we are actually using the multithreading architecture from Geant4, which means that not every part of a simulation runs in separate threads; only certain methods do. Therefore, only certain shared data structures, e.g. images into which all threads write, need to be thread-safe. Currently, there is no Python-side function that accesses shared data on a per-thread basis, only C++ functions. If this changes in the future, I think the package you suggest could be a good option.
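For contrast, here is a minimal lock-based illustration of the kind of thread safety that only shared structures need (independent subprocesses, as in this PR, do not). This is purely illustrative, not GATE code; the atomicx package suggested above would serve the same role without an explicit lock:

```python
import threading


class ThreadSafeAccumulator:
    """Lock-protected scoring array, mimicking an image all threads deposit into."""

    def __init__(self, size):
        self._data = [0.0] * size
        self._lock = threading.Lock()

    def deposit(self, index, value):
        # read-modify-write must be protected when several threads share the array
        with self._lock:
            self._data[index] += value

    def total(self):
        with self._lock:
            return sum(self._data)


def worker(acc, n_deposits):
    for i in range(n_deposits):
        acc.deposit(i % 4, 1.0)


acc = ThreadSafeAccumulator(4)
threads = [threading.Thread(target=worker, args=(acc, 1000)) for _ in range(4)]
for t in threads:
    t.start()
for t in threads:
    t.join()
# 4 threads x 1000 deposits of 1.0 each -> total of 4000.0
```

With one atomic double per voxel, as atomicx provides, the explicit lock would be unnecessary.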

@nkrah (Collaborator, Author) commented Nov 28, 2024

I will pick this up again once PR #599 is done.
