You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Is there a simpler approach?
I naively would have expected that there's sth like load_chunk*s* (potentially hidden inside the load_chunk call) that would load the data from all chunks that are part of the dataset.
A related question to the approach above also pertains to performance: The viewer example load each chunk individually by calling many flushes. Is there a better(transparent way) around it (again ideally hidden behind a load_chunk*s* call)?
The text was updated successfully, but these errors were encountered:
Hello Philipp,
the code that you found seems a bit more involved than is necessary. Do I understand correctly that what you want to do is to inspect the n-dimensional blocks as they are in the bp5 files and load the slices that are there instead of selecting slices yourself?
In that case, I suggest doing:
A load_chunks() call could theoretically be introduced, but would purely be syntactical sugar as the single load_chunk() operations are all executed at once during series.flush().
Otherwise, if you want to actually load everything, slicing should not be necessary, but instead total_chunk = record_component.load_chunk() should be efficient enough in ADIOS2, but I'm not sure that's what you are looking for.
Are you running in parallel? I have a WIP branch with chunk distribution algorithms for parallel setups. If you're interested in that, we can also try sth there.
I just learned the long way that my naive approach to slice data (or even read full datasets) like
Does not play nicely with chunked data (ADIOS2/mp5 output).
I saw the the openpmd-viewer introduces quite a bit of logic to load slice data https://github.com/openPMD/openPMD-viewer/blob/6eccb608893d2c9b8d158d950c3f0451898a80f6/openpmd_viewer/openpmd_timeseries/data_reader/io_reader/utilities.py#L88
Is there a simpler approach?
I naively would have expected that there's sth like
load_chunk*s*
(potentially hidden inside theload_chunk
call) that would load the data from all chunks that are part of the dataset.A related question to the approach above also pertains to performance: The viewer example load each chunk individually by calling many flushes. Is there a better(transparent way) around it (again ideally hidden behind a
load_chunk*s*
call)?The text was updated successfully, but these errors were encountered: