Fix memory leak peaksplitting #309

JoranAngevaare · 2020-08-28T15:37:11Z

What is the problem / what does the code in this PR do
The way hitlets were now using the peak splitting induced a serious memory leak. Also it utterly degraded strax' performance.

Can you briefly describe how it works?
I removed the associated changes in #275 that induced this bug.

Can you describe the symptoms?
When running straxer (e.g. like this):

python -m memory_profiler straxer.py 170206_1355 --target peaklets --context xenon1t_dali --build_lowlevel

Would yield something like:

Got 1525 items. Now 629.4 sec / 17.5% into the run. Using 7034.7 MB RAM. ETA 26021.48 sec.
....
Got 1936 items. Now 756.6 sec / 21.0% into the run. Using 8234.6 MB RAM. ETA 25669.28 sec.

Until the memory usage went up to 45 GB! Also the processing speed by a factor of ~5.

update fork

WenzDaniel

I have one more idea which prevents us from copy-pasting the code. I would like try this first before we use your suggestion. Its a bit inspired by your solution. Can you forward me the memory and performance tests you did to so I can compare?

WenzDaniel · 2020-08-29T08:27:23Z

strax/processing/peak_splitting.py

+                # is computed
+                r['dt'] = orig_dt
+                r['length'] = (split_i - prev_split_i) * p['dt'] / orig_dt
+                r['max_gap'] = -1  # Too lazy to compute this


hitlets does not support 'max_gap' so simply remove this line.

Fixed in 17b4c4e

WenzDaniel · 2020-08-29T09:41:31Z

Here is a different kind of solution #310 which avoids the copy and pasting of the code.

WenzDaniel · 2020-08-30T06:30:37Z

strax/processing/peak_splitting.py

+                       _result_buffer=None, result_dtype=None):
+        """Loop over peaks, pass waveforms to algorithm, construct
+        new peaks if and where a split occurs.
+        """


Please add here some warning that changes in this function might also be applied in _split_peaks

Thanks Daniel, did so in 17b4c4e

JoranAngevaare · 2020-08-31T10:33:47Z

Thanks Daniel, I do think we do indeed have a different opinion on this one. (see #310)

I just addressed your feedback, would you mind verifying if this line is okay for some nVeto data (I don't have any at hand):

strax/strax/processing/peak_splitting.py

Line 176 in 17b4c4e

@strax.growing_result(dtype=strax.hitlet_dtype(), chunk_size=int(1e4))

WenzDaniel · 2020-08-31T17:07:03Z

strax/processing/peak_splitting.py

-        else:
-            raise TypeError(f'Unknown data_type. "{data_type}" is not supported.')
-        new_peaks = self._split_peaks(
+        new_peaks = split_function[data_type](
            # Numba doesn't like self as argument, but it's ok with functions...
            split_finder=self.find_split_points,
            peaks=peaks,


You have to change these lines into arguments, since peaks is called hits in _split_hitlets

WenzDaniel

Looks fine go ahead

JoranAngevaare added 4 commits August 18, 2020 16:18

Merge pull request #22 from AxFoundation/master

3cfd250

update fork

Remove hitlet induced memory leak

d800492

fix hitlet processing

cb52792

fix codefactor

7127c79

WenzDaniel reviewed Aug 29, 2020

View reviewed changes

WenzDaniel mentioned this pull request Aug 29, 2020

Fix memory leak. #310

Closed

WenzDaniel reviewed Aug 30, 2020

View reviewed changes

address feedback

17b4c4e

WenzDaniel reviewed Aug 31, 2020

View reviewed changes

fix hitlets splits

9d123f4

WenzDaniel approved these changes Aug 31, 2020

View reviewed changes

WenzDaniel merged commit e94b1c7 into AxFoundation:master Aug 31, 2020

JoranAngevaare deleted the fix_memory_leak_peaksplitting branch August 31, 2020 18:34

JoranAngevaare mentioned this pull request Apr 29, 2021

Refactor concat and get data #430

Merged

4 tasks

WenzDaniel mentioned this pull request May 3, 2021

Refactor hitlets #436

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix memory leak peaksplitting #309

Fix memory leak peaksplitting #309

JoranAngevaare commented Aug 28, 2020

WenzDaniel left a comment

WenzDaniel Aug 29, 2020

JoranAngevaare Aug 31, 2020

WenzDaniel commented Aug 29, 2020

WenzDaniel Aug 30, 2020

JoranAngevaare Aug 31, 2020 •

edited

Loading

JoranAngevaare commented Aug 31, 2020

WenzDaniel Aug 31, 2020

WenzDaniel left a comment

Fix memory leak peaksplitting #309

Fix memory leak peaksplitting #309

Conversation

JoranAngevaare commented Aug 28, 2020

WenzDaniel left a comment

Choose a reason for hiding this comment

WenzDaniel Aug 29, 2020

Choose a reason for hiding this comment

JoranAngevaare Aug 31, 2020

Choose a reason for hiding this comment

WenzDaniel commented Aug 29, 2020

WenzDaniel Aug 30, 2020

Choose a reason for hiding this comment

JoranAngevaare Aug 31, 2020 • edited Loading

Choose a reason for hiding this comment

JoranAngevaare commented Aug 31, 2020

WenzDaniel Aug 31, 2020

Choose a reason for hiding this comment

WenzDaniel left a comment

Choose a reason for hiding this comment

JoranAngevaare Aug 31, 2020 •

edited

Loading