[14.0.X] `TrackingRecHitsSoACollection`: early return `hostData` in `CopyHost::copyAsync()` when there aren't hits #45858

mmusich · 2024-09-02T12:28:32Z

backport of #45837

PR description:

This PR is meant as a fix for #45834, and builds on top of the earlier fix #45744 or issue #45708.
In a nutshell, CopyHost::copyAsync() returns before the alpaka::memcpy call in TrackingRecHitsSoACollection if the size() of the input deviceData is null (i.e. there are no input hits).
Additionally protect CAHitNtupletGenerator by returning before launching kernels when there are less than 2 hits.

PR validation:

Multiple tests have been performed with this branch:

I tested successfully (on lxplus8-gpu):

the script at OneToManyAssoc assertion faillure in HLT menu in CMSSW_14_0_15 #45834 (comment) (standard FOG-like tests for pp)
the script at Runtime crash when forcing only pixel tracking+vertexing on serial_sync backend #45708 (comment) (hybrid GPU+CPU menu, HIon-like)
runTheMatrix.py --what upgrade -l 12861.402

Additionally, following the example at #45834 (comment) I I feed the "alpaka-migrated" menu from CMSHLT-3284 into the relval machinery via:

cmsrel CMSSW_14_0_15
cd CMSSW_14_0_15/src
cmsenv
git cms-addpkg HLTrigger/Configuration
git cms-addpkg Configuration/PyReleaseValidation
hltGetConfiguration /users/soohwan/HLT_140X/Alpaka/HIonV173/V10 \
   --globaltag auto:phase1_2024_realistic \
   --mc \
   --unprescale \
   --cff > "${CMSSW_BASE}"/src/HLTrigger/Configuration/python/HLT_User_cff.py
scram b -j 20

and then apply the following patch:

diff --git a/Configuration/PyReleaseValidation/python/upgradeWorkflowComponents.py b/Configuration/PyReleaseValidation/python/upgradeWorkflowComponents.py
index 8a70a74aa0c..f9dc0a0397f 100644
--- a/Configuration/PyReleaseValidation/python/upgradeWorkflowComponents.py
+++ b/Configuration/PyReleaseValidation/python/upgradeWorkflowComponents.py
@@ -2865,7 +2865,7 @@ upgradeProperties[2017] = {
     '2022HI' : {
         'Geom' : 'DB:Extended',
         'GT':'auto:phase1_2022_realistic_hi',
-        'HLTmenu': '@fake2',
+        'HLTmenu': 'User',
         'Era':'Run3_pp_on_PbPb',
         'BeamSpot': 'DBrealistic',
         'ScenToRun' : ['GenSim','Digi','RecoNano','HARVESTNano','ALCA'],
@@ -2873,7 +2873,7 @@ upgradeProperties[2017] = {
     '2022HIRP' : {
         'Geom' : 'DB:Extended',
         'GT':'auto:phase1_2022_realistic_hi',
-        'HLTmenu': '@fake2',
+        'HLTmenu': 'User',
         'Era':'Run3_pp_on_PbPb_approxSiStripClusters',
         'BeamSpot': 'DBrealistic',
         'ScenToRun' : ['GenSim','Digi','RecoNano','HARVESTNano','ALCA'],
@@ -2881,7 +2881,7 @@ upgradeProperties[2017] = {
     '2023HI' : {
         'Geom' : 'DB:Extended',
         'GT':'auto:phase1_2023_realistic_hi',
-        'HLTmenu': '@fake2',
+        'HLTmenu': 'User',
         'Era':'Run3_pp_on_PbPb',
         'BeamSpot': 'DBrealistic',
         'ScenToRun' : ['GenSim','Digi','RecoNano','HARVESTNano','ALCA'],
@@ -2889,7 +2889,7 @@ upgradeProperties[2017] = {
     '2023HIRP' : {
         'Geom' : 'DB:Extended',
         'GT':'auto:phase1_2023_realistic_hi',
-        'HLTmenu': '@fake2',
+        'HLTmenu': 'User',
         'Era':'Run3_pp_on_PbPb_approxSiStripClusters',
         'BeamSpot': 'DBrealistic',
         'ScenToRun' : ['GenSim','Digi','RecoNano','HARVESTNano','ALCA'],

in a release that I have prepared with this and then finally run:

runTheMatrix.py --what upgrade -l 15261.0 --maxSteps 2 (neutrino gun input)
runTheMatrix.py --what upgrade -l 15224.0 --maxSteps 2 (TTbar input)

Both tests run fine.

If this PR is a backport please specify the original PR and why you need to backport that PR. If this PR will be backported please specify to which release cycle the backport is meant for:

Backport of #45837 to CMSSW_14_0_X for data-taking purposes.

… hits

mmusich · 2024-09-02T12:28:39Z

type bug-fix

cmsbuild · 2024-09-02T12:28:56Z

A new Pull Request was created by @mmusich for CMSSW_14_0_X.

It involves the following packages:

DataFormats/TrackingRecHitSoA (heterogeneous, reconstruction)
RecoTracker/PixelSeeding (reconstruction)

@cmsbuild, @fwyzard, @jfernan2, @makortel, @mandrenguyen can you please review it and eventually sign? Thanks.
@GiacomoSguazzoni, @JanFSchulte, @VinInn, @VourMa, @dgulhan, @felicepantaleo, @gpetruc, @missirol, @mmusich, @mtosi, @rovere this is something you requested to watch as well.
@antoniovilela, @mandrenguyen, @rappoccio, @sextonkennedy you are the release manager for this.

cms-bot commands are listed here

Backported from TrackingRecHitsSoACollection: early return hostData in CopyHost::copyAsync() when there aren't hits #45837

cmsbuild · 2024-09-02T12:28:57Z

cms-bot internal usage

fwyzard · 2024-09-02T12:30:18Z

enable gpu

fwyzard · 2024-09-02T12:30:20Z

please test

fwyzard · 2024-09-02T12:30:25Z

+heterogeneous

cmsbuild · 2024-09-02T17:32:15Z

+1

Size: This PR adds an extra 20KB to repository
Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-ebf107/41223/summary.html
COMMIT: 02d008f
CMSSW: CMSSW_14_0_X_2024-09-02-1100/el8_amd64_gcc12
Additional Tests: GPU
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week1/cms-sw/cmssw/45858/41223/install.sh to create a dev area with all the needed externals and cmssw changes.

Comparison Summary

Summary:

You potentially added 16 lines to the logs
Reco comparison results: 129 differences found in the comparisons
DQMHistoTests: Total files compared: 49
DQMHistoTests: Total histograms compared: 3453156
DQMHistoTests: Total failures: 2562
DQMHistoTests: Total nulls: 0
DQMHistoTests: Total successes: 3450574
DQMHistoTests: Total skipped: 20
DQMHistoTests: Total Missing objects: 0
DQMHistoSizes: Histogram memory added: 0.0 KiB( 48 files compared)
Checked 206 log files, 170 edm output root files, 49 DQM output files
TriggerResults: no differences found

GPU Comparison Summary

Summary:

You potentially removed 20 lines from the logs
Reco comparison results: 0 differences found in the comparisons
DQMHistoTests: Total files compared: 6
DQMHistoTests: Total histograms compared: 37044
DQMHistoTests: Total failures: 23
DQMHistoTests: Total nulls: 0
DQMHistoTests: Total successes: 37021
DQMHistoTests: Total skipped: 0
DQMHistoTests: Total Missing objects: 0
DQMHistoSizes: Histogram memory added: 0.0 KiB( 5 files compared)
Checked 20 log files, 25 edm output root files, 6 DQM output files
TriggerResults: no differences found

jfernan2 · 2024-09-03T08:01:28Z

+1

cmsbuild · 2024-09-03T08:01:56Z

This pull request is fully signed and it will be integrated in one of the next CMSSW_14_0_X IBs (tests are also fine) and once validation in the development release cycle CMSSW_14_2_X is complete. This pull request will now be reviewed by the release team before it's merged. @rappoccio, @antoniovilela, @sextonkennedy, @mandrenguyen (and backports should be raised in the release meeting by the corresponding L2)

antoniovilela · 2024-09-04T14:01:51Z

+1

TrackingRecHitsSoACollection: early return hostData when there aren't…

02d008f

… hits

cmsbuild added this to the CMSSW_14_0_X milestone Sep 2, 2024

cmsbuild added reconstruction-pending pending-signatures tests-pending orp-pending bug-fix backport heterogeneous-pending tracking labels Sep 2, 2024

cmsbuild added tests-started heterogeneous-approved and removed tests-pending heterogeneous-pending labels Sep 2, 2024

cmsbuild added tests-approved and removed tests-started labels Sep 2, 2024

cmsbuild added reconstruction-approved fully-signed and removed reconstruction-pending pending-signatures labels Sep 3, 2024

cmsbuild removed the backport label Sep 3, 2024

cmsbuild added the backport-ok label Sep 3, 2024

cmsbuild added orp-approved and removed orp-pending labels Sep 4, 2024

cmsbuild merged commit a7fc774 into cms-sw:CMSSW_14_0_X Sep 4, 2024
12 checks passed

mmusich deleted the mm_fix_TrackingRecHitsSoACollection_14_0_X branch September 4, 2024 15:29

cmsbuild mentioned this pull request Sep 4, 2024

[14_0_X] Alpaka Pixel: Write nTracks=0 When No Track Stored #45877

Merged

mmusich mentioned this pull request Sep 6, 2024

OneToManyAssoc assertion faillure in HLT menu in CMSSW_14_0_15 #45834

Closed

mandrenguyen mentioned this pull request Sep 6, 2024

Build CMSSW_14_0_15_patch1 #45946

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[14.0.X] `TrackingRecHitsSoACollection`: early return `hostData` in `CopyHost::copyAsync()` when there aren't hits #45858

[14.0.X] `TrackingRecHitsSoACollection`: early return `hostData` in `CopyHost::copyAsync()` when there aren't hits #45858

mmusich commented Sep 2, 2024

mmusich commented Sep 2, 2024

cmsbuild commented Sep 2, 2024 •

edited

Loading

cmsbuild commented Sep 2, 2024 •

edited

Loading

fwyzard commented Sep 2, 2024

fwyzard commented Sep 2, 2024

fwyzard commented Sep 2, 2024

cmsbuild commented Sep 2, 2024

jfernan2 commented Sep 3, 2024

cmsbuild commented Sep 3, 2024

antoniovilela commented Sep 4, 2024

[14.0.X] TrackingRecHitsSoACollection: early return hostData in CopyHost::copyAsync() when there aren't hits #45858

[14.0.X] TrackingRecHitsSoACollection: early return hostData in CopyHost::copyAsync() when there aren't hits #45858

Conversation

mmusich commented Sep 2, 2024

PR description:

PR validation:

If this PR is a backport please specify the original PR and why you need to backport that PR. If this PR will be backported please specify to which release cycle the backport is meant for:

mmusich commented Sep 2, 2024

cmsbuild commented Sep 2, 2024 • edited Loading

cmsbuild commented Sep 2, 2024 • edited Loading

fwyzard commented Sep 2, 2024

fwyzard commented Sep 2, 2024

fwyzard commented Sep 2, 2024

cmsbuild commented Sep 2, 2024

Comparison Summary

GPU Comparison Summary

jfernan2 commented Sep 3, 2024

cmsbuild commented Sep 3, 2024

antoniovilela commented Sep 4, 2024

[14.0.X] `TrackingRecHitsSoACollection`: early return `hostData` in `CopyHost::copyAsync()` when there aren't hits #45858

[14.0.X] `TrackingRecHitsSoACollection`: early return `hostData` in `CopyHost::copyAsync()` when there aren't hits #45858

cmsbuild commented Sep 2, 2024 •

edited

Loading

cmsbuild commented Sep 2, 2024 •

edited

Loading