-
Notifications
You must be signed in to change notification settings - Fork 4.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Alpaka vs CUDA DQM compare modules for pixel tracks objects #43964
Conversation
cms-bot internal usage |
+code-checks Logs: https://cmssdt.cern.ch/SDT/code-checks/cms-sw-PR-43964/38859
|
A new Pull Request was created by @borzari for master. It involves the following packages:
@mmusich, @syuvivida, @AdrianoDee, @saumyaphor4252, @miquork, @nothingface0, @perrotta, @tjavaid, @antoniovilela, @fabiocos, @cmsbuild, @sunilUIET, @mandrenguyen, @davidlange6, @fwyzard, @Martin-Grunewald, @makortel, @rappoccio, @consuegs, @rvenditti, @jfernan2, @antoniovagnerini, @srimanob, @subirsarkar can you please review it and eventually sign? Thanks. cms-bot commands are listed here |
-hlt |
fragment.hltSiPixelRecHitsSoACompareGPUvsCPU = cms.EDProducer( "SiPixelPhase1CompareRecHitsSoA", | ||
pixelHitsSrcCPU = cms.InputTag( "hltSiPixelRecHitsSoA@cpu" ), | ||
pixelHitsSrcGPU = cms.InputTag( "hltSiPixelRecHitsSoA@cuda" ), | ||
fragment.hltSiPixelRecHitsSoACompareGPUvsCPU = cms.EDProducer( "SiPixelPhase1CompareRecHits", |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
you are not supposed to touch these files directly.
Please provide a suitable customization function in https://github.com/cms-sw/cmssw/blob/master/HLTrigger/Configuration/python/customizeHLTforCMSSW.py that modifies these configurations.
STORM will take care of migrating these files centrally.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks Marco: we're on that!
(there was a miss-communication between me and Breno).
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Hi @mmusich. Yes, this was my mistake. I was just talking to @AdrianoDee about it and he explained that I should create a customization instead of changing fragments directly. I'll remove the last commit and make the customization
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
-1 |
3ea8509
to
4a6bd2e
Compare
Also, it is unclear why DQM comparisons affect the HLT menu; I think it needs to be discussed how many modules we add and execute for this comparison to the HLT menu. Will this also need to run at P5, or only for offline workflows? If the former, it needs to be assessed for timing. If the latter, it definitely does NOT go in the HLT menus in HLTrigger/Configuration, and has to remain a DQM specific customisation routine called in a cmsDriver DQM step and put in some DQM subsystem. |
This is because there's a path for CPUvsGPU checks |
Only one path in the HLT menu is modified? |
Hi @mmusich. I was working on fixing the conflicts and matching the AlpakavsCUDA customizations with the Alpaka ones, and just finished some local tests using wf |
I think it would still be beneficial if this could be discussed in a wider forum (e.g. a Joint TRK DPG / POG meeting). |
@mmusich Do I understand correctly that the issue is related to
Does it mean that the track SoAs can be persisted OK? |
yes.
no, please read carefully #44700 (comment).
the only product we can safely persists are pixel vertices. |
I don't have measurements, but judging from the multiplicities, I think it's a reasonable assumption.
do you care to expand? |
I meant to drop the DQM monitors/modules from the HLT table for parts that can be persisted and run those in the client. |
as mentioned we can do it (only) for the vertices for the time being. If @cms-sw/tracking-pog-l2 prefers to do that, I think we are set to do it, but please do chime in in the ticket https://its.cern.ch/jira/browse/CMSHLT-3147 clearly explaining your preference and motivations. |
I'm guessing that the resource gain will be small using just vertices (unless there's a record that this code is significantly slow). |
@borzari can you clarify on the status / plans for this PR? |
@mmusich I am closing this PR, as most of the changes are not needed anymore. I am going to open a new PR just with changes to the Alpaka DQM modules, and some others that were to be included in this PR (I'll list them in the new PR to make it clear), to be included in CMSSW_14_1_X only. I was making some tests with the changes and had a few issues, that I solved now, but it took a while. I am opening the new PR later today and will tag you to check it |
PR description:
This implements DQM modules for comparing pixel tracks objects reconstructed using Alpaka and CUDA. These modules can be used for any possible comparisons using pixel tracks: CUDA CPU vs CUDA GPU, Alpaka Serial vs Alpaka Device and Alpaka vs CUDA (both Serial/CPU and Device/GPU). A new process modifier chain,
alpakaCUDAValidation
, was added to turn on both reconstructions and the Alpaka vs CUDA DQM modules when needed.A new workflow offset,
.5041
, was added to run DQM with all the possible comparisons using pixel tracks objects. A new customization to run Alpaka vs CUDA comparisons in HLT was also created.These changes will be back ported to CMSSW_14_0_X for the special validation of the Alpaka vs CUDA reconstruction.
PR validation:
The modifications were tested using wfs
12434.403
(Alpaka validation),12434.5041
(Alpaka vs CUDA validation), and12434.503
(CUDA validation). All comparisons follow the expectations.