-
Notifications
You must be signed in to change notification settings - Fork 4.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[14_0_X SIM] ZDC problem in hlt_mc_HIon test #43582
Comments
cms-bot internal usage |
A new Issue was created by @civanch Vladimir Ivantchenko. @Dr15Jones, @smuzaffar, @sextonkennedy, @antoniovilela, @makortel, @rappoccio can you please review it and eventually sign/assign? Thanks. cms-bot commands are listed here |
assign simulation |
New categories assigned: simulation @civanch,@mdhildreth you have been requested to review this Pull request/Issue and eventually sign? Thanks |
@cms-sw/hlt-l2 FYI |
I am currently seeing a similar problem running the HI workflow in latest CMSSW_14_1_X IB: An exception of category 'Conditions not found' occurred while To reproduce the problem I did: cmsrel CMSSW_14_1_X_2024-03-29-1100 cmsDriver.py Configuration/Generator/python/Starlight_DoubleDiffraction_5p36TeV_cfi.py -s LHE,GEN,SIM -n 40 --conditions auto:phase1_2023_realistic_hi --beamspot Realistic2022PbPbCollision --datatier GEN-SIM --eventcontent RAWSIM --era Run3_pp_on_PbPb_2023 --geometry DB:Extended --relval 9000,150 --fileout file:step1.root cmsDriver.py step2 -s DIGI:pdigi_hi_nogen,L1,DIGI2RAW,HLT:@Fake2 --conditions auto:phase1_2023_realistic_hi --datatier GEN-SIM-DIGI-RAW-HLTDEBUG --eventcontent FEVTDEBUGHLT --era Run3_pp_on_PbPb_2023 -n -1 --pileup HiMixNoPU --filein file:step1.root --fileout file:step2.root |
@cms-sw/hcal-dpg-l2 FYI |
This is to explicitly include Sunanda ( @bsunanda ). |
Starting from CMSSW_14_1_X_2024-04-24-2300 the
occurs frequently (but not always) in the step2 of workflows 180.1 and 181.1 (that were added/enabled in that IB). |
Commenting here to flag that experts from HCAL and ZDC side are aware (as flagged above by @abdoulline, @bsunanda) and are still working towards a solution. |
@civanch @abdoulline all. |
In the recent update to ZDC geometry which is now done for 2024, ZDC digitization is open like all other detectors. However, this is true for CMSSW_14_1_X once we update the GT. If we have to do it for 14_0_X we need a lot of backporting which may not be easy.
…________________________________
From: Andrea Perrotta ***@***.***>
Sent: 20 September 2024 12:16
To: cms-sw/cmssw ***@***.***>
Cc: Sunanda Banerjee ***@***.***>; Mention ***@***.***>
Subject: Re: [cms-sw/cmssw] [14_0_X SIM] ZDC problem in hlt_mc_HIon test (Issue #43582)
@civanch<https://github.com/civanch> @abdoulline<https://github.com/abdoulline> all.
The issue is reappearing in the production of the premix samples for the 2024MC campaign, see gitlab<https://gitlab.cern.ch/cms-ppd/event-performance/ep-coordination/-/issues/3#note_8460920>
I read in the initial post of this issue that "Temporary ZDC hits are masked until the problem will be solved": what does it mean? I.e., was it merged a PR that actually masked those "temporary ZDC hits"? And, if so: why such protection is apparently not being effective now?
—
Reply to this email directly, view it on GitHub<#43582 (comment)>, or unsubscribe<https://github.com/notifications/unsubscribe-auth/ABGMZORWMM5BN7RNCPEX26TZXPACRAVCNFSM6AAAAABAXQQUGOVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDGNRSHE2TCNBTG4>.
You are receiving this because you were mentioned.Message ID: ***@***.***>
|
I must admit don't have a clear understanding of what's happening... I wonder if it may (or may not) be related to the absence in 14_0_X of a small fix of HcalZDCDetId #45033 which was submitted (as new ZDC-only related) to 14_1_X... |
Indeed, that fix was never backported in 14_0_X... |
We can make a temporary fix in 14_0_X to let it proceed. Should we try to do that?
…________________________________
From: Andrea Perrotta ***@***.***>
Sent: 20 September 2024 12:32
To: cms-sw/cmssw ***@***.***>
Cc: Sunanda Banerjee ***@***.***>; Mention ***@***.***>
Subject: Re: [cms-sw/cmssw] [14_0_X SIM] ZDC problem in hlt_mc_HIon test (Issue #43582)
I must admit don't have a clear understanding of what's happening... This cell (0x54000140) is illegal...
I wonder if it may (or may not) be related to the absence in 14_0_X of a small fix of HcalZDCDetId #45033<#45033> which was submitted (as new ZDC-only related) to 14_1_X...
Indeed, that fix was never backported in 14_0_X...
We can try running with it.
Your understanding is that if there was not such an illegal cell, the protection should have worked in 14_0_X: do I understand it correctly?
—
Reply to this email directly, view it on GitHub<#43582 (comment)>, or unsubscribe<https://github.com/notifications/unsubscribe-auth/ABGMZOTJD3HXQVGDPTTXUUTZXPB7XAVCNFSM6AAAAABAXQQUGOVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDGNRSHE3TMMBZG4>.
You are receiving this because you were mentioned.Message ID: ***@***.***>
|
@perrotta |
Let us do. It should not affect other detectors. |
I've prepared the backport in #46070 |
@abdoulline , there is era dependent option CMStoZDCtransport = True/False. |
I believe there is something that is percolating not right. The masking in HcalZDCDetId need not be changed. But i do not understand the issue since I did not follow it from the beginning.
…________________________________
From: Vladimir Ivantchenko ***@***.***>
Sent: 20 September 2024 12:50
To: cms-sw/cmssw ***@***.***>
Cc: Sunanda Banerjee ***@***.***>; Mention ***@***.***>
Subject: Re: [cms-sw/cmssw] [14_0_X SIM] ZDC problem in hlt_mc_HIon test (Issue #43582)
@abdoulline<https://github.com/abdoulline> , there is era dependent option CMStoZDCtransport = True/False.
—
Reply to this email directly, view it on GitHub<#43582 (comment)>, or unsubscribe<https://github.com/notifications/unsubscribe-auth/ABGMZOTUOCK3G5H7ZPJER4DZXPEFLAVCNFSM6AAAAABAXQQUGOVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDGNRTGAYDKMZSG4>.
You are receiving this because you were mentioned.Message ID: ***@***.***>
|
OK, I see. But absence of ZDC SimHits does not prevent HcalMCParams to be initialized for all the (valid) ZDC (from Geometry initialization) cells... |
If it is "false" no particle can go throuw the volume CMStoZDC - all are killed. |
May be that ZDCDetId (not fixed in 14_0_X) is not the culprit, as it's related to RPD channels, which should not exist in 14_0_X anyway. I've just dumped and parsed and re-redumped txt input of existing DB conditions (HcalMCParams) with legacy ZDC (22 ch. EM+HAD+LUM) in 14_0_15 and there is no problem, no illegal numbers. In the illegal ZDCDetId no ZDC section is defined... Wild guess - may be some old (12_X/13_X) Digi are involved somehow? - No, at the first glance: |
@bsunanda Begin processing the 33rd record. Run 1, Event 33, LumiSection 1 on stream 0 at 20-Sep-2024 15:19:40.916 CEST [1] cmsDriver.py Configuration/Generator/python/Starlight_DoubleDiffraction_5p36TeV_cfi.py -s LHE,GEN,SIM -n 40 --conditions auto:phase1_2023_realistic_hi --beamspot Realistic2022PbPbCollision --datatier GEN-SIM --eventcontent RAWSIM --era Run3_pp_on_PbPb_2023 --geometry DB:Extended --relval 9000,150 --fileout file:step1.root cmsDriver.py step2 -s DIGI:pdigi_hi_nogen,L1,DIGI2RAW --conditions auto:phase1_2023_realistic_hi --datatier GEN-SIM-DIGI-RAW-HLTDEBUG --eventcontent FEVTDEBUGHLT --era Run3_pp_on_PbPb_2023 -n -1 --pileup HiMixNoPU --filein file:step1.root --fileout file:step2.root |
@bsunanda Caveat: |
@bsunanda we need your insight, I'm afraid, about it... |
this issue is hitting us again bit time for launching the Summer24-24 premix library, and we ought to find a solution to this. I understand that reproducibility is an issue, and we might have to just roll back anything related to ZDC to get out of this, if no solution can be found |
I may be wrong but ZDC is not enabled for pp runs. Do i understand this correctly? Or this was only in past? If my understanding is true, then for pp simulations in 2024 ZDC is not need and also ZDC is not needed for production of premix library. ZDC should be enabled only for HI simulation. When we enable ZDC for pp we get a significant factor slow down simulation (about 3-5 if my memory is correct). This happens for MinBias simulation because high energy hadrons hit ZDC and a full very energetic shower is simulated (without Russian roulette or other short-cut. So, I am not sure if we should backport ZDC software to 14_0. |
In 2023, the ZDC was enabled only in the case of the pp reference run occurring just before the heavy-ion run (which will also be the case in 2024). I am not sure if the reference run is handled differently in MC, but we have no need for ZDC pp simulations at 13.6 TeV as the ZDC was not included in these runs. Hopefully this helps, especially if this is slowing down the simulation. |
@bsunanda some additional recent info/observations : (1) CMSSW_14_0_X_2024-10-05-1100 (the most recent 14_0_X IB) on lxplus8: (2) CMSSW_14_0_X_2024-10-05-1100 + pending PR #46246 (backport of what was recently merged into 14_1_X & 14_2_X): (3) CMSSW_14_0_X_2024-10-05-1100 + commented ZDCDigitizer: |
just to be a bit clearer : 14.0.17 pilot for the premix library is failing with cmsRun1
with Configuration/GenProduction/python/PPD-RunIIISummer24PrePremix-00002-fragment.py with /MinBias_TuneCP5_13p6TeV-pythia8/RunIII2024Summer24GS-140X_mcRun3_2024_realistic_v20-v1/GEN-SIM produced in 14.0.13 using
with Configuration/GenProduction/python/PPD-RunIII2024Summer24GS-00002-fragment.py which is somehow the topic of the issue here, and hence I am saying that whichever ZDC code was aded in 14.0 is interfering and preventing 14.0 to be used for pp simulation. whichever solution you, the experts, will come up with to get this solved is good for us, as long as 14.0 is usable for pp simulation, that we need to launch urgently. |
I guess it's unrealistic to reproduce the issue in a private setup 🤔 I'm running it in 14_0_17, it takes ~3-4 s/ev on lxplus8... |
@abdoulline would you like to pursue your suggestion |
@perrotta |
@perrotta Can I try it out by today to see if the crash can be avoided in step2? I need today tp work on this
…________________________________
From: Salavat Abdullin ***@***.***>
Sent: 07 October 2024 13:25
To: cms-sw/cmssw ***@***.***>
Cc: Sunanda Banerjee ***@***.***>; Mention ***@***.***>
Subject: Re: [cms-sw/cmssw] [14_0_X SIM] ZDC problem in hlt_mc_HIon test (Issue #43582)
@perrotta<https://github.com/perrotta>
Should be able to submit the PR in question around noon...
—
Reply to this email directly, view it on GitHub<#43582 (comment)>, or unsubscribe<https://github.com/notifications/unsubscribe-auth/ABGMZOVIMBU2NJS22C5UCCTZ2I46HAVCNFSM6AAAAABAXQQUGOVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDGOJWGE4DINRWGI>.
You are receiving this because you were mentioned.Message ID: ***@***.***>
|
what one can do, and will likely exhibit the same issue deterministically is to run the DIGI step of the MB, since anything that is failing during the DIGI step of the premix sample is due to the MB content. |
...In the meantime (just in case) ZDC Digitizer removal submitted to 14_0_X #46282 |
Hello, I have some very basic questions, sorry if they were already addressed and I missed it. I will list them below. I think it could also help us understand the overall picture of what is going on.
|
I'd rather have #46282 to be on the safe side for 14.0 usability in a short time. |
Ah sorry, you are correct. The correct link is now available here: #46300 |
In the PR #43576 a production of ZDC hits is enabled, which make a problem for hlt_mc_HIon addOn test. Temporary ZDC hits are masked until the problem will be solved.
The text was updated successfully, but these errors were encountered: