First version of DeepMET #29764

steggema · 2020-05-07T16:24:16Z

PR description:

This PR introduces a producer for DeepMET, a deep-learning-based missing pT estimator. The producer creates a new MET collection, and a test configuration is included. The plan is to create a separate PR for possible inclusion in central sequences, e.g. for the upcoming ReMiniAOD campaign.

The tensorflow models for this PR are proposed in a separate PR to cms-data/RecoMET-METPUSubtraction cms-data/RecoMET-METPUSubtraction#5

There are different trainings for different years/conditions (2016, 2018, phase 2), and for 2018 and phase 2 also non-response-corrected trainings.

Presentations:
https://indico.cern.ch/event/912067/contributions/3835851/ (most recent update)
https://indico.cern.ch/event/883809/contributions/3733818/ (CMS week JetMET meeting)
https://indico.cern.ch/event/854654/contributions/3594579/ (first presentation in MET meeting)

Note that an alternative implementation would be to store additional weights for each PFCandidate and calculate MET and possibly jets in a subsequent step. However, we prefer to leave this option to future studies given that we have not checked the performance on jets (and assume some non-trivial effects) and that this would lead to an increase in complexity of the integration and the additional storage required, in particular if we want to have different METs (e.g. response and non-response-corrected) for a single campaigns.

PR validation:

The code has been validated by running it in large-scale checks with simulated and data events to evaluate the performance of the algorithm. A test configuration is included. We have not run any memory or timing tests but we suspect that it runs fast and consumes little memory given the models are small compared to most other tensorflow models.

CPU and memory reports can be found under the following links, obtained on 1000 events from the 136.8311_RunJetHT2017F workflow:
https://steggema.web.cern.ch/steggema/cgi-bin/igprof-navigator/deepmet_cpu_reminiaod
https://steggema.web.cern.ch/steggema/cgi-bin/igprof-navigator/deepmet_mem_reminiaod (note that DeepMET does not seem to appear here, supposedly because it's not in the top 1000, see the text file linked below)

Text dumps are also available here: /afs/cern.ch/user/s/steggema/public/DeepMETIntegration/

if this PR is a backport please specify the original PR and why you need to backport that PR:

@intrepid42 @yongbinfeng

cmsbuild · 2020-05-07T16:24:39Z

The code-checks are being triggered in jenkins.

cmsbuild · 2020-05-07T16:32:26Z

-code-checks

Logs: https://cmssdt.cern.ch/SDT/code-checks/cms-sw-PR-29764/15211

This PR adds an extra 12KB to repository

Code check has found code style and quality issues which could be resolved by applying following patch(s)

code-format:
https://cmssdt.cern.ch/SDT/code-checks/cms-sw-PR-29764/15211/code-format.patch
e.g. curl https://cmssdt.cern.ch/SDT/code-checks/cms-sw-PR-29764/15211/code-format.patch | patch -p1
You can also run scram build code-format to apply code format directly

slava77 · 2020-05-07T16:49:48Z

please provide some estimates of CPU time and memory use.

Other than just enabling the code in mini/nanoAOD, what else needs to happen (more developments, retraining)?
If nothing else, I think it would be better to add this now at least to the miniAOD step.

steggema · 2020-05-08T12:53:15Z

please provide some estimates of CPU time and memory use.

Will be added.

Other than just enabling the code in mini/nanoAOD, what else needs to happen (more developments, retraining)?
If nothing else, I think it would be better to add this now at least to the miniAOD step.

We don't need and foresee any additional developments or training at the moment (except for uncertainty estimates, which are however a completely different matter both content- and code-wise), so we'll add this to the MiniAOD step if you prefer to have this included in this PR.

Given other commitments, I expect an update by sometime next week.

slava77 · 2020-05-08T15:08:02Z

Given other commitments, I expect an update by sometime next week.

OK
Please note

Code check has found code style and quality issues which could be resolved by applying following patch(s)

the PR tests can not proceed without this addressed

cmsbuild · 2020-05-11T13:28:06Z

The tests are being triggered in jenkins.
Tested with other pull request(s) cms-data/RecoMET-METPUSubtraction#5
https://cmssdt.cern.ch/jenkins/job/ib-run-pr-tests/6220/console Started: 2020/05/11 15:31

cmsbuild · 2020-05-11T15:02:41Z

+1
Tested at: fc76ca5
https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-d3fcda/6220/summary.html
CMSSW: CMSSW_11_1_X_2020-05-11-1100
SCRAM_ARCH: slc7_amd64_gcc820

cmsbuild · 2020-05-11T15:02:45Z

Comparison job queued.

cmsbuild · 2020-05-18T15:50:43Z

+1
Tested at: 74e15da
https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-f1bf30/6391/summary.html
CMSSW: CMSSW_11_1_X_2020-05-18-1100
SCRAM_ARCH: slc7_amd64_gcc820

cmsbuild · 2020-05-18T15:50:49Z

Comparison job queued.

cmsbuild · 2020-05-18T17:19:44Z

Comparison is ready
https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-f1bf30/6391/summary.html

Comparison Summary:

No significant changes to the logs found
Reco comparison results: 50 differences found in the comparisons
DQMHistoTests: Total files compared: 35
DQMHistoTests: Total histograms compared: 2694466
DQMHistoTests: Total failures: 3
DQMHistoTests: Total nulls: 0
DQMHistoTests: Total successes: 2694414
DQMHistoTests: Total skipped: 49
DQMHistoTests: Total Missing objects: 0
DQMHistoSizes: Histogram memory added: 0.0 KiB( 34 files compared)
Checked 150 log files, 16 edm output root files, 35 DQM output files

slava77 · 2020-05-18T17:49:27Z

Still, considering that the implementation really does not return anything more than px,py, would it be more practical to insert the DeepMETs in slimmedMETsPuppi or even slimmedMETs in the same way how Calo and CHS METs are done?

I had in mind this

cmssw/PhysicsTools/PatAlgos/plugins/PATMETSlimmer.cc

Lines 80 to 82 in 424ad43

maybeReadShifts(iConfig, "caloMET", pat::MET::Calo);

maybeReadShifts(iConfig, "chsMET", pat::MET::Chs);

maybeReadShifts(iConfig, "trkMET", pat::MET::Trk);

I've updated using this solution. Note that the PATMETSlimmer is apparently also always re-run in NanoAOD, so I switched adding the DeepMET variants off by default and only add them now in regular MiniAOD processing after a quick exchange with @peruzzim . Adding the DeepMET px and py values to NanoAOD will require a bit of fiddling with the according sequence in Nano.

@ahinzmann @lathomas please clarify if this works OK for JME.
Thank you.

ahinzmann · 2020-05-18T20:29:20Z

Still, considering that the implementation really does not return anything more than px,py, would it be more practical to insert the DeepMETs in slimmedMETsPuppi or even slimmedMETs in the same way how Calo and CHS METs are done?

I had in mind this

cmssw/PhysicsTools/PatAlgos/plugins/PATMETSlimmer.cc

Lines 80 to 82 in 424ad43

maybeReadShifts(iConfig, "caloMET", pat::MET::Calo);

maybeReadShifts(iConfig, "chsMET", pat::MET::Chs);

maybeReadShifts(iConfig, "trkMET", pat::MET::Trk);

I've updated using this solution. Note that the PATMETSlimmer is apparently also always re-run in NanoAOD, so I switched adding the DeepMET variants off by default and only add them now in regular MiniAOD processing after a quick exchange with @peruzzim . Adding the DeepMET px and py values to NanoAOD will require a bit of fiddling with the according sequence in Nano.

@ahinzmann @lathomas please clarify if this works OK for JME.
Thank you.

Yes, that's fine. As long as the fiddling with the sequence for JME-extended-Nano is doable based on this PR for MiniAOD that's fine.

slava77 · 2020-05-18T20:56:09Z

+1

for #29764 74e15da

code changes are in line with the PR description and the follow up review
jenkins tests pass and comparisons with the baseline show differences only in the miniAOD workflows in the "corrections" of the slimmedMETs and slimmedMETsPuppi (similar to the way of storing trk, chs, and calo METs)

silviodonato · 2020-05-19T17:14:49Z

merge
@santocch

santocch · 2020-05-20T07:18:02Z

+1

cmsbuild · 2020-05-20T07:18:31Z

This pull request is fully signed and it will be integrated in one of the next master IBs (tests are also fine). This pull request will be automatically merged.

Backport DeepMET into CMSSW_10_6_X (original: #29764)

First version of DeepMET

fc76ca5

steggema mentioned this pull request May 7, 2020

Add DeepMET model files cms-data/RecoMET-METPUSubtraction#5

Merged

cmsbuild added this to the CMSSW_11_1_X milestone May 7, 2020

cmsbuild added code-checks-pending comparison-pending orp-pending pending-signatures reconstruction-pending tests-pending labels May 7, 2020

cmsbuild added code-checks-rejected and removed code-checks-pending labels May 7, 2020

cmsbuild added requires-external tests-started and removed tests-pending labels May 11, 2020

Code format

d1e3bd5

cmsbuild added tests-approved code-checks-pending tests-pending and removed tests-started code-checks-rejected requires-external tests-approved labels May 11, 2020

cmsbuild added the tests-started label May 18, 2020

cmsbuild added tests-approved and removed tests-started labels May 18, 2020

cmsbuild added comparison-available and removed comparison-pending labels May 18, 2020

cmsbuild added reconstruction-approved and removed reconstruction-pending labels May 18, 2020

cmsbuild added orp-approved and removed orp-pending labels May 19, 2020

cmsbuild merged commit 31315ee into cms-sw:master May 19, 2020

cmsbuild mentioned this pull request May 19, 2020

Added esConsumes to SiPixelRecHitConverter #29914

Merged

cmsbuild added analysis-approved fully-signed and removed analysis-pending pending-signatures labels May 20, 2020

fojensen mentioned this pull request Jun 14, 2020

Phase 2 Tau Isolation MVA cms-tau-pog/cmssw#139

Merged

This was referenced Jun 17, 2020

Add DeepMET into NanoAOD cms-nanoAOD/cmssw#525

Closed

Add DeepMET into NanoAOD #30291

Merged

This was referenced Jul 9, 2020

update RecoMET-METPUSubtraction data for backport DeepMET into CMSSW_10_6_X cms-sw/cmsdist#6054

Merged

Backport DeepMET into CMSSW_10_6_X (original: #29764) #30612

Merged

cmsbuild added a commit that referenced this pull request Jul 16, 2020

Merge pull request #30612 from yongbinfeng/DeepMETIntegration

263352f

Backport DeepMET into CMSSW_10_6_X (original: #29764)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

First version of DeepMET #29764

First version of DeepMET #29764

steggema commented May 7, 2020 •

edited

Loading

cmsbuild commented May 7, 2020

cmsbuild commented May 7, 2020

slava77 commented May 7, 2020

steggema commented May 8, 2020

slava77 commented May 8, 2020

cmsbuild commented May 11, 2020 •

edited

Loading

cmsbuild commented May 11, 2020

cmsbuild commented May 11, 2020

cmsbuild commented May 18, 2020

cmsbuild commented May 18, 2020

cmsbuild commented May 18, 2020

slava77 commented May 18, 2020

ahinzmann commented May 18, 2020

slava77 commented May 18, 2020

silviodonato commented May 19, 2020

santocch commented May 20, 2020

cmsbuild commented May 20, 2020

First version of DeepMET #29764

First version of DeepMET #29764

Conversation

steggema commented May 7, 2020 • edited Loading

PR description:

PR validation:

if this PR is a backport please specify the original PR and why you need to backport that PR:

cmsbuild commented May 7, 2020

cmsbuild commented May 7, 2020

slava77 commented May 7, 2020

steggema commented May 8, 2020

slava77 commented May 8, 2020

cmsbuild commented May 11, 2020 • edited Loading

cmsbuild commented May 11, 2020

cmsbuild commented May 11, 2020

cmsbuild commented May 18, 2020

cmsbuild commented May 18, 2020

cmsbuild commented May 18, 2020

slava77 commented May 18, 2020

ahinzmann commented May 18, 2020

slava77 commented May 18, 2020

silviodonato commented May 19, 2020

santocch commented May 20, 2020

cmsbuild commented May 20, 2020

steggema commented May 7, 2020 •

edited

Loading

cmsbuild commented May 11, 2020 •

edited

Loading