Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Mask old b-taggers and keep only DNN taggers with Era modifier for run3. #39808

Merged
merged 1 commit into from
Dec 9, 2022

Conversation

yuanchao
Copy link
Contributor

@yuanchao yuanchao commented Oct 21, 2022

PR description:

Regarding the AlCa issue #58 (cms-AlCaDB/AlCaTools#58), old b-tagging GT records are request to be removed (starting) from 12_1_X (and so on). Due to the dependency on "BTauGenericMVAJetTagComputerRcd" and "JetTagComputerRecord", the corresponding b-tagging computers need to be masked in run3. As DNN taggers will be the only supported ones in run3, this PR masks the old taggers with Era modifier and keeps only DNN taggers like DeepCSV and particleNet. (track counting and jet probability are kept for reference as requested)

PR validation:

PR tested in local with runTheMatrix.py processes. However, the standard run 2 validation sequences will fail due to missing b-tagger objects.

This PR is for run 3. Not a backport and no backport planed.

@cmsbuild
Copy link
Contributor

+code-checks

Logs: https://cmssdt.cern.ch/SDT/code-checks/cms-sw-PR-39808/32679

  • This PR adds an extra 12KB to repository

@cmsbuild
Copy link
Contributor

A new Pull Request was created by @yuanchao (Yuan CHAO) for master.

It involves the following packages:

  • RecoBTag/Configuration (reconstruction)

@cmsbuild, @mandrenguyen, @clacaputo can you please review it and eventually sign? Thanks.
@AlexDeMoor, @emilbols, @JyothsnaKomaragiri, @AnnikaStein, @missirol, @andrzejnovak, @demuller this is something you requested to watch as well.
@perrotta, @dpiparo, @rappoccio you are the release manager for this.

cms-bot commands are listed here

@mandrenguyen
Copy link
Contributor

please test

@cmsbuild
Copy link
Contributor

-1

Failed Tests: RelVals
Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-3763cc/28420/summary.html
COMMIT: 807ac22
CMSSW: CMSSW_12_6_X_2022-10-20-2300/el8_amd64_gcc10
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week1/cms-sw/cmssw/39808/28420/install.sh to create a dev area with all the needed externals and cmssw changes.

RelVals

----- Begin Fatal Exception 21-Oct-2022 12:04:43 CEST-----------------------
An exception of category 'ProductNotFound' occurred while
   [0] Processing  Event run: 346512 lumi: 250 event: 243048393 stream: 0
   [1] Running path 'dqmoffline_1_step'
   [2] Prefetching for module MiniAODSVAnalyzer/'bTagSVDQM'
   [3] Calling method for module PATJetProducer/'patJetsSVInfo'
Exception Message:
Principal::getByToken: Found zero products matching all criteria
Looking for type: edm::AssociationVector<edm::RefToBaseProd<reco::Jet>,std::vector<float>,edm::RefToBase<reco::Jet>,unsigned int,edm::helper::AssociationIdenticalKeyReference>
Looking for module label: pfJetBProbabilityBJetTags
Looking for productInstanceName: 

   Additional Info:
      [a] If you wish to continue processing events after a ProductNotFound exception,
add "SkipEvent = cms.untracked.vstring('ProductNotFound')" to the "options" PSet in the configuration.

----- End Fatal Exception -------------------------------------------------
----- Begin Fatal Exception 21-Oct-2022 12:08:10 CEST-----------------------
An exception of category 'ProductNotFound' occurred while
   [0] Processing  Event run: 1 lumi: 1 event: 1 stream: 0
   [1] Running path 'dqmoffline_1_step'
   [2] Prefetching for module MuonMiniAOD/'MuonMiniAOD'
   [3] Prefetching for module PATMuonSlimmer/'slimmedMuons'
   [4] Prefetching for module PATMuonSelector/'selectedPatMuons'
   [5] Calling method for module PATMuonProducer/'patMuons'
Exception Message:
Principal::getByToken: Found zero products matching all criteria
Looking for type: edm::AssociationVector<edm::RefToBaseProd<reco::Jet>,std::vector<float>,edm::RefToBase<reco::Jet>,unsigned int,edm::helper::AssociationIdenticalKeyReference>
Looking for module label: pfCombinedInclusiveSecondaryVertexV2BJetTags
Looking for productInstanceName: 

   Additional Info:
      [a] If you wish to continue processing events after a ProductNotFound exception,
add "SkipEvent = cms.untracked.vstring('ProductNotFound')" to the "options" PSet in the configuration.

----- End Fatal Exception -------------------------------------------------
----- Begin Fatal Exception 21-Oct-2022 12:15:13 CEST-----------------------
An exception of category 'ProductNotFound' occurred while
   [0] Processing  Event run: 1 lumi: 1 event: 1 stream: 0
   [1] Running path 'dqmoffline_1_step'
   [2] Prefetching for module MiniAODSVAnalyzer/'bTagSVDQM'
   [3] Calling method for module PATJetProducer/'patJetsSVInfo'
Exception Message:
Principal::getByToken: Found zero products matching all criteria
Looking for type: edm::AssociationVector<edm::RefToBaseProd<reco::Jet>,std::vector<float>,edm::RefToBase<reco::Jet>,unsigned int,edm::helper::AssociationIdenticalKeyReference>
Looking for module label: pfJetBProbabilityBJetTags
Looking for productInstanceName: 

   Additional Info:
      [a] If you wish to continue processing events after a ProductNotFound exception,
add "SkipEvent = cms.untracked.vstring('ProductNotFound')" to the "options" PSet in the configuration.

----- End Fatal Exception -------------------------------------------------
Expand to see more relval errors ...

@AlexDeMoor
Copy link
Contributor

Hi @yuanchao , what about JP and DeepJet ?
Those taggers are still used by the BTV group. DeepJet is still the sota model and will be kept at the beginning of Run3 and JP is used for calibration

@yuanchao
Copy link
Contributor Author

@AlexDeMoor Thank you very much for the comment. Originally I was told that DNN taggers will be the only supported ones in Run 3. Surely I can add back the JP tagger in the era modifier. As to DeepJet, it's not in the original RecoBTag_cff.py configuration file. This PR doesn't touch that.

@yuanchao
Copy link
Contributor Author

@AlexDeMoor One more thing. Does JP depend on "JetTagComputerRecord" GT?

@tvami
Copy link
Contributor

tvami commented Oct 27, 2022

assign alca

@cmsbuild
Copy link
Contributor

New categories assigned: alca

@yuanchao,@francescobrivio,@malbouis,@saumyaphor4252,@tvami,@ChrisMisan you have been requested to review this Pull request/Issue and eventually sign? Thanks

@AlexDeMoor
Copy link
Contributor

Thank you for the information @yuanchao . Kindly ping @soureek and @johnalison . Do you have the answer about the GT ("Does JP depend on "JetTagComputerRecord" GT?")

@soureek
Copy link

soureek commented Nov 3, 2022

@AlexDeMoor One more thing. Does JP depend on "JetTagComputerRecord" GT?

As mentioned by Daniel Bloch in the email thread, we don't require this.

@tvami
Copy link
Contributor

tvami commented Nov 11, 2022

@cmsbuild , please test with #40043

@cmsbuild
Copy link
Contributor

-1

Failed Tests: RelVals
Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-3763cc/28973/summary.html
COMMIT: 807ac22
CMSSW: CMSSW_12_6_X_2022-11-11-1100/el8_amd64_gcc10
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week0/cms-sw/cmssw/39808/28973/install.sh to create a dev area with all the needed externals and cmssw changes.

RelVals

----- Begin Fatal Exception 11-Nov-2022 13:30:15 CET-----------------------
An exception of category 'ProductNotFound' occurred while
   [0] Processing  Event run: 346512 lumi: 250 event: 243048393 stream: 0
   [1] Running path 'dqmoffline_1_step'
   [2] Prefetching for module MiniAODSVAnalyzer/'bTagSVDQM'
   [3] Calling method for module PATJetProducer/'patJetsSVInfo'
Exception Message:
Principal::getByToken: Found zero products matching all criteria
Looking for type: edm::AssociationVector<edm::RefToBaseProd<reco::Jet>,std::vector<float>,edm::RefToBase<reco::Jet>,unsigned int,edm::helper::AssociationIdenticalKeyReference>
Looking for module label: pfJetBProbabilityBJetTags
Looking for productInstanceName: 

   Additional Info:
      [a] If you wish to continue processing events after a ProductNotFound exception,
add "SkipEvent = cms.untracked.vstring('ProductNotFound')" to the "options" PSet in the configuration.

----- End Fatal Exception -------------------------------------------------
----- Begin Fatal Exception 11-Nov-2022 13:33:10 CET-----------------------
An exception of category 'ProductNotFound' occurred while
   [0] Processing  Event run: 1 lumi: 1 event: 1 stream: 0
   [1] Running path 'dqmoffline_1_step'
   [2] Prefetching for module MuonMiniAOD/'MuonMiniAOD'
   [3] Prefetching for module PATMuonSlimmer/'slimmedMuons'
   [4] Prefetching for module PATMuonSelector/'selectedPatMuons'
   [5] Calling method for module PATMuonProducer/'patMuons'
Exception Message:
Principal::getByToken: Found zero products matching all criteria
Looking for type: edm::AssociationVector<edm::RefToBaseProd<reco::Jet>,std::vector<float>,edm::RefToBase<reco::Jet>,unsigned int,edm::helper::AssociationIdenticalKeyReference>
Looking for module label: pfCombinedInclusiveSecondaryVertexV2BJetTags
Looking for productInstanceName: 

   Additional Info:
      [a] If you wish to continue processing events after a ProductNotFound exception,
add "SkipEvent = cms.untracked.vstring('ProductNotFound')" to the "options" PSet in the configuration.

----- End Fatal Exception -------------------------------------------------
----- Begin Fatal Exception 11-Nov-2022 13:39:45 CET-----------------------
An exception of category 'ProductNotFound' occurred while
   [0] Processing  Event run: 1 lumi: 1 event: 1 stream: 0
   [1] Running path 'dqmoffline_1_step'
   [2] Prefetching for module MiniAODSVAnalyzer/'bTagSVDQM'
   [3] Calling method for module PATJetProducer/'patJetsSVInfo'
Exception Message:
Principal::getByToken: Found zero products matching all criteria
Looking for type: edm::AssociationVector<edm::RefToBaseProd<reco::Jet>,std::vector<float>,edm::RefToBase<reco::Jet>,unsigned int,edm::helper::AssociationIdenticalKeyReference>
Looking for module label: pfJetBProbabilityBJetTags
Looking for productInstanceName: 

   Additional Info:
      [a] If you wish to continue processing events after a ProductNotFound exception,
add "SkipEvent = cms.untracked.vstring('ProductNotFound')" to the "options" PSet in the configuration.

----- End Fatal Exception -------------------------------------------------
Expand to see more relval errors ...

@cmsbuild
Copy link
Contributor

+1

Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-3763cc/29114/summary.html
COMMIT: 8764b76
CMSSW: CMSSW_12_6_X_2022-11-18-1100/el8_amd64_gcc10
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week1/cms-sw/cmssw/39808/29114/install.sh to create a dev area with all the needed externals and cmssw changes.

Comparison Summary

Summary:

  • No significant changes to the logs found
  • ROOTFileChecks: Some differences in event products or their sizes found
  • Reco comparison results: 1108 differences found in the comparisons
  • DQMHistoTests: Total files compared: 48
  • DQMHistoTests: Total histograms compared: 3417167
  • DQMHistoTests: Total failures: 4278
  • DQMHistoTests: Total nulls: 0
  • DQMHistoTests: Total successes: 3412867
  • DQMHistoTests: Total skipped: 22
  • DQMHistoTests: Total Missing objects: 0
  • DQMHistoSizes: Histogram memory added: 0.0 KiB( 47 files compared)
  • Checked 206 log files, 48 edm output root files, 48 DQM output files
  • TriggerResults: no differences found

@tvami
Copy link
Contributor

tvami commented Nov 18, 2022

+alca

  • changes are in BTV related folders which is expected

@AlexDeMoor
Copy link
Contributor

After looking at the last test (https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-3763cc/29114/summary.html), it seems we have a lot of failings coming from the BTV HLT. I kindly ping @johnalison , @NiclasEich and @marco-link for their feedback on this part 👀

@cmsbuild cmsbuild modified the milestones: CMSSW_12_6_X, CMSSW_13_0_X Nov 24, 2022
@clacaputo
Copy link
Contributor

please test

refreshing the results

@cmsbuild
Copy link
Contributor

cmsbuild commented Dec 2, 2022

+1

Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-3763cc/29418/summary.html
COMMIT: 8764b76
CMSSW: CMSSW_13_0_X_2022-12-01-1100/el8_amd64_gcc11
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week1/cms-sw/cmssw/39808/29418/install.sh to create a dev area with all the needed externals and cmssw changes.

Comparison Summary

Summary:

  • No significant changes to the logs found
  • ROOTFileChecks: Some differences in event products or their sizes found
  • Reco comparison results: 1274 differences found in the comparisons
  • DQMHistoTests: Total files compared: 48
  • DQMHistoTests: Total histograms compared: 3421159
  • DQMHistoTests: Total failures: 3935
  • DQMHistoTests: Total nulls: 0
  • DQMHistoTests: Total successes: 3417202
  • DQMHistoTests: Total skipped: 22
  • DQMHistoTests: Total Missing objects: 0
  • DQMHistoSizes: Histogram memory added: 0.0 KiB( 47 files compared)
  • Checked 206 log files, 158 edm output root files, 48 DQM output files
  • TriggerResults: no differences found

@mandrenguyen
Copy link
Contributor

After looking at the last test (https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-3763cc/29114/summary.html), it seems we have a lot of failings coming from the BTV HLT. I kindly ping @johnalison , @NiclasEich and @marco-link for their feedback on this part 👀

Hello, are we still waiting for feedback from BTV HLT? @cms-sw/btv-pog-l2

@tvami
Copy link
Contributor

tvami commented Dec 8, 2022

hi @mandrenguyen I had a private communication with Soureek et al, and they said:
"We are discussing a bit with the developers how to simplify/update the access of the DNN tagger information within CMSSW. However, I'm sorry to say that this discussion will need a bit more time to come to a final conclusion leaving no loose ends. We expect to have a concrete plan by end-January, 2023. Hope this works with you."

I'd say we should let this PR in, and then let BTV remove the remaining dependencies for the HLT folder, what do you think?

@mandrenguyen
Copy link
Contributor

+reconstruction

@cmsbuild
Copy link
Contributor

cmsbuild commented Dec 8, 2022

This pull request is fully signed and it will be integrated in one of the next master IBs (tests are also fine). This pull request will now be reviewed by the release team before it's merged. @perrotta, @dpiparo, @rappoccio (and backports should be raised in the release meeting by the corresponding L2)

@rappoccio
Copy link
Contributor

+1

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

8 participants