Fix sorting index for lost tracks in UnifiedParticleTransformer producer #45689
Conversation
+code-checks Logs: https://cmssdt.cern.ch/SDT/code-checks/cms-sw-PR-45689/41251
A new Pull Request was created by @stahlleiton for master. It involves the following packages:
@cmsbuild, @jfernan2, @mandrenguyen can you please review it and eventually sign? Thanks. cms-bot commands are listed here
type bug-fix
please test
Since the unified particle transformer model has already been trained for 2024 pp, I added a commit introducing a flag to enable the bug fix. The flag is set to false by default to avoid affecting central production.
+code-checks Logs: https://cmssdt.cern.ch/SDT/code-checks/cms-sw-PR-45689/41255
Pull request #45689 was updated. @cmsbuild, @jfernan2, @mandrenguyen can you please check and sign again.
please test
+1 Size: This PR adds an extra 20KB to the repository.
Comparison Summary:
Would it be possible to have documentation regarding this bugfix? More specifically, it would be very useful to understand this issue and its impact. Also, the same ordering is used for the cpf and npf candidates by most of the b-taggers since DeepJet.
The bug is not present for the charged and neutral PF candidates: in those two cases, the second for loop (where the sorted index lists c_sortedindices and n_sortedindices are used) iterates over the same collection (the jet constituents) as the loop that filled the sorted vector, so the indices are consistent. For lost tracks, however, the second for loop runs over a different collection (the sorted vector) than the one used to fill the sorted index list (the lost-track collection), so a wrong index is used to access lt_sortedindices.

The main consequence of using the wrong index is that the lookup returns either the default value (index 0) or a wrong value. Lost tracks are therefore either assigned to entry 0 of the features vector multiple times (so fewer lost tracks are stored in the features vector than there are elements in the sorted vector) or placed at a position in the features vector where they should not be. Since, as far as I could see, the lost tracks are only used in the unified particle transformer model, the other models are not affected by this issue.
@AlexDeMoor Are you satisfied with this explanation/solution?
+1 |
This pull request is fully signed and it will be integrated in one of the next master IBs (tests are also fine). This pull request will be automatically merged. |
We have followed up offline on some details of this bugfix and its potential impact. For now, everything is ok on the tagger side and we will evaluate further later. This bugfix does not affect UParT for the 2024 data, giving us time to ensure the lost tracks are fully correct.
type btv
PR description:
This PR fixes an issue found in the UnifiedParticleTransformer producer related to the index used to access the sorting list of lost tracks. The indices of the lt_sortedindices vector correspond to the indices of the original collection (i.e. the LTs collection); however, in the second for loop, lt_sortedindices is accessed using the index of the lt_sorted vector, which has been trimmed and is smaller than the LTs collection. To recover the correct index into lt_sortedindices, one should use the "get()" function of the lt_sorted elements, which returns the index in the LTs collection.
@SWuchterl @DickyChant
PR validation:
Tested locally by comparing the features produced by the CMSSW producer to those used by the b-hive inference.
If this PR is a backport please specify the original PR and why you need to backport that PR. If this PR will be backported please specify to which release cycle the backport is meant for: