-
Notifications
You must be signed in to change notification settings - Fork 5
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Fix number of floats per hit #321
Fix number of floats per hit #321
Conversation
Validation summaryReference release CMSSW_10_6_0_pre2 at 1313262
|
@dsperka thanks for the fix. Indeed, this recovers the correct efficiency for the .53 workflow.
|
No observable impact on performance, measured on the 10824.53 workflow with a P100 over TTbar events. development:
testing:
|
PR description:
This PR arises from my attempt to run the Pixel Track reconstruction validation on a local machine. The problem comes from the 10824.53 workflow when running with the --tool memcheck option. In the step3-memcheck.log file there is a fatal crash:
11-Apr-2019 11:43:35 EDT Initiating request to open file file:/cms/data/store/relval/CMSSW_10_4_0_pre2/RelValTTbar_13/GEN-SIM-DIGI-RAW/PU25ns_103X_upgrade2018_realistic_v8-v1/10000/29C415
A1-48E9-8445-A19C-49B84D1505ED.root
11-Apr-2019 11:43:36 EDT Successfully opened file file:/cms/data/store/relval/CMSSW_10_4_0_pre2/RelValTTbar_13/GEN-SIM-DIGI-RAW/PU25ns_103X_upgrade2018_realistic_v8-v1/10000/29C415A1-48E9
-8445-A19C-49B84D1505ED.root
Begin processing the 1st record. Run 1, Event 8503, LumiSection 171 on stream 0 at 11-Apr-2019 11:43:44.504 EDT
%MSG-e TkDetLayers: SeedingLayersEDProducer:pixelTracksSeedLayers 11-Apr-2019 11:43:46 EDT Run: 1 Event: 8503
ForwardDiskSectorBuilderFromDet: Trying to build Petal Wedge from Dets at different z positions !! Delta_z = -0.950417
%MSG
/data/user/fwyzard/patatrack/build/slc7_amd64_gcc700.patatrack106x/tmp/BUILDROOT/4798e156c43b9007e8153f352738cf66/opt/cmssw/slc7_amd64_gcc700/cms/cmssw/CMSSW_10_6_0_pre2_Patatrack/src/Reco
PixelVertexing/PixelTrackFitting/plugins/PixelTrackReconstructionGPU.cu, line 204: cudaErrorLaunchFailure: unspecified launch failure
In the cuda-memcheck.log file there is more information:
========= CUDA-MEMCHECK
========= Invalid global read of size 4
========= at 0x000007b0 in /data/user/fwyzard/patatrack/build/slc7_amd64_gcc700.patatrack106x/tmp/BUILDROOT/4798e156c43b9007e8153f352738cf66/opt/cmssw/slc7_amd64_gcc700/cms/cmssw/CMSSW
10_6_0_pre2_Patatrack/src/RecoPixelVertexing/PixelTrackFitting/plugins/PixelTrackReconstructionGPU.cu:42:KernelFastFitAllHits(float*, int, int, float, Rfit::helix_fit*, Eigen::Matrix<doub
le, int=3, int=4, int=0, int=3, int=4>, Eigen::Matrix<float, int=6, int=4, int=0, int=6, int=4>, Rfit::circle_fit*, Eigen::Matrix<double, int=4, int=1, int=0, int=4, int=1>*, Rfit::line
fit*)
========= by thread (23,0,0) in block (11,0,0)
========= Address 0x7f59ff411940 is out of bounds
This appears to me to be a problem with the number floats assigned per hit. The code assumes 12 but there are only 9, 3 for position and 6 instead of 9 for position errors, presumably because the covariance matrix is assumed to be symmetric.
PR validation:
After this change the 10824.53 workflow when running with the --tool memcheck option runs successfully without any crash. I didn't do any comparison of performance before and after, but would be happy to if I could be pointed to which validation plots are the most relevant.