[riscv] Update pytorch to version 2.4.0 #9293
please test |
A new Pull Request was created by @iarspider for branch IB/CMSSW_14_1_X/master. @aandvalenzuela, @iarspider, @smuzaffar can you please review it and eventually sign? Thanks. |
cms-bot internal usage |
-1
Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-fd80f0/40328/summary.html
External Build
I found compilation error when building:
-- Added CUDA NVCC flags for: -gencode;arch=compute_50,code=sm_50;-gencode;arch=compute_80,code=sm_80;-gencode;arch=compute_86,code=sm_86;-gencode;arch=compute_89,code=sm_89;-gencode;arch=compute_90,code=sm_90
-- Found Torch: /data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/el8_amd64_gcc12/external/pytorch/2.3.1-6771a7c6e591586d225837d0ec8eb1c4/lib/libtorch.so
-- Configuring incomplete, errors occurred!
error: Bad exit status from /data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/tmp/rpm-tmp.BeUGR9 (%build)
RPM build errors:
line 37: It's not recommended to have unversioned Obsoletes: Obsoletes: external+pytorch-scatter+2.1.2-259c490acc9d0c047893f5c74b3cb7a4
Bad exit status from /data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/tmp/rpm-tmp.BeUGR9 (%build)
|
please test |
Pull request #9293 was updated. |
@iarspider, please check the pytorch logs; it looks like ROCm was not enabled properly. Either check the package documentation for what is required to enable it, or go through the cmake configuration ... maybe we are missing some environment variables pointing to the ROCm distribution?
|
-1
Failed Tests: Build
Build
I found compilation error when building:
from /data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/el8_amd64_gcc12/external/pytorch/2.3.1-6771a7c6e591586d225837d0ec8eb1c4/include/torch/csrc/autograd/autograd.h:3,
from /data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/el8_amd64_gcc12/external/pytorch/2.3.1-6771a7c6e591586d225837d0ec8eb1c4/include/torch/csrc/api/include/torch/autograd.h:3,
from /data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/el8_amd64_gcc12/external/pytorch/2.3.1-6771a7c6e591586d225837d0ec8eb1c4/include/torch/csrc/api/include/torch/all.h:7,
from /data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/el8_amd64_gcc12/external/pytorch/2.3.1-6771a7c6e591586d225837d0ec8eb1c4/include/torch/csrc/api/include/torch/torch.h:3,
from src/PhysicsTools/PythonAnalysis/test/testTorch.cc:2:
/data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/el8_amd64_gcc12/external/pytorch/2.3.1-6771a7c6e591586d225837d0ec8eb1c4/include/c10/util/typeid.h:311:1: error: missing braces around initializer for 'std::__array_traits::_Type' {aka 'unsigned char [38]'} [-Werror=missing-braces]
311 | };
    | ^
In file included from /data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/el8_amd64_gcc12/external/pytorch/2.3.1-6771a7c6e591586d225837d0ec8eb1c4/include/ATen/core/Dict.h:8,
from /data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/el8_amd64_gcc12/external/pytorch/2.3.1-6771a7c6e591586d225837d0ec8eb1c4/include/ATen/core/ivalue_inl.h:8,
from /data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/el8_amd64_gcc12/external/pytorch/2.3.1-6771a7c6e591586d225837d0ec8eb1c4/include/ATen/core/ivalue.h:1555,
|
Looks like you really can't use both CUDA and ROCm at the same time: https://github.com/pytorch/pytorch/blob/v2.3.1/aten/CMakeLists.txt#L71-L73 . |
Ah, OK. By the way, this build has both CUDA and ROCm set to ON, so any idea why cmake did not fail with this error?
Something must have disabled ROCm before cmake reached that check. |
Yes - it didn't find our installation of ROCm (there are at least two environment variables that need to be set) |
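The behaviour described in the last few comments can be sketched in a few lines. This is a toy Python model of the configure logic as explained in this thread, not PyTorch's actual CMake code: `BuildOptions`, `configure`, `cuda_found` and `rocm_found` are hypothetical names, and the assumption that a requested backend which cannot be located is silently switched off (rather than reported as an error) is taken from the explanation above.

```python
from dataclasses import dataclass


@dataclass
class BuildOptions:
    """Hypothetical stand-in for the USE_CUDA / USE_ROCM style switches."""
    use_cuda: bool
    use_rocm: bool


def configure(requested: BuildOptions, cuda_found: bool, rocm_found: bool) -> BuildOptions:
    """Return the effective options, or raise if both backends end up enabled.

    Assumption (from the thread): a backend that was requested but whose
    installation is not found gets disabled before the exclusivity check runs.
    """
    effective = BuildOptions(
        use_cuda=requested.use_cuda and cuda_found,
        use_rocm=requested.use_rocm and rocm_found,
    )
    # The mutual-exclusion check only sees the effective values.
    if effective.use_cuda and effective.use_rocm:
        raise RuntimeError("CUDA and ROCm cannot both be enabled")
    return effective
```

With both backends requested but ROCm not found, `configure(BuildOptions(True, True), cuda_found=True, rocm_found=False)` returns a CUDA-only configuration without raising, which matches what was observed here. Once the ROCm installation is discoverable, the same request raises instead.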
please test |
Pull request #9293 was updated. |
@iarspider, can you please open a separate PR to build pytorch with ROCm (i.e. disable CUDA and enable ROCm)? |
@smuzaffar will do. |
-1
Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-fd80f0/40595/summary.html
External Build
I found compilation error when building:
patching file caffe2/CMakeLists.txt
Hunk #1 succeeded at 1413 (offset 50 lines).
patching file cmake/Dependencies.cmake
Hunk #1 succeeded at 1528 (offset -304 lines).
Hunk #2 succeeded at 1543 (offset -304 lines).
error: Bad exit status from /data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/tmp/rpm-tmp.wzveUd (%prep)
RPM build errors:
line 37: It's not recommended to have unversioned Obsoletes: Obsoletes: external+pytorch_x86-64-v3+2.4.0-ed778d12fab7788e92d195fd7dd9a2a4
Bad exit status from /data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/tmp/rpm-tmp.wzveUd (%prep)
|
Pull request #9293 was updated. |
test parameters:
|
please test |
please test for el9_amd64_gcc12 |
please test for el8_aarch64_gcc12 |
please test for el9_amd64_gcc13 |
-1
Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-fd80f0/40606/summary.html
External Build
I found compilation error when building:
Patch #3 (pytorch-cuda-12_4):
+ patch --no-backup-if-mismatch -f -p1 --fuzz=0
patching file aten/src/ATen/core/boxing/impl/boxing.h
Hunk #1 FAILED at 38.
1 out of 1 hunk FAILED -- saving rejects to file aten/src/ATen/core/boxing/impl/boxing.h.rej
error: Bad exit status from /data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/tmp/rpm-tmp.AUekCg (%prep)
RPM build warnings:
line 37: It's not recommended to have unversioned Obsoletes: Obsoletes: external+pytorch+2.4.0-35478613216697e41ba7cd6a3fbcfa71
RPM build errors:
|
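For readers comparing this log with the earlier one ("succeeded at 1413 (offset 50 lines)" versus "Hunk #1 FAILED at 38"), the difference can be illustrated with a small sketch. With `--fuzz=0`, a hunk may still be applied at a shifted position, but every line of the old text it expects must match exactly. The Python below is a simplified model of that search, not the real `patch` implementation; `locate_hunk` and its parameters are hypothetical names, and the search order is an assumption.

```python
from typing import Optional, Sequence


def locate_hunk(target: Sequence[str], old_block: Sequence[str], expected_line: int) -> Optional[int]:
    """Return the line offset at which old_block matches target exactly, or None.

    expected_line is 1-based, as in a unified diff hunk header. No fuzz is
    modelled: every line of old_block must match, only the position may shift.
    """
    n = len(old_block)
    start = expected_line - 1
    for shift in range(len(target) + 1):
        # Try the expected position first, then move outwards in both directions.
        for offset in ((0,) if shift == 0 else (shift, -shift)):
            pos = start + offset
            if pos >= 0 and pos + n <= len(target) and list(target[pos:pos + n]) == list(old_block):
                return offset
    return None
```

In this model, a result such as `2` corresponds to a "succeeded ... (offset 2 lines)" message, while `None` corresponds to a failed hunk. That is the expected outcome when the lines a patch was written against have changed in the new upstream version.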
-1
Failed Tests: UnitTests
Unit Tests
I found 1 errors in the following unit tests:
---> test TestIOPoolInputNoParentDictionary had ERRORS
|
+externals looks good |
This pull request is fully signed and it will be integrated in one of the next IB/CMSSW_14_1_X/master IBs after it passes the integration tests. This pull request will now be reviewed by the release team before it's merged. @mandrenguyen, @rappoccio, @sextonkennedy, @antoniovilela (and backports should be raised in the release meeting by the corresponding L2) |
-1
Failed Tests: UnitTests
Unit Tests
I found 1 errors in the following unit tests:
---> test TestIOPoolInputNoParentDictionary had ERRORS
Comparison Summary
Summary:
|
-1
Failed Tests: UnitTests
Unit Tests
I found 1 errors in the following unit tests:
---> test TestIOPoolInputNoParentDictionary had ERRORS
Comparison Summary
Summary:
|
I will remove unused patches later, once tests pass.