-
Notifications
You must be signed in to change notification settings - Fork 184
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[BuildRules] Enable Alpaka/Rocm backend #8301
Conversation
A new Pull Request was created by @smuzaffar (Malik Shahzad Muzaffar) for branch IB/CMSSW_13_0_X/master. @cmsbuild, @smuzaffar, @aandvalenzuela, @iarspider can you please review it and eventually sign? Thanks. |
test parameters:
|
please test |
-1 Failed Tests: ClangBuild The following merge commits were also included on top of IB + this PR after doing git cms-merge-topic:
You can see more details here: Clang BuildI found compilation error while trying to compile with clang. Command used:
>> Entering Package RecoPixelVertexing/PixelVertexFinding >> Entering Package RecoTauTag/HLTProducers >> Entering Package RecoTracker/TkSeedGenerator >> Entering Package FWCore/Version >> Compile sequence completed for CMSSW CMSSW_13_0_X_2023-02-08-1100 gmake: *** [There are compilation/build errors. Please see the detail log above.] Error 1 + eval scram build outputlog '&&' '(python3' /data/cmsbld/jenkins/workspace/ib-run-pr-tests/cms-bot/buildLogAnalyzer.py --logDir /data/cmsbld/jenkins/workspace/ib-run-pr-tests/CMSSW_13_0_X_2023-02-08-1100/tmp/el8_amd64_gcc11/cache/log/src '||' 'true)' ++ scram build outputlog >> Entering Package Alignment/OfflineValidation >> Compiling /data/cmsbld/jenkins/workspace/ib-run-pr-tests/CMSSW_13_0_X_2023-02-08-1100/src/Alignment/OfflineValidation/bin/DMRmerge.cc >> Compiling /data/cmsbld/jenkins/workspace/ib-run-pr-tests/CMSSW_13_0_X_2023-02-08-1100/src/Alignment/OfflineValidation/bin/Options.cc |
please test |
-1 Failed Tests: RelVals RelVals-INPUT AddOn RelVals
RelVals-INPUT
Expand to see more relval errors ...
AddOn Tests
Expand to see more addon errors ... |
Now I am confused :-( The same workflows work for me using a release area created with |
I can reproduce the error if I artificially set |
Ah, it's because the |
Ah, thanks for the fix.
Any idea why it worked for me on a machine without any GPUs, and on a machine where I set explicitly |
The fix is in cms-sw/cmssw#40736
I have no idea. |
please test |
Pull request #8301 was updated. |
Pull request #8301 was updated. |
please test with cms-sw/cmssw#40832 |
please test with cms-sw/cmssw#40832 for el8_ppc64le_gcc11 |
-1 Failed Tests: Build The following merge commits were also included on top of IB + this PR after doing git cms-merge-topic: You can see more details here: BuildI found compilation error when building: >> Cuda Device Link tmp/el8_amd64_gcc11/src/HeterogeneousCore/AlpakaInterface/test/alpakaTestKernelCudaAsync/alpakaTestKernelCudaAsync_cudadlink.o >> Building alpaka/cuda binary alpakaTestKernelCudaAsync Copying tmp/el8_amd64_gcc11/src/HeterogeneousCore/AlpakaInterface/test/alpakaTestKernelCudaAsync/alpakaTestKernelCudaAsync to productstore area: >> Building alpaka/rocm binary alpakaTestKernelROCmAsync /data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/el8_amd64_gcc11/external/gcc/11.2.1-f9b9dfdd886f71cd63f5538223d8f161/bin/../lib/gcc/x86_64-redhat-linux-gnu/11.2.1/../../../../x86_64-redhat-linux-gnu/bin/ld: cannot find tmp/el8_amd64_gcc11/src/HeterogeneousCore/AlpakaInterface/test/alpakaTestKernelROCmAsync/alpaka/testKernel.dev.cc.o: No such file or directory collect2: error: ld returned 1 exit status >> Deleted: tmp/el8_amd64_gcc11/src/HeterogeneousCore/AlpakaInterface/test/alpakaTestKernelROCmAsync/alpakaTestKernelROCmAsync gmake: *** [tmp/el8_amd64_gcc11/src/HeterogeneousCore/AlpakaInterface/test/alpakaTestKernelROCmAsync/alpakaTestKernelROCmAsync] Error 1 >> Compiling alpaka/serial /data/cmsbld/jenkins/workspace/ib-run-pr-tests/CMSSW_13_1_X_2023-02-26-2300/src/HeterogeneousCore/AlpakaInterface/test/alpaka/testKernel.dev.cc >> Building alpaka/serial binary alpakaTestKernelSerialSync Copying tmp/el8_amd64_gcc11/src/HeterogeneousCore/AlpakaInterface/test/alpakaTestKernelSerialSync/alpakaTestKernelSerialSync to productstore area: |
-1 Failed Tests: UnitTests The following merge commits were also included on top of IB + this PR after doing git cms-merge-topic:
You can see more details here: Unit TestsI found errors in the following unit tests: ---> test testFWCoreUtilities had ERRORS ---> test testONNXRuntime had ERRORS |
please test |
@smuzaffar @perrotta @rappoccio, now that 13.1.0-pre1 is out, can we merge this in 13.1.x, and backport it to 13.0.0 ? |
(all test failures were unrelated to this PR, let's see if they go away re-running on a more recent IB) |
please test for el8_ppc64le_gcc11 |
-1 Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-fb8f22/30987/summary.html External BuildI found compilation error when building: FATAL: malformed spec found while quering it. Command: source /data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/el8_amd64_gcc11/rpm-env.sh ; rpm -q --specfile /data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/tmp/tmpspec-coral --info --define "cmsdist_directory /data/cmsbld/jenkins/workspace/ib-run-pr-tests/cmsdist" --define "compilerv 1121" --define "cmscompilerv 11" --define "cmsos el8_amd64" --define "almalinux_ver 8" --define "almalinux 8" --define "centos_ver 8" --define "centos 8" --define "rhel 8" --define "dist .el8" --define "el8 1" --define "package_vectorization %{nil}" --define "cmsswdata_version_link 1" --define 'buildroot /foo' Resulted in: warning: line 30: It's not recommended to have unversioned Obsoletes: Obsoletes: cms+coral+CORAL_2_3_21 error: line 417: Unknown tag: <<<<<<< HEAD error: query of specfile /data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/tmp/tmpspec-coral failed, can't parse Traceback (most recent call last): File "./pkgtools/cmsBuild", line 4610, in build(opts, args[1:], PKGFactory) File "./pkgtools/cmsBuild", line 3875, in build |
-1 Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-fb8f22/30988/summary.html External BuildI found compilation error when building: FATAL: malformed spec found while quering it. Command: source /scratch/cmsbuild/jenkins_a/workspace/ib-run-pr-tests/testBuildDir/el8_ppc64le_gcc11/rpm-env.sh ; rpm -q --specfile /scratch/cmsbuild/jenkins_a/workspace/ib-run-pr-tests/testBuildDir/tmp/tmpspec-coral --info --define "cmsdist_directory /scratch/cmsbuild/jenkins_a/workspace/ib-run-pr-tests/cmsdist" --define "compilerv 1121" --define "cmscompilerv 11" --define "cmsos el8_ppc64le" --define "almalinux_ver 8" --define "almalinux 8" --define "centos_ver 8" --define "centos 8" --define "rhel 8" --define "dist .el8" --define "el8 1" --define "package_vectorization %{nil}" --define "cmsswdata_version_link 1" --define 'buildroot /foo' Resulted in: warning: line 30: It's not recommended to have unversioned Obsoletes: Obsoletes: cms+coral+CORAL_2_3_21 error: line 417: Unknown tag: <<<<<<< HEAD error: query of specfile /scratch/cmsbuild/jenkins_a/workspace/ib-run-pr-tests/testBuildDir/tmp/tmpspec-coral failed, can't parse Traceback (most recent call last): File "./pkgtools/cmsBuild", line 4610, in build(opts, args[1:], PKGFactory) File "./pkgtools/cmsBuild", line 3875, in build |
ehm, what ? |
I presume this
means there is a conflict somewhere... I've opened #8346 with the same diff but a clean commit history. |
problem here is that it is also using a cmssw PR for 13.0.X ( #8301 (comment) ) due to which bot tried to use 13.0.X cmsdist branch ( CMSSW_13_0_X_2023-02-28-1500/el8_ppc64le_gcc11 ) |
No description provided.