mpiCudaGeneric is a program that transfers data from one machine to another with a GPU, using MPI and CUDA. #37309
Conversation
…then to the GPU using MPI and CUDA.
-code-checks Logs: https://cmssdt.cern.ch/SDT/code-checks/cms-sw-PR-37309/28947
Code check has found code style and quality issues which could be resolved by applying the following patch(es)
@cmsbot, please test
The branch was updated from 7ee8239 to 12a0524.
@cmsbot, please test
+code-checks Logs: https://cmssdt.cern.ch/SDT/code-checks/cms-sw-PR-37309/29163
A new Pull Request was created by @AliinCern (Marafi) for master. It involves the following packages:
@cmsbuild, @makortel, @fwyzard can you please review it and eventually sign? Thanks. cms-bot commands are listed here
@cmsbuild, please test
+1 Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-ef83a2/23658/summary.html Comparison Summary:
+heterogeneous
This pull request is fully signed and it will be integrated in one of the next master IBs (tests are also fine). This pull request will now be reviewed by the release team before it's merged. @perrotta, @dpiparo, @qliphy (and backports should be raised in the release meeting by the corresponding L2)
+1 |
Program description:
In this program, we transfer data from one machine to a GPU on a remote machine, using several different approaches based on MPI and CUDA.
Program Validation:
We have successfully built and tested it with scram b runtests.
Program Mechanism:
There are three approaches to transferring the data, which we call parts (a minimal sketch of them follows this list):
Part 1: Transfer data from Root pageable memory to Host pageable memory using MPI, then allocate memory on the GPU using cudaMalloc. Finally, transfer data from Host pageable memory to the GPU using cudaMemcpy.
Part 2: Transfer data from Root pageable memory to Host pinned memory using MPI and cudaMallocHost, then allocate memory on the GPU using cudaMalloc. Finally, transfer data from Host pinned memory to the GPU using cudaMemcpy.
Part 3: Allocate memory on the GPU using cudaMalloc, then transfer data from Root pageable memory directly to GPU memory using MPI.
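The following is only a rough sketch of these three data paths, not the PR's actual code; it assumes rank 0 is the Root, rank 1 owns the GPU, and the buffer names (N, h_pageable, h_pinned, d_data) are invented for illustration:
```cpp
#include <mpi.h>
#include <cuda_runtime.h>
#include <vector>

int main(int argc, char** argv) {
  MPI_Init(&argc, &argv);
  int rank;
  MPI_Comm_rank(MPI_COMM_WORLD, &rank);

  const int N = 1 << 20;  // vector length (assumed)
  const int tag = 0;

  if (rank == 0) {
    // Root: fill a pageable host buffer and send it to rank 1 over MPI.
    std::vector<float> data(N, 1.f);
    MPI_Send(data.data(), N, MPI_FLOAT, 1, tag, MPI_COMM_WORLD);
  } else if (rank == 1) {
    float* d_data;
    cudaMalloc(&d_data, N * sizeof(float));  // device buffer, used by all three parts

    // Part 1: receive into pageable host memory, then copy to the GPU.
    std::vector<float> h_pageable(N);
    MPI_Recv(h_pageable.data(), N, MPI_FLOAT, 0, tag, MPI_COMM_WORLD, MPI_STATUS_IGNORE);
    cudaMemcpy(d_data, h_pageable.data(), N * sizeof(float), cudaMemcpyHostToDevice);

    // Part 2 (same shape, but with pinned host memory):
    //   float* h_pinned;
    //   cudaMallocHost(&h_pinned, N * sizeof(float));
    //   MPI_Recv(h_pinned, N, MPI_FLOAT, 0, tag, MPI_COMM_WORLD, MPI_STATUS_IGNORE);
    //   cudaMemcpy(d_data, h_pinned, N * sizeof(float), cudaMemcpyHostToDevice);
    //   cudaFreeHost(h_pinned);

    // Part 3 (receive straight into device memory; requires a CUDA-aware MPI build):
    //   MPI_Recv(d_data, N, MPI_FLOAT, 0, tag, MPI_COMM_WORLD, MPI_STATUS_IGNORE);

    cudaFree(d_data);
  }

  MPI_Finalize();
  return 0;
}
```
Note that Part 3 only works when the MPI library is CUDA-aware, i.e. when MPI_Send/MPI_Recv can be handed device pointers directly.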
Program Measurements:
There are seven sections for which we measure the elapsed time.
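As a hedged sketch of how such a section might be timed (the PR's exact instrumentation is not shown here; the buffer names are reused from the sketch above), host-side steps can be bracketed with MPI_Wtime and device-side copies with CUDA events:
```cpp
// Host-side section, e.g. the MPI transfer, timed with MPI_Wtime (seconds).
double t0 = MPI_Wtime();
MPI_Recv(h_pageable.data(), N, MPI_FLOAT, 0, tag, MPI_COMM_WORLD, MPI_STATUS_IGNORE);
double hostElapsed = MPI_Wtime() - t0;

// Device-side section, e.g. the host-to-device copy, timed with CUDA events (milliseconds).
cudaEvent_t start, stop;
cudaEventCreate(&start);
cudaEventCreate(&stop);
cudaEventRecord(start);
cudaMemcpy(d_data, h_pageable.data(), N * sizeof(float), cudaMemcpyHostToDevice);
cudaEventRecord(stop);
cudaEventSynchronize(stop);
float deviceElapsedMs = 0.f;
cudaEventElapsedTime(&deviceElapsedMs, start, stop);
cudaEventDestroy(start);
cudaEventDestroy(stop);
```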
Program command line options are:
[-np] number of processes or processors that you would like to run.
[-s] size of the vectors that you would like to send; the type is float and there are two vectors.
[-t] number of times to repeat the task on the Device (GPU) side.
[-a] number of times to repeat the part.
[-p] choice of which part of the program to run.
[-q] print the Standard Deviation.
[-f] save the results into a file for each part.
[-h] for help.
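As an illustration only (the executable name and this exact invocation are assumptions, not taken from the PR), a run could look like: mpiCudaGeneric -np 2 -s 1048576 -t 10 -a 5 -p 2 -q -f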