Skip to content

Commit

Permalink
Merge pull request #88 from lanl/spatter-atomics
Browse files Browse the repository at this point in the history
Spatter atomics
  • Loading branch information
JDTruj2018 authored Mar 15, 2024
2 parents e54b10c + 5779dad commit 88c9d6e
Show file tree
Hide file tree
Showing 6 changed files with 33 additions and 31 deletions.
1 change: 0 additions & 1 deletion .gitmodules
Original file line number Diff line number Diff line change
Expand Up @@ -20,7 +20,6 @@
[submodule "microbenchmarks/spatter"]
path = microbenchmarks/spatter
url = [email protected]:lanl/spatter.git
branch = main
[submodule "miniem_build/spack"]
path = miniem_build/spack
url = [email protected]:spack/spack
Expand Down
7 changes: 5 additions & 2 deletions doc/sphinx/09_Microbenchmarks/M2_SPATTER/SPATTER.rst
Original file line number Diff line number Diff line change
Expand Up @@ -170,6 +170,7 @@ The scripts/scaling.sh script has the following options (a scripts/mpirunscaling
* n: User-defined run name (for saving results)
* c: Core binding (optional, default: off)
* g: Toggle GPU (optional, default: off)
* m: Toggle Atomics (optional, default: off)
* r: Toggle count parameter on pattern with countlist (default: off)
* s: Toggle pattern size limit (optional, default: off for weak scaling, will be overridden to on for strong scaling)
* t: Toggle throughput plot generation (optional, default: off)
Expand Down Expand Up @@ -279,16 +280,18 @@ xRAGE Asteroid Spatter Pattern 9

Throughput experiment for the pattern in patterns/xrage/asteroid/spatter9.json. Results will be found in spatter.strongscaling/H100/xrage/asteroid/spatter9/ and Figures will be found in figures/spatter.strongscaling/H100/xrage/asteroid/spatter9/

Note that we need to enable atomics with the `-m` flag since this is a scatter pattern which overwrites the same location multiple times. Results with and without atomics are included, but the results with atomics enabled is the benchmarked performance number of importance.

.. code-block:: bash
bash scripts/scaling.sh -a xrage -p asteroid -f spatter9 -n H100 -g -s -r -t
bash scripts/scaling.sh -a xrage -p asteroid -f spatter9 -n H100 -g -s -r -t -m
..
.. csv-table:: Spatter Throughput (MB/s) on H100 xRAGE Asteroid Pattern 9
:file: h100_throughput_asteroid_9.csv
:align: center
:widths: 5, 5
:widths: 5, 5, 5
:header-rows: 1

.. figure:: h100_throughput_asteroid_9.png
Expand Down
4 changes: 2 additions & 2 deletions doc/sphinx/09_Microbenchmarks/M2_SPATTER/h100.gp
Original file line number Diff line number Diff line change
Expand Up @@ -6,7 +6,7 @@ set xlabel "Data Transferred (MB)"
set ylabel "Figure of Merit (Total Bandwidth MB/s)"

set xrange [4:10000]
set nokey
set key center bottom

set logscale x 2

Expand All @@ -29,4 +29,4 @@ plot "h100_throughput_asteroid_5.csv" using 1:2 with linespoints linestyle 1

set output "h100_throughput_asteroid_9.png"
set ylabel "Figure of Merit (Total Bandwidth MB/s)"
plot "h100_throughput_asteroid_9.csv" using 1:2 with linespoints linestyle 1
plot "h100_throughput_asteroid_9.csv" using 1:2 with linespoints linestyle 1, "" using 1:3 with linespoints linestyle2
Original file line number Diff line number Diff line change
@@ -1,13 +1,13 @@
Data Transferred (MB),Total Bandwidth (MB/s)
4.194304,704688.15
8.388608,1074360.68
16.777216,1464491.66
33.554432,1555750.72
66.951744,1591058.55
133.903488,1686611.83
267.806976,1757448.15
535.613952,1781199.94
1071.227904,1806381.99
2142.455808,1816133.04
4284.911616,1825467.10
8569.823232,1825579.10
4.194304,728177.77
8.388608,1083239.69
16.777216,1476867.62
33.554432,1730323.39
66.951744,1629471.97
133.903488,1743534.92
267.806976,1821320.63
535.613952,1861425.36
1071.227904,1882041.47
2142.455808,1893056.87
4284.911616,1898720.78
8569.823232,1901308.97
Original file line number Diff line number Diff line change
@@ -1,13 +1,13 @@
Data Transferred (MB),Total Bandwidth (MB/s)
4.194304,598502.3
8.388608,851116.92
16.777216,890132.42
33.554432,910222.19
53.314432,974313.46
106.628864,1026540.96
213.257728,1055982.28
426.515456,1072811.37
853.030912,1082922.36
1706.061824,1084706.97
3412.123648,1088388.35
6824.247296,1090118.62
Data Transferred (MB),Total Bandwidth without Atomics (MB/s),Total Bandwidth with Atomics (MB/s)
4.194304,615361.48,492751.9
8.388608,916587.43,677374.66
16.777216,1044398.45,786038.96
33.554432,1025000.99,741567.16
53.314432,1098994.71,792614.68
106.628864,1167128.51,831375.29
213.257728,1200991.87,853850.6
426.515456,1229803.27,866450.52
853.030912,1239870.47,872005.75
1706.061824,1243804.45,875384.74
3412.123648,1246581.74,876456.83
6824.247296,1248311.13,876824.44
2 changes: 1 addition & 1 deletion microbenchmarks/spatter

0 comments on commit 88c9d6e

Please sign in to comment.