Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

FYI: Here are some benchmarks on my laptop #2

Open
magland opened this issue Mar 11, 2020 · 0 comments
Open

FYI: Here are some benchmarks on my laptop #2

magland opened this issue Mar 11, 2020 · 0 comments

Comments

@magland
Copy link

magland commented Mar 11, 2020

This is not an issue, just a report for your info. Below is the result of running the run_all_tests.sh script 4 times on my laptop -- Dell Precision 7530.

(This excludes MATLAB)

(weirdly if I run fortran test on its own, rather than in the context of the script, I generally get higher rate - ranging between 1.7 and 2 Gpair/sec)

vscode@dc13b6d0654a:/workspaces/floatingspeed/lap3dkernel$ ./run_all_tests.sh 
C++SIMD:
5 repetitions each run...
N=10000, M=10000. manual VCL SIMD avx512, ans: 470638087.690212
                                time: 0.189 s           2.64 Gpair/sec
N=10000, M=10000. manual VCL SIMD avx2 (256), ans: 470638087.690212
                                time: 0.183 s           2.74 Gpair/sec
N=10000, M=10000. target outer loop, ans: 470638087.690212
                                time: 0.424 s           1.18 Gpair/sec
N=10000, M=10000. source outer loop, ans: 470638087.690212
                                time: 0.641 s           0.78 Gpair/sec
FORTRAN:
./run_all_tests.sh: line 10: gfortran: command not found
 ns=       10000    nt=       10000
 ntest =           20  ...
 tot=   7549471.0676897569     
   100000000 src-targ pairs in   1.26053715     s:   1.58662510     Gpair/s
JULIA:
targ-vec: 100000000 src-targ pairs in 1.96 s: 0.0511 Gpair/s
devec: 100000000 src-targ pairs in 0.28 s: 0.357 Gpair/s
devec par: 100000000 src-targ pairs in 0.0532 s: 1.88 Gpair/s
PYTHON:
native: 100000000 src-targ pairs in 2.04 s: 0.049 Gpair/s
numba:  100000000 src-targ pairs in 0.047 s: 2.13 Gpair/s
pot err numba vs native:  6.82e-13
MATLAB:
./run_all_tests.sh: line 22: matlab: command not found
OCTAVE:
octave: X11 DISPLAY environment variable not set
octave: disabling GUI features
test 1/r kernel in 3D. N=10000, M=10000...
targ-outer:     u(test)=-204.925905525712        t=1.58 s        0.0635 Gpair/s
src-outer:      u(test)=-204.925905525712        t=1.47 s        0.0681 Gpair/s
dense-matvec:   u(test)=-204.9259055257115       t=3.51 s        0.0285 Gpair/s




vscode@dc13b6d0654a:/workspaces/floatingspeed/lap3dkernel$ ./run_all_tests.sh 
C++SIMD:
5 repetitions each run...
N=10000, M=10000. manual VCL SIMD avx512, ans: 470638087.690212
                                time: 0.131 s           3.82 Gpair/sec
N=10000, M=10000. manual VCL SIMD avx2 (256), ans: 470638087.690212
                                time: 0.137 s           3.64 Gpair/sec
N=10000, M=10000. target outer loop, ans: 470638087.690212
                                time: 0.297 s           1.68 Gpair/sec
N=10000, M=10000. source outer loop, ans: 470638087.690212
                                time: 0.711 s           0.703 Gpair/sec
FORTRAN:
./run_all_tests.sh: line 10: gfortran: command not found
 ns=       10000    nt=       10000
 ntest =           20  ...
 tot=   7453247.5122789210     
   100000000 src-targ pairs in   1.23374486     s:   1.62108076     Gpair/s
JULIA:
targ-vec: 100000000 src-targ pairs in 1.97 s: 0.0507 Gpair/s
devec: 100000000 src-targ pairs in 0.258 s: 0.388 Gpair/s
devec par: 100000000 src-targ pairs in 0.0501 s: 2 Gpair/s
PYTHON:
native: 100000000 src-targ pairs in 1.97 s: 0.0508 Gpair/s
numba:  100000000 src-targ pairs in 0.0437 s: 2.29 Gpair/s
pot err numba vs native:  6.82e-13
MATLAB:
./run_all_tests.sh: line 22: matlab: command not found
OCTAVE:
octave: X11 DISPLAY environment variable not set
octave: disabling GUI features
test 1/r kernel in 3D. N=10000, M=10000...
targ-outer:     u(test)=-287.8825547996272       t=1.46 s        0.0683 Gpair/s
src-outer:      u(test)=-287.8825547996272       t=1.46 s        0.0687 Gpair/s
dense-matvec:   u(test)=-287.8825547996273       t=3.53 s        0.0283 Gpair/s




vscode@dc13b6d0654a:/workspaces/floatingspeed/lap3dkernel$ ./run_all_tests.sh 
C++SIMD:
5 repetitions each run...
N=10000, M=10000. manual VCL SIMD avx512, ans: 470638087.690212
                                time: 0.161 s           3.11 Gpair/sec
N=10000, M=10000. manual VCL SIMD avx2 (256), ans: 470638087.690212
                                time: 0.17 s            2.93 Gpair/sec
N=10000, M=10000. target outer loop, ans: 470638087.690212
                                time: 0.425 s           1.18 Gpair/sec
N=10000, M=10000. source outer loop, ans: 470638087.690212
                                time: 0.663 s           0.755 Gpair/sec
FORTRAN:
./run_all_tests.sh: line 10: gfortran: command not found
 ns=       10000    nt=       10000
 ntest =           20  ...
 tot=   7421530.2827616464     
   100000000 src-targ pairs in   1.27976596     s:   1.56278574     Gpair/s
JULIA:
targ-vec: 100000000 src-targ pairs in 1.93 s: 0.0519 Gpair/s
devec: 100000000 src-targ pairs in 0.264 s: 0.379 Gpair/s
devec par: 100000000 src-targ pairs in 0.0523 s: 1.91 Gpair/s
PYTHON:
native: 100000000 src-targ pairs in 1.96 s: 0.0511 Gpair/s
numba:  100000000 src-targ pairs in 0.0512 s: 1.95 Gpair/s
pot err numba vs native:  5.12e-13
MATLAB:
./run_all_tests.sh: line 22: matlab: command not found
OCTAVE:
octave: X11 DISPLAY environment variable not set
octave: disabling GUI features
test 1/r kernel in 3D. N=10000, M=10000...
targ-outer:     u(test)=-143.9898175208911       t=1.45 s        0.0688 Gpair/s
src-outer:      u(test)=-143.9898175208911       t=1.47 s        0.0679 Gpair/s
dense-matvec:   u(test)=-143.9898175208914       t=3.53 s        0.0283 Gpair/s



vscode@dc13b6d0654a:/workspaces/floatingspeed/lap3dkernel$ ./run_all_tests.sh 
C++SIMD:
5 repetitions each run...
N=10000, M=10000. manual VCL SIMD avx512, ans: 470638087.690212
                                time: 0.121 s           4.13 Gpair/sec
N=10000, M=10000. manual VCL SIMD avx2 (256), ans: 470638087.690212
                                time: 0.115 s           4.36 Gpair/sec
N=10000, M=10000. target outer loop, ans: 470638087.690212
                                time: 0.298 s           1.68 Gpair/sec
N=10000, M=10000. source outer loop, ans: 470638087.690212
                                time: 0.593 s           0.843 Gpair/sec
FORTRAN:
./run_all_tests.sh: line 10: gfortran: command not found
 ns=       10000    nt=       10000
 ntest =           20  ...
 tot=   7469534.6971662715     
   100000000 src-targ pairs in   1.24486721     s:   1.60659707     Gpair/s
JULIA:
targ-vec: 100000000 src-targ pairs in 1.96 s: 0.0511 Gpair/s
devec: 100000000 src-targ pairs in 0.259 s: 0.386 Gpair/s
devec par: 100000000 src-targ pairs in 0.0506 s: 1.98 Gpair/s
PYTHON:
native: 100000000 src-targ pairs in 1.98 s: 0.0504 Gpair/s
numba:  100000000 src-targ pairs in 0.0494 s: 2.02 Gpair/s
pot err numba vs native:  4.55e-13
MATLAB:
./run_all_tests.sh: line 22: matlab: command not found
OCTAVE:
octave: X11 DISPLAY environment variable not set
octave: disabling GUI features
test 1/r kernel in 3D. N=10000, M=10000...
targ-outer:     u(test)=120.4067525416816        t=1.43 s        0.0699 Gpair/s
src-outer:      u(test)=120.4067525416816        t=1.46 s        0.0687 Gpair/s
dense-matvec:   u(test)=120.4067525416806        t=3.55 s        0.0282 Gpair/s
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant