nqt o2 #433

Yurlungur · 2024-11-17T23:02:20Z

PR Summary

This threads in the second-order NQT method developed by Peter Hammond on the AthenaK team. More details to follow as I develop them. There will also be an archive note eventually.

PR Checklist

Adds a test for any bugs fixed. Adds tests for new features.
Format your changes by using the make format command after configuring with cmake.
Document any new features, update documentation for changes made.
Make sure the copyright notice on any files you modified is up to date.
After creating a pull request, note it in the CHANGELOG.md file.
LANL employees: make sure tests pass both on the github CI and on the Darwin CI

If preparing for a new release, in addition please check the following:

Update the version in cmake.
Move the changes in the CHANGELOG.md file under a new header for the new release, and reset the categories.
Ensure that any when='@main' dependencies are updated to the release version in the package.py

mauneyc-LANL · 2024-11-18T17:32:44Z

singularity-eos/base/fast-math/logs.hpp

Apologies if there's code I'm not seeing that addresses this, but do we check that the typedef Real is double precision before including this? Are there equivalent 'float' procedures?

There are no equivalent float procedures at this time. Nor are there little endians. However, all functions in this code explicitly use doubles.

You're worried about building for single-precision I assume. Yes, we should be worried about that. Let's extend that in a later PR, though. Getting this working was enough of a lift.
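Until single-precision variants exist, one way to make the double-only assumption explicit is a compile-time guard. The following is a minimal sketch, not code from this PR: `Real` is a local stand-in for the library's typedef, and `nqt_supports_v` is a hypothetical helper name introduced only for illustration.

```cpp
#include <type_traits>

// Stand-in for the library-wide floating-point typedef (assumption for this sketch).
using Real = double;

// Hypothetical trait: true only for the type the double-only fast logs handle.
template <typename T>
inline constexpr bool nqt_supports_v = std::is_same_v<T, double>;

// Fails at compile time if Real is ever switched to a type other than double.
static_assert(nqt_supports_v<Real>,
              "fast-math logs are implemented for double precision only");
```

With a guard like this, a single-precision build would stop with a clear message instead of silently mixing precisions.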

mauneyc-LANL

Comments for discussion rather than blocking requests.

singularity-eos/base/fast-math/logs.hpp

Yurlungur · 2024-11-19T05:02:26Z

Timing results

x86-volta

NQT o1

                ...L2 difference = 0.0254583
                ...EOSPAC time/point (microseconds) = 0.493015
                ...spiner host time/point (microseconds) = 0.0335447
                ...spiner device time/point (microseconds) = 0.00160943

NQT o2

                ...L2 difference = 0.00807353
                ...EOSPAC time/point (microseconds) = 0.491144
                ...spiner host time/point (microseconds) = 0.033117
                ...spiner device time/point (microseconds) = 0.00142899

NQT true

                ...L2 difference = 0.0171703
                ...EOSPAC time/point (microseconds) = 0.491845
                ...spiner host time/point (microseconds) = 0.0619208
                ...spiner device time/point (microseconds) = 0.00149041

grace-hopper

NQT o1

                ...L2 difference = 0.0254583
                ...EOSPAC time/point (microseconds) = 0.327759
                ...spiner host time/point (microseconds) = 0.0110745
                ...spiner device time/point (microseconds) = 0.000517656

NQT o2

                ...L2 difference = 0.00807353
                ...EOSPAC time/point (microseconds) = 0.302807
                ...spiner host time/point (microseconds) = 0.0137009
                ...spiner device time/point (microseconds) = 0.000571444

NQT true

                ...L2 difference = 0.0171703
                ...EOSPAC time/point (microseconds) = 0.326109
                ...spiner host time/point (microseconds) = 0.0203759
                ...spiner device time/point (microseconds) = 0.000564959

Yurlungur · 2024-11-19T12:52:03Z

Pointwise error

Yurlungur · 2024-11-20T04:23:49Z

Timings for the stellar collapse reader in nanoseconds/point

jonahm@sgrb:~/simulations/spiner$ tail -n 2 timings_SFHo_o1_x86_volta.txt 
host = 22.3432
device = 0.641151
jonahm@sgrb:~/simulations/spiner$ tail -n 2 timings_SFHo_o2_x86_volta.txt 
host = 31.665
device = 0.612225
jonahm@sgrb:~/simulations/spiner$ tail -n 2 timings_SFHo_true_x86_volta.txt 
host = 77.0338
device = 0.665203
jonahm@sgrb:~/simulations/spiner$ tail -n 2 timings_SFHo_o1_grace_hopper.txt 
host = 9.79919
device = 0.443848
jonahm@sgrb:~/simulations/spiner$ tail -n 2 timings_SFHo_o2_grace_hopper.txt 
host = 15.484
device = 0.453125
jonahm@sgrb:~/simulations/spiner$ tail -n 2 timings_SFHo_true_grace_hopper.txt 
host = 22.6683
device = 0.443237

So on GPUs there is no difference. On Grace, the host speedup of o2 over true logs is ~25%, and on x86 it's ~2x.
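The ratios can be recomputed directly from the timings listed above. This is a small sketch for doing that arithmetic; the `Timing` struct and `speedup` helper are names made up here, and the numbers are copied from the `tail` output for the o2 and true-log runs.

```cpp
#include <cassert>

// Host and device timings in ns/point, copied from the tail output above.
struct Timing {
  double host;
  double device;
};

constexpr Timing x86_o2{31.665, 0.612225};
constexpr Timing x86_true{77.0338, 0.665203};
constexpr Timing grace_o2{15.484, 0.453125};
constexpr Timing grace_true{22.6683, 0.443237};

// Ratio of baseline time to candidate time; a value above 1 means the
// candidate is faster than the baseline.
inline double speedup(double baseline_ns, double candidate_ns) {
  assert(candidate_ns > 0.0);
  return baseline_ns / candidate_ns;
}
```

For example, `speedup(x86_true.host, x86_o2.host)` compares true logs against o2 on the x86 host, and the same call with the `device` members does so for the GPU.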

Yurlungur · 2024-11-20T04:24:01Z

Error plots

Yurlungur · 2024-11-20T04:24:31Z

@jhp-lanl @dholladay00 @pdmullen @jdolence this is ready for review

Yurlungur · 2024-11-20T04:25:05Z

example/eos_grid.cpp

New example showing how to profile. Can be set to profile any EOS, even analytic.
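For readers unfamiliar with the time/point figures quoted in this thread, the measurement can be sketched as below. This does not reproduce `example/eos_grid.cpp`; the ideal-gas function, the grid layout, and the helper names are all assumptions chosen for illustration.

```cpp
#include <cassert>
#include <chrono>
#include <cstddef>
#include <vector>

// Hypothetical analytic EOS used only for illustration: ideal-gas pressure.
inline double ideal_gas_pressure(double rho, double sie, double gamma = 5.0 / 3.0) {
  return (gamma - 1.0) * rho * sie;
}

// Evaluate the EOS on an n x n grid of (rho, sie) points and return the
// average wall-clock time per point in microseconds.
inline double time_per_point_us(std::size_t n, std::vector<double> &out) {
  assert(n > 0);
  out.assign(n * n, 0.0);
  const auto start = std::chrono::steady_clock::now();
  for (std::size_t i = 0; i < n; ++i) {
    const double rho = 1.0 + static_cast<double>(i);
    for (std::size_t j = 0; j < n; ++j) {
      const double sie = 1.0 + static_cast<double>(j);
      out[i * n + j] = ideal_gas_pressure(rho, sie);
    }
  }
  const auto stop = std::chrono::steady_clock::now();
  const std::chrono::duration<double, std::micro> elapsed = stop - start;
  return elapsed.count() / static_cast<double>(n * n);
}
```

Writing the results into `out` keeps the compiler from discarding the loop; a real profile would also repeat the measurement and use a much larger grid.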

CMakeLists.txt

test/test_bounds.cpp

test/test_math_utils.cpp

singularity-eos/base/fast-math/logs.hpp

doc/sphinx/src/contributing.rst

Yurlungur · 2024-11-20T04:57:09Z

doc/sphinx/src/contributing.rst

The core of the method is described in the change to this documentation. Essentially, using quadratic interpolation for $\lg(\mathrm{mantissa})$ on the interval $[0.5, 1)$ makes the NQT function $C^1$ everywhere, which improves the convergence of linear interpolation. The function is still a 100% accurate representation of itself; what changes is its continuity class, and raising that improves the convergence of stencil operations.
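To make the idea concrete, here is a minimal sketch of a second-order NQT logarithm. It is not the code in `logs.hpp`, which may differ (for example by using integer bit manipulation); the function name `lg_o2_sketch` is hypothetical. The quadratic is the one fixed by requiring $q(0.5) = -1$, $q(1) = 0$, and $q'(0.5) = 2\,q'(1)$, which are the conditions for the value and slope to match across powers of two.

```cpp
#include <cassert>
#include <cmath>

// Second-order not-quite-transcendental log base 2 (illustrative sketch).
// Writes x = m * 2^e with m in [0.5, 1) and replaces lg(m) by the quadratic
// q(m) = -(4/3) (m - 2) (m - 1), which satisfies q(0.5) = -1, q(1) = 0 and
// q'(0.5) = 2 q'(1), so both value and slope are continuous at powers of two.
inline double lg_o2_sketch(double x) {
  assert(x > 0.0);
  int e = 0;
  const double m = std::frexp(x, &e); // mantissa in [0.5, 1), exponent in e
  return e - (4.0 / 3.0) * (m - 2.0) * (m - 1.0);
}
```

The function agrees with the true $\lg$ at exact powers of two and is only an approximation in between, but its one-sided slopes agree at every power of two, which is the $C^1$ property discussed above.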

CMakeLists.txt

singularity-eos/base/fast-math/logs.hpp

test/test_bounds.cpp

test/test_math_utils.cpp

doc/sphinx/src/contributing.rst

singularity-eos/eos/eos_spiner.hpp

Yurlungur · 2024-11-21T02:19:35Z

Tests are passing on re-git. Any objections to merging this? @AstroBarker let's discuss phoebus when you get the chance.

jonahm-LANL added 2 commits November 17, 2024 12:40

Thread through quadratic logs. Thanks Peter and Jacob.

9076611

bounds typo

820b9dc

Yurlungur added the bug and enhancement labels Nov 17, 2024

Yurlungur requested review from dholladay00, jdolence, pdmullen, mauneyc-LANL and jhp-lanl November 17, 2024 23:02

Yurlungur self-assigned this Nov 17, 2024

jonahm-LANL added 3 commits November 18, 2024 07:55

output on failure

ea15582

useless error messages... and it works on my machine...

b384d2c

formatting

82c5c24

mauneyc-LANL reviewed Nov 18, 2024

mauneyc-LANL approved these changes Nov 18, 2024

jonahm-LANL added 4 commits November 18, 2024 12:14

fix error bounds on CI machine

c17e01e

clean up shared constants

5de1fdb

move to constexpr if with settings namespace

91b104d

it passes on my machine???

5cdfc2d

jonahm-LANL added 2 commits November 18, 2024 22:33

the anchor appears shifted by 1 point, weirdly.

8889f77

or not supported...

75afc4d

Yurlungur changed the title from "[WIP] nqt o2" to "nqt o2" Nov 19, 2024

jonahm-LANL added 2 commits November 19, 2024 20:02

add eos_grid example

5ee3465

remove eps which is unused

ee8e090

Yurlungur requested a review from AstroBarker November 20, 2024 03:10

Yurlungur commented Nov 20, 2024

Yurlungur requested a review from chadmeyer November 20, 2024 04:39

pdmullen approved these changes Nov 20, 2024

CMakeLists.txt

test/test_bounds.cpp

test/test_math_utils.cpp

singularity-eos/base/fast-math/logs.hpp

AstroBarker reviewed Nov 20, 2024

doc/sphinx/src/contributing.rst

Yurlungur commented Nov 20, 2024

jonahm-LANL added 4 commits November 20, 2024 09:06

move compute bulk modulus to BEFORE reinterpolating to fast logs

62188b2

formatting

a188f2a

mantiss -> mantissa

fcbef9d

add versioning to sp5 files

66f6fd7

pdmullen reviewed Nov 20, 2024

singularity-eos/eos/eos_spiner.hpp

add ability to densify rho/T in fast log table for stellar collapse

d9f29c0

jonahm-LANL added 2 commits November 20, 2024 19:58

BMOD IS LOGGED

c8ea509

zero-initialize sieOffset

32abaad

Yurlungur merged commit d7af402 into main Nov 21, 2024
6 checks passed

Yurlungur deleted the jmm/nqt-o2 branch November 21, 2024 23:39
