Newton-Raphson root find for Helmholtz EOS #304

AlexHls · 2023-09-12T12:51:02Z

PR Summary

Adds a Newton-Raphson root find, specifically for use with the Helmholtz EOS. Although the current regula falsi produces on average more precise results, it is significantly more expensive. Moreover it is the method used in the originally published paper by Timmes.

Plot comparing both root finds

This benchmark was generated evaluating the EOS at random input values. It is probably not super scientific/ accurate, but should suffice the advantage of the Newton-Raphson root find.

A few points may (or may not) need to be addressed still:

The current implementation expects a different function to be passed compared to the other root finds, i.e. it expects a tuple containing the derivative. This works straightforwards with the Helmholtz EOS, but might not work in general. However since this is only an internal function call this should be fine.
I choose to make the root find method a runtime option, but I'm not sure if this is the way to go. So far I think it is better to have this as a runtime option so one can easily switch between the two methods, but I might be mistaken here.
The default is now the Newton-Raphson root find, purely based on the better performance.

PR Checklist

Adds a test for any bugs fixed. Adds tests for new features.
Format your changes by using the make format command after configuring with cmake.
Document any new features, update documentation for changes made.
Make sure the copyright notice on any files you modified is up to date.
After creating a pull request, note it in the CHANGELOG.md file
If preparing for a new release, update the version in cmake.

jhp-lanl · 2023-09-12T19:02:04Z

Is there a possibility that the Newton method won't converge? If so, it might make sense to structure the root find option as use RF is NR fails or if the user specifically requests not to use NR.

AlexHls · 2023-09-13T08:36:39Z

Is there a possibility that the Newton method won't converge? If so, it might make sense to structure the root find option as use RF is NR fails or if the user specifically requests not to use NR.

Thanks for the suggestion! I've added RF as fallback to the NR in cases the NR root find fails.

As far as convergence is concerned, this a bit more difficult to answer. In my naive test program, RF was actually more likely to not converge, but I know that in astrophysical simulations the NR also has trouble converging in certain parameter spaces. The question is if the RF would actually converge better in those cases (e.g. there is nothing that can be done once the root find runs into the edges of the underlying table) and, more importantly, if the non-convergence actually matters in these cases (vs the increased computational cost). To my knowledge, most astrophysical simulations have 'ignored' this and just bound the result of the root find back to the tabulated values. I might be mistaken though, but I guess this is something that one would first need to investigate extensively.

For now I have implemented RF as a fallback if NR does not converge, in my opinion this is a sensible default.

jhp-lanl · 2023-09-13T15:45:14Z

As far as convergence is concerned, this a bit more difficult to answer. In my naive test program, RF was actually more likely to not converge, but I know that in astrophysical simulations the NR also has trouble converging in certain parameter spaces. The question is if the RF would actually converge better in those cases (e.g. there is nothing that can be done once the root find runs into the edges of the underlying table) and, more importantly, if the non-convergence actually matters in these cases (vs the increased computational cost). To my knowledge, most astrophysical simulations have 'ignored' this and just bound the result of the root find back to the tabulated values. I might be mistaken though, but I guess this is something that one would first need to investigate extensively.

Interesting! I suppose I'm used to using the RF method on problems that have an easy way to bracket the solution a priori via some known bounds. In those cases, the RF method is guaranteed to converge albeit slowly sometimes. I assume that when you say that RF didn't converge, it means that you were unable to bracket the solution? Or was it simply converging so slowly that it ran out of iterations?

Another thing we could consider in the future is to improve the RF method to use either the Pegasus or Illinois algorithms (http://paulklein.se/newsite/teaching/rootfinding.pdf) to get superlinear convergence.

AlexHls · 2023-09-14T09:24:38Z

As far as convergence is concerned, this a bit more difficult to answer. In my naive test program, RF was actually more likely to not converge, but I know that in astrophysical simulations the NR also has trouble converging in certain parameter spaces. The question is if the RF would actually converge better in those cases (e.g. there is nothing that can be done once the root find runs into the edges of the underlying table) and, more importantly, if the non-convergence actually matters in these cases (vs the increased computational cost). To my knowledge, most astrophysical simulations have 'ignored' this and just bound the result of the root find back to the tabulated values. I might be mistaken though, but I guess this is something that one would first need to investigate extensively.

Interesting! I suppose I'm used to using the RF method on problems that have an easy way to bracket the solution a priori via some known bounds. In those cases, the RF method is guaranteed to converge albeit slowly sometimes. I assume that when you say that RF didn't converge, it means that you were unable to bracket the solution? Or was it simply converging so slowly that it ran out of iterations?

Another thing we could consider in the future is to improve the RF method to use either the Pegasus or Illinois algorithms (http://paulklein.se/newsite/teaching/rootfinding.pdf) to get superlinear convergence.

Thanks for the interesting reference! I definitely want to take a deeper dive into this at some point.

In case of the RF failures, as far as I can tell the root find reaches the maximum number of iterations. (Note MAX_ITER_RF = 1000 vs MAX_ITER_NR = 100.

Ultimately I think that this needs a thorough investigation to make an informed decision / analysis on the advantages of different root finding algorithms. The point of this MR is mainly to make the NR root find available for the Helmholtz EOS, as it is the tried and testend method for this EOS (not to speak of the significant speedup, which is important for the applications I'm working with) that so far has not been questioned. In the end it is probably an extremely efficient algorithm for this use case since F. Timmes fine tuned the underlying table to work with NR (i.e. taking care that the derivatives are accurate).

jhp-lanl · 2023-09-14T14:37:56Z

The point of this MR is mainly to make the NR root find available for the Helmholtz EOS, as it is the tried and testend method for this EOS (not to speak of the significant speedup, which is important for the applications I'm working with) that so far has not been questioned. In the end it is probably an extremely efficient algorithm for this use case since F. Timmes fine tuned the underlying table to work with NR (i.e. taking care that the derivatives are accurate).

Seems fair! I'll take a closer look when I get a chance but @Yurlungur should also review

jhp-lanl

Minor changes/questions

jhp-lanl · 2023-09-14T15:06:45Z