Implement `rem(float, float, RoundNearest)` in Julia #42380

ararslan · 2021-09-25T19:50:25Z

One of the few remaining uses of openlibm is implementing rem with RoundNearest for Float32 and Float64. This commit translates the msun libm implementations of remainder and remainderf to Julia to avoid relying on openlibm or the local system libm.

The implementations of the Float32 and Float64 methods are quite similar and could be combined with a bit more work. I'd be happy to do that but would need some guidance for some of the less obvious magic numbers.

Checks off the rem box in #26434.

KristofferC · 2021-09-25T19:57:43Z

Out if curiosity, any performance numbers to share?

ararslan · 2021-09-25T23:54:46Z

any performance numbers to share?

Good idea! Seems to do pretty well (first here is this implementation):

julia> @benchmark rem(x, p, RoundNearest) setup=((x, p) = rand(Float64, 2) .* rand(Bool, 2))
BenchmarkTools.Trial: 10000 samples with 1000 evaluations.
 Range (min … max):  3.696 ns … 84.758 ns  ┊ GC (min … max): 0.00% … 0.00%
 Time  (median):     5.093 ns              ┊ GC (median):    0.00%
 Time  (mean ± σ):   8.287 ns ±  4.779 ns  ┊ GC (mean ± σ):  0.00% ± 0.00%

  ▇█▄▄▅                 ▂▂▅▇▆▅▃▂▂▂▁▂▂▃▃  ▁▁▂▁▁   ▁▁      ▁   ▂
  █████▄▁▁▄▅▁▁▁▁▁▁▁▁▁▁▁▇███████████████████████▇▇███▆▇▇█▇██▆ █
  3.7 ns       Histogram: log(frequency) by time     20.5 ns <

 Memory estimate: 0 bytes, allocs estimate: 0.

julia> @benchmark Base.rem(x, p, RoundNearest) setup=((x, p) = rand(Float64, 2) .* rand(Bool, 2))
BenchmarkTools.Trial: 10000 samples with 999 evaluations.
 Range (min … max):   9.454 ns … 108.064 ns  ┊ GC (min … max): 0.00% … 0.00%
 Time  (median):     35.748 ns               ┊ GC (median):    0.00%
 Time  (mean ± σ):   31.370 ns ±   9.344 ns  ┊ GC (mean ± σ):  0.00% ± 0.00%

  ▇▃                               ▇▃▆▃▁▂   █▇▅▁▄▃ ▁▁       ▃  ▂
  ███▇█▇▄▄▄▁▁▁▁▁▃▁▃▁▁▁▁▁▁▁▄▃▁▃▄▁▁▄▃██████▇█▅█████████▇▆█▆██▅█▇ █
  9.45 ns       Histogram: log(frequency) by time      46.5 ns <

 Memory estimate: 0 bytes, allocs estimate: 0.

oscardssmith · 2021-09-26T00:12:26Z

Any idea why this is faster? Does a ccall have noticable overhead?

ararslan · 2021-09-26T00:15:47Z

Any idea why this is faster?

None whatsoever

simonbyrne · 2021-09-26T04:40:13Z

This is perhaps one case where it would be fine to use the llvm intrinsic (since all implementations should return the same value), but having our own probably makes sense. Thanks @ararslan!

ararslan · 2021-09-26T04:52:17Z

I was looking into doing this directly with LLVM initially but AFAICT the available LLVM intrinsics don't have the RoundNearest behavior.

ViralBShah · 2021-09-26T17:09:41Z

Only modf to go after this!

ararslan · 2021-09-26T17:13:19Z

Only modf to go after this!

I have a local implementation of that one too, will be doing a bit more testing then making a PR. 🙂

oscardssmith · 2021-09-26T17:17:52Z

Don't we still have fma to go?

simonbyrne · 2021-09-26T19:29:26Z

I was looking into doing this directly with LLVM initially but AFAICT the available LLVM intrinsics don't have the RoundNearest behavior.

They don't? What do they use.

ararslan · 2021-09-26T20:01:38Z

It's not specified in the LLVM documentation but based on experimentation LLVM's frem seems to give the same result as our rem with RoundToZero. EDIT: They say it's the same as fmod, which explains that.

simonbyrne · 2021-09-26T21:00:29Z

Oh right, LLVM doesn't offer a remainder intrinsic 😞

simonbyrne · 2021-09-26T21:01:23Z

you may need some sort of copyright acknowledgement?

stevengj · 2021-09-27T18:45:16Z

Needs tests?

ararslan · 2021-09-27T18:46:47Z

There are existing tests for these methods which I assumed were sufficient but I can always add more.

ararslan · 2021-09-27T22:30:05Z

Test failures:

Linux x64: segfault in linear algebra tridiag tests, existing issue
FreeBSD x64: unwinding-related segfault, existing issue
Windows x86: timeout

All of these platforms successfully ran the numbers tests, which is where rem tests happen.

simonbyrne · 2021-09-27T22:40:14Z

It would be nice if we could get rid of some of the bit twiddling and use plain Julia functions (isnan, isfinite, etc), and combine them into one function

simonbyrne · 2021-09-28T04:15:02Z

I don't quite understand why it does rem(x, 2p) instead of rem(x, p). Any idea?

simonbyrne · 2021-09-28T04:19:56Z

Ah, because of the ties-to-even.

One of the few remaining uses of openlibm is implementing `rem` with `RoundNearest` for `Float32` and `Float64`. This commit translates the msun libm implementations of `__ieee754_remainder` and `___ieee754_remainderf` to Julia to avoid relying on openlibm.

ararslan · 2021-09-28T17:14:02Z

Made some changes, current state:

No more bit twiddling or reinterpreting, it's plain functions and direct comparisons of floats all the way down
Just one method definition, no longer separated by Float32 vs. Float64
Float16 now goes through the same code path as for Float32 and Float64 without needing to first promote to Float32
Added some extra tests for the edge cases
Performance still seems to be slightly better than current master

Summary of test failures:

Linux: Segfault from OpenBLAS
macOS: Segfault from OpenBLAS
FreeBSD: Segfault from libunwind

base/math.jl

Also make sure that signed zero is checked in the test

ararslan · 2021-09-29T21:40:11Z

Simon's feedback was incorporated and the only CI failure is FreeBSD, which is a known issue, so I'll go ahead and merge. Thanks!

KristofferC · 2021-09-30T07:49:29Z

Ended up very clean and nice!

One of the few remaining uses of openlibm is implementing `rem` with `RoundNearest` for `Float32` and `Float64`. This commit translates the msun libm implementations of `__ieee754_remainder` and `___ieee754_remainderf` to Julia to avoid relying on openlibm.

ararslan added the maths Mathematical functions label Sep 25, 2021

ararslan requested review from simonbyrne and stevengj September 25, 2021 19:50

oscardssmith self-requested a review September 25, 2021 20:47

ararslan force-pushed the aa/remainder branch from 9d4df8f to 88a962b Compare September 27, 2021 18:40

ararslan force-pushed the aa/remainder branch from 88a962b to bc13ba2 Compare September 28, 2021 05:56

simonbyrne approved these changes Sep 29, 2021

View reviewed changes

base/math.jl Outdated Show resolved Hide resolved

base/math.jl Outdated Show resolved Hide resolved

base/math.jl Outdated Show resolved Hide resolved

base/math.jl Outdated Show resolved Hide resolved

base/math.jl Outdated Show resolved Hide resolved

Algorithm and test simplifications

ec5dc21

Also make sure that signed zero is checked in the test

ararslan merged commit c8b5904 into master Sep 29, 2021

ararslan deleted the aa/remainder branch September 29, 2021 21:40

ViralBShah mentioned this pull request Sep 29, 2021

Remove openlibm #26434

Open

17 tasks

barucden mentioned this pull request Oct 6, 2021

Faster Float64^Float64 #42271

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement `rem(float, float, RoundNearest)` in Julia #42380

Implement `rem(float, float, RoundNearest)` in Julia #42380

ararslan commented Sep 25, 2021

KristofferC commented Sep 25, 2021

ararslan commented Sep 25, 2021

oscardssmith commented Sep 26, 2021

ararslan commented Sep 26, 2021

simonbyrne commented Sep 26, 2021

ararslan commented Sep 26, 2021

ViralBShah commented Sep 26, 2021

ararslan commented Sep 26, 2021

oscardssmith commented Sep 26, 2021

simonbyrne commented Sep 26, 2021

ararslan commented Sep 26, 2021 •

edited

Loading

simonbyrne commented Sep 26, 2021

simonbyrne commented Sep 26, 2021

stevengj commented Sep 27, 2021

ararslan commented Sep 27, 2021

ararslan commented Sep 27, 2021

simonbyrne commented Sep 27, 2021

simonbyrne commented Sep 28, 2021

simonbyrne commented Sep 28, 2021

ararslan commented Sep 28, 2021 •

edited

Loading

ararslan commented Sep 29, 2021

KristofferC commented Sep 30, 2021

Implement rem(float, float, RoundNearest) in Julia #42380

Implement rem(float, float, RoundNearest) in Julia #42380

Conversation

ararslan commented Sep 25, 2021

KristofferC commented Sep 25, 2021

ararslan commented Sep 25, 2021

oscardssmith commented Sep 26, 2021

ararslan commented Sep 26, 2021

simonbyrne commented Sep 26, 2021

ararslan commented Sep 26, 2021

ViralBShah commented Sep 26, 2021

ararslan commented Sep 26, 2021

oscardssmith commented Sep 26, 2021

simonbyrne commented Sep 26, 2021

ararslan commented Sep 26, 2021 • edited Loading

simonbyrne commented Sep 26, 2021

simonbyrne commented Sep 26, 2021

stevengj commented Sep 27, 2021

ararslan commented Sep 27, 2021

ararslan commented Sep 27, 2021

simonbyrne commented Sep 27, 2021

simonbyrne commented Sep 28, 2021

simonbyrne commented Sep 28, 2021

ararslan commented Sep 28, 2021 • edited Loading

ararslan commented Sep 29, 2021

KristofferC commented Sep 30, 2021

Implement `rem(float, float, RoundNearest)` in Julia #42380

Implement `rem(float, float, RoundNearest)` in Julia #42380

ararslan commented Sep 26, 2021 •

edited

Loading

ararslan commented Sep 28, 2021 •

edited

Loading