replace checked_fptosi intrinsics with Julia implementation #14763

simonbyrne · 2016-01-22T16:04:26Z

Fixes #14549.

As @yuyichao points out, this will affect inlining rules.

yuyichao · 2016-01-22T16:29:02Z

Let's check =) runbenchmarks(ALL, vs = "JuliaLang/julia:master")

simonbyrne · 2016-06-28T22:17:40Z

@nanosoldier runbenchmarks(ALL, vs=":master")

simonbyrne · 2016-06-28T22:19:28Z

This has been updated to do an explicit range check, so it should no longer rely on any undefined behaviour.
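A minimal sketch of what an explicit range check of this kind can look like, for signed integer targets only. The name `checked_trunc_sketch` is hypothetical and this is not the code from the PR; it assumes a current Julia version, where `InexactError` takes arguments (the diff in this thread uses the older zero-argument form).

```julia
# Hypothetical helper for illustration; not the implementation from this PR.
# Truncates x toward zero and converts to Ti, throwing if the result would not fit.
function checked_trunc_sketch(::Type{Ti}, x::Tf) where {Ti<:Union{Int8,Int16,Int32,Int64,Int128},
                                                        Tf<:Union{Float32,Float64}}
    lo = Tf(typemin(Ti))   # -2^(n-1): a power of two, so the conversion is exact
    hi = -lo               # 2^(n-1) == typemax(Ti) + 1, also exact
    inrange = if lo - one(Tf) < lo
        # lo - 1 is representable, so everything in (lo - 1, hi) truncates into range
        lo - one(Tf) < x < hi
    else
        # lo - 1 rounds back to lo, so lo itself is the smallest valid input
        lo <= x < hi
    end
    # NaN fails both comparisons and therefore also ends up here
    inrange || throw(InexactError(:checked_trunc_sketch, Ti, x))
    return unsafe_trunc(Ti, x)   # safe: x is known to be in range at this point
end
```

Because the bounds are compared before `unsafe_trunc` is called, the unchecked conversion never sees a value whose result would be undefined.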

nanosoldier · 2016-06-29T00:50:10Z

Your benchmark job has completed - possible performance regressions were detected. A full report can be found here. cc @jrevels

simonbyrne · 2016-06-29T02:42:24Z

Hmm, the slowdown seems to be due to the fact that the length of FloatRange and LinSpace is stored as a floating-point number, which has to be converted to an integer frequently.

tkelman · 2016-09-23T10:24:00Z

base/float.jl

+                    throw(InexactError())
+                end
+            end
+        else #


Was there going to be a comment here, and then you changed your mind?

simonbyrne · Sep 23, 2016

I can't remember; this was quite a long time ago (if only I had left a comment on what that comment was going to be...)

simonbyrne · 2016-09-23T10:52:11Z

@nanosoldier runbenchmarks(ALL, vs=":master")

jrevels · 2016-09-23T15:53:33Z

@simonbyrne looks like you may have submitted the job with the wrong syntax, and then edited to the correct syntax? Nanosoldier ignores comment edits to prevent accidental job resubmission. Triggering it with a new comment should work:

@nanosoldier runbenchmarks(ALL, vs=":master")

simonbyrne · 2016-09-23T16:31:29Z

thanks!

vchuravy · 2016-09-23T17:58:27Z

Would this also work for Float16?

simonbyrne · 2016-09-23T18:32:56Z

It should do, the logic is pretty much the same.

nanosoldier · 2016-09-23T18:36:29Z

Your benchmark job has completed - possible performance regressions were detected. A full report can be found here. cc @jrevels

simonbyrne · 2016-09-24T18:26:13Z

Actually, it won't work for Float16, but I've added a note as to why. I didn't include the Float16 logic here, as the current method (converting to Float32 first) is probably more efficient.

simonbyrne · 2016-09-24T19:22:14Z

Updated. It seems like the change in performance is a bit of a wash. We may want to look at speeding up FloatRange and LinSpace either by inlining or perhaps even redesigning so the length is stored as an integer, but this is probably better in a different PR.

Any objections to merging? cc @ViralBShah @vchuravy

vchuravy · 2016-09-24T20:01:20Z

No objections to merging.

I would still be interested in an implementation for Float16, mostly because I am looking at improving our support for Float16 on platforms that support it natively.

simonbyrne · 2016-09-24T20:14:43Z

It should be possible via an extra branch: just check if Tf(typemin(Ti)) == -Inf, in which case use a strict inequality rather than allowing equality for the lower bound.
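The extra branch described above could be sketched as follows. This is only an illustration under stated assumptions: the name `checked_trunc_f16_sketch` is hypothetical, only signed targets are handled, and the PR itself keeps Float16 on the convert-to-Float32 path.

```julia
# Hypothetical sketch of a Float16 range check; not part of this PR.
function checked_trunc_f16_sketch(::Type{Ti}, x::Float16) where {Ti<:Union{Int8,Int16,Int32,Int64}}
    # Go through Float32 so that an out-of-range typemin overflows to -Inf
    lo = Float16(Float32(typemin(Ti)))
    hi = -lo
    inrange = if isinf(lo)
        # typemin(Ti) overflows Float16, so every finite value fits;
        # the strict inequalities exclude -Inf and Inf themselves
        lo < x < hi
    elseif lo - one(Float16) < lo
        # lo - 1 is representable (e.g. Int8), so compare against it directly
        lo - one(Float16) < x < hi
    else
        # lo - 1 rounds back to lo (e.g. Int16), so lo is the smallest valid input
        lo <= x < hi
    end
    inrange || throw(InexactError(:checked_trunc_f16_sketch, Ti, x))
    return unsafe_trunc(Ti, Float32(x))   # Float16 -> Float32 is exact
end
```

For `Int32` and `Int64` the first branch applies and the check reduces to "is `x` finite"; for `Int16` the lower bound is -32768 itself, and for `Int8` it is anything above -129.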

tkelman · 2016-09-24T20:53:37Z

that's a pretty bad slowdown in sparse indexing, worth looking into and profiling the difference

simonbyrne · 2016-09-24T21:36:18Z

I couldn't figure it out: there don't appear to be any conversion calls, and I can't recreate it locally.

One more time:

@nanosoldier runbenchmarks(ALL, vs=":master")

nanosoldier · 2016-09-25T00:14:09Z

Your benchmark job has completed - possible performance regressions were detected. A full report can be found here. cc @jrevels

simonbyrne · 2016-09-25T06:38:13Z

The sparse issue seems to have mostly gone.

simonbyrne · 2016-09-25T13:22:09Z

Given the inconsistency, it seems that the regressions are mostly noise. We do get a nice speed boost on some though. Shall we merge this?

musm · 2016-09-25T17:26:37Z

Out of curiosity, how reliable is nanosoldier if +-15% can be mainly attributed to noise?

simonbyrne · 2016-09-30T08:55:25Z

@nanosoldier runbenchmarks(ALL, vs=":master")

nanosoldier · 2016-09-30T11:32:54Z

Your benchmark job has completed - possible performance regressions were detected. A full report can be found here. cc @jrevels

simonbyrne · 2016-09-30T12:09:34Z

Okay, we'll have to figure out how to improve FloatRange and LinSpace indexing, but I think this is good to go.

simonbyrne · 2016-10-07T09:50:23Z

Will probably want to backport this to get tests to pass on ARM/Power in next release.

tkelman · 2016-10-07T10:26:38Z

jl_checked_fptoui and jl_checked_fptosi were exported from libjulia, but it doesn't look like any packages call into them.

Replaces checked_fptosi/checked_fptoui intrinsics with Julia implementations. Fixes #14549. Explain logic behind float->integer conversion checking (cherry picked from commit f935a50)

This reverts commit ea37c3c.

Revert "replace checked_fptosi intrinsics with Julia implementation (#14763)". This reverts commit 5512553. Caused significant slowdown in conversion.

simonbyrne changed the title ~~markdown: ensure line-break after displayed math~~ replace checked_fptosi intrinsics Jan 22, 2016

simonbyrne changed the title ~~replace checked_fptosi intrinsics~~ replace checked_fptosi intrinsics with Julia implementation Jan 22, 2016

simonbyrne force-pushed the sb/fptosi branch from 9734e5d to bf773a0 Compare June 28, 2016 22:09

simonbyrne force-pushed the sb/fptosi branch 2 times, most recently from c92f76e to fea8f26 Compare September 21, 2016 11:24

tkelman reviewed Sep 23, 2016

simonbyrne force-pushed the sb/fptosi branch from 8aa8741 to c9b115b Compare September 24, 2016 18:24

simonbyrne force-pushed the sb/fptosi branch from c9b115b to 88c66b4 Compare September 24, 2016 19:14

simonbyrne force-pushed the sb/fptosi branch from 88c66b4 to bc7a951 Compare September 24, 2016 20:17

simonbyrne added 2 commits September 29, 2016 21:45

Replaces checked_fptosi/checked_fptoui intrinsics with Julia implementations. Fixes #14549.

ad872cb

explain logic behind float->integer conversion checking

f0355cb

simonbyrne force-pushed the sb/fptosi branch from bc7a951 to f0355cb Compare September 29, 2016 20:45

simonbyrne merged commit f935a50 into master Sep 30, 2016

simonbyrne deleted the sb/fptosi branch September 30, 2016 12:13

simonbyrne added the backport pending 0.5 label Oct 7, 2016

simonbyrne mentioned this pull request Oct 15, 2016

regression in float to int performance on master #18954

Closed
