Preserve input types for various rules #89

ptiede · 2022-11-03T15:48:35Z

This pull request ensures that the input type is preserved for various rules.
Previously, there were a few places where 64-bit NaNs would always be produced
regardless of the input type. To fix this, I replaced every instance of :NaN with oftype($x, NaN).

Additionally, this pull request fixes the issue with the ldexp rule from #88, which is similar in nature.
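For readers unfamiliar with the promotion involved, here is a minimal sketch in plain Julia (no DiffRules required). The variable names are illustrative only and do not come from the package.

```julia
# NaN is a Float64 literal, so combining it with a Float32 widens the result.
x = 1.5f0                        # a Float32 input
widened = x * NaN                # Float32 * Float64 promotes to Float64
preserved = x * oftype(x, NaN)   # NaN is converted to typeof(x) first

@show typeof(widened)    # Float64
@show typeof(preserved)  # Float32
```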

codecov-commenter · 2022-11-03T16:01:40Z

Codecov Report

Base: 97.31% // Head: 97.31% // No change to project coverage 👍

Coverage data is based on head (5767bca) compared to base (489e294).
Patch coverage: 100.00% of modified lines in pull request are covered.

Additional details and impacted files

@@           Coverage Diff           @@
##           master      #89   +/-   ##
=======================================
  Coverage   97.31%   97.31%           
=======================================
  Files           3        3           
  Lines         186      186           
=======================================
  Hits          181      181           
  Misses          5        5

Impacted Files	Coverage Δ
src/rules.jl	`100.00% <100.00%> (ø)`

devmotion · 2022-11-03T17:25:33Z

> To fix this I replaced any instance of :NaN with oftype($x, NaN).

:NaN is the DiffRules way of saying that a derivative does not exist or is not implemented. You never want to use these "derivatives" anyway; every time you do, you are already screwed.

Generally, oftype($x, NaN) seems like the wrong approach, as x might not even be a floating-point number. Just using float($x) instead also seems wrong in the two-argument functions, since it might cause undesired promotions if x is not a floating-point number (imagine, e.g., the other argument being of type Float32).
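Both concerns can be reproduced in plain Julia. The sketch below uses Base's `^` only as a stand-in for a two-argument function, and the variable names are illustrative assumptions.

```julia
# oftype(x, NaN) calls convert(typeof(x), NaN), which has no valid result for integers.
x_int = 2
threw = try
    oftype(x_int, NaN)
    false
catch err
    err isa InexactError
end

# float(x) avoids the error but pins the type to Float64 for an Int input,
# even when the other argument would keep the primal result in Float32.
y32 = 1.5f0
primal_type = typeof(x_int ^ y32)                 # Float32 after promotion
guarded_type = typeof(oftype(float(x_int), NaN))  # Float64

@show threw primal_type guarded_type
```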

ptiede · 2022-11-03T18:23:11Z

OK, I reverted the :NaN change for the two-argument functions.

However, for ldexp I think the point remains. In that case you do need oftype(float($x), exp2($y)) to prevent a spurious promotion in the derivative with respect to x.
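A small sketch of the promotion in question, using only Base functions. The inputs are arbitrary illustrative values.

```julia
x = 1.5f0
y = 3

primal = ldexp(x, y)                 # Float32: x * 2^y
raw = exp2(y)                        # Float64, because y is an Int
fixed = oftype(float(x), exp2(y))    # converted back to the float type of x

@show typeof(primal)          # Float32
@show typeof(raw)             # Float64
@show typeof(fixed)           # Float32
@show typeof(raw * 1.0f0)     # Float64: the unconverted partial widens a Float32 tangent
```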

devmotion · 2022-11-03T20:36:54Z

src/rules.jl

@@ -85,7 +85,7 @@ _abs_deriv(x) = signbit(x) ? -one(x) : one(x)
 @define_diffrule Base.atan(x, y)    = :( $y / ($x^2 + $y^2)                                 ), :( -$x / ($x^2 + $y^2)                                                     )
 @define_diffrule Base.hypot(x, y)  = :( $x / hypot($x, $y)                                      ), :(  $y / hypot($x, $y)                                                     )
 @define_diffrule Base.log(b, x)    = :( log($x) * inv(-log($b)^2 * $b)                          ), :( inv($x) / log($b)                                                       )
-@define_diffrule Base.ldexp(x, y)  = :( exp2($y)                                                ), :NaN
+@define_diffrule Base.ldexp(x, y)  = :( oftype(float($x), exp2($y))                                                ), :(oftype(float($x), NaN))


Could we even just use

Suggested change

-@define_diffrule Base.ldexp(x, y) = :( oftype(float($x), exp2($y)) ), :(oftype(float($x), NaN))
+@define_diffrule Base.ldexp(x, y) = :( oftype($x, exp2($y)) ), :NaN

? At least it seems the definitions in Base already assume that x is a floating point number:

julia> methods(ldexp)
# 7 methods for generic function "ldexp":
[1] ldexp(x::Float16, q::Integer) in Base.Math at math.jl:826
[2] ldexp(x::T, e::Integer) where T<:Union{Float16, Float32, Float64} in Base.Math at math.jl:783
[3] ldexp(x::BigFloat, n::Int64) in Base.MPFR at mpfr.jl:648
[4] ldexp(x::BigFloat, n::Union{Int16, Int32, Int64, Int8}) in Base.MPFR at mpfr.jl:658
[5] ldexp(x::BigFloat, n::UInt64) in Base.MPFR at mpfr.jl:653
[6] ldexp(x::BigFloat, n::Union{UInt16, UInt32, UInt64, UInt8}) in Base.MPFR at mpfr.jl:659
[7] ldexp(x::BigFloat, n::Integer) in Base.MPFR at mpfr.jl:660

But maybe

Suggested change

-@define_diffrule Base.ldexp(x, y) = :( oftype(float($x), exp2($y)) ), :(oftype(float($x), NaN))
+@define_diffrule Base.ldexp(x, y) = :( oftype(float($x), exp2($y)) ), :NaN

is safer - although maybe even safer would be something like oftype(ldexp($x, $y), ... (or something similar without evaluating the primal) in case the arguments are promoted in some way in some other, non-Base definitions.

ptiede · 2022-11-04

I tend to agree with the first option. I would be worried about including the primal in the calculation, since that is a much more expensive operation than exp2(y), which is just some bit shuffling since y is an integer.
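As a quick sanity check of the first option, the sketch below compares it against a finite difference. `ldexp_dx` is a hypothetical helper written for this illustration; it is not a DiffRules function.

```julia
# Hypothetical helper mirroring the first suggestion: oftype(x, exp2(y)).
ldexp_dx(x, y) = oftype(x, exp2(y))

x = 1.5f0
y = 3
h = 0.5f0

# ldexp(x, y) == x * 2^y is linear in x, so a finite difference recovers the slope.
slope = (ldexp(x + h, y) - ldexp(x, y)) / h
deriv = ldexp_dx(x, y)

@show slope deriv typeof(deriv)
```

With these inputs every intermediate value is exactly representable in Float32, so the two numbers agree exactly and the derivative keeps the type of x.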

src/rules.jl

devmotion

Looks good to me.

Make Float32 stable for both arguments

c810f8b

revert :NaN change

535597e

devmotion reviewed Nov 3, 2022

remove float guard

6828599

devmotion reviewed Nov 4, 2022

Removed spurious NaN

5767bca

devmotion approved these changes Nov 8, 2022

devmotion merged commit 815d3d8 into JuliaDiff:master Nov 8, 2022

This was referenced Nov 10, 2022

Type instability in ldexp with Float32 arguments #88

Closed

ldexp does not maintain type of Float32 arguments JuliaDiff/ForwardDiff.jl#604

Closed

Adding complex broadcasting for gradients on the GPU FluxML/Zygote.jl#1324

Merged
