
Fix perturbation confusion #247

Merged 9 commits into JuliaDiff:master on Sep 26, 2017

Conversation

@simonbyrne (Contributor)

This is an initial stab at fixing perturbation confusion. It relies on a global counter that is incremented for each (function, signature) pair: since higher-order derivatives require earlier derivatives to be defined first, these should appear on the "outside" of any nested `Dual` objects.

As I note in the comments, this could cause problems when using multiple processes, e.g.

```
ForwardDiff.gradient(x) do x
    @parallel for i = 1:n
        ...
        ForwardDiff.gradient(y) do y
            # something involving x
        end
    end
end
```
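To make the ordering invariant concrete, here is a minimal sketch (illustrative names like `fresh_tag`, not ForwardDiff's actual code): because an outer differential operator begins executing before any nested one, it always draws the smaller id, so the "older" tag can be kept on the outside when duals are combined.

```julia
# Illustrative sketch only -- not ForwardDiff's implementation.
# Each differential operator draws a fresh id from a global counter.
const TAG_COUNTER = Ref(0)
fresh_tag() = (TAG_COUNTER[] += 1)

outer = fresh_tag()   # the outer derivative starts first...
inner = fresh_tag()   # ...so any nested derivative draws a larger id

# The promotion rule can therefore always nest the newer (larger-id)
# perturbation inside the older one:
outer < inner  # true
```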

@ChrisRackauckas (Member)

> As I note in the comments, this could cause problems when using multiple processes, e.g.

👎 I think this may be too common a use case. Is there an easy way to opt out for parallel usage?

@simonbyrne (Contributor, Author)

Well, it's not a common use case at the moment, because it won't work!

There are a couple of options:

  1. Do nothing, and just tell users not to do it.
  2. Stick the process id in the tag info, and throw an error if Duals come from different processes.
  3. Always fetch new tag ids from the master process (this only happens at compilation time, so it shouldn't be too burdensome).
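Option 2 could look something like the following sketch (names such as `TagInfo` and `isouter` are hypothetical, not part of this PR):

```julia
# Hypothetical sketch of option 2: record which process created a tag,
# and refuse to compare tags from different processes.
using Distributed

struct TagInfo
    count::Int   # position in the per-process tag sequence
    pid::Int     # id of the process that created the tag
end

const COUNTER = Ref(0)
fresh_taginfo() = TagInfo(COUNTER[] += 1, myid())

# Ordering is only meaningful for tags minted by the same process:
function isouter(a::TagInfo, b::TagInfo)
    a.pid == b.pid || error("cannot mix Duals from different processes")
    return a.count < b.count
end
```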

@ChrisRackauckas (Member)

> Well, it's not a common use case at the moment, because it won't work!

Oh wait, it only breaks when you take a gradient in parallel inside a function you're already taking the gradient of? I read it as though all uses inside of `@parallel` would break. My bad.

@simonbyrne (Contributor, Author)

It occurs when `Dual`s are generated by different processes. The example I gave was the most likely one I could think of.

@jrevels (Member) commented Aug 7, 2017

It's pretty crazy/cool that this works. When I tried implementing the global counter approach back in the days of #83, I got hung up on the insane metaprogramming/compile time regressions induced by the static tag selection logic; I never thought to use ForwardDiff's existing conversion methods to handle this! Good idea.

We can merge this PR if you make the following tweaks:

  1. Fix the parallel safety problem by adding the process ID to the tag. If two tags share the same count but have different process IDs, then throw a `TagMismatchError`. Without this fix, this tagging system is strictly less safe than our current tagging system (since our current tagging system doesn't allow such calculations at all).

  2. As this PR is now, the application of the promotion rule is only implicitly enforced by an assumption that it will be encountered during computation/construction. If that assumption is violated, a differential operator could erroneously extract partials that it doesn't own, causing silently incorrect answers. We should thus enforce this rule explicitly by using tag-checked partial extraction. You can do this by adding the following methods:

```
# You'll have to move the Tag definition into dual.jl and adjust
# the TagMismatchError code to accommodate these methods.

@inline value(::Tag, x) = value(x)
@inline value(t::Tag, d::Dual) = throw(TagMismatchError(t, d))
@inline value(::T, d::Dual{T}) where {T<:Tag} = value(d)

@inline partials(::Tag, x, i...) = partials(x, i...)
@inline partials(t::Tag, d::Dual, i...) = throw(TagMismatchError(t, d))
@inline partials(::T, d::Dual{T}, i...) where {T<:Tag} = partials(d, i...)
```

...and then use those methods in the API implementation, instead of the unchecked versions.
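As a toy illustration of the point (stand-in names `MiniTag`/`MiniDual`/`minivalue`, not ForwardDiff's `Tag`/`Dual`/`TagMismatchError`): extraction with the wrong tag should fail loudly rather than silently hand back partials the operator doesn't own.

```julia
# Stand-in types for illustration only.
struct MiniTag{T} end

struct MiniDual{T}
    val::Float64
    partial::Float64
end

# Checked extraction: a plain number passes through, a matching tag
# extracts, and a mismatched tag is a loud error.
minivalue(::MiniTag, x::Real) = x
minivalue(::MiniTag{T}, d::MiniDual{T}) where {T} = d.val
minivalue(::MiniTag, d::MiniDual) = error("tag mismatch")

d = MiniDual{:outer}(1.0, 2.0)
minivalue(MiniTag{:outer}(), d)   # matching tag: extracts the value
# minivalue(MiniTag{:inner}(), d) would throw "tag mismatch" instead
# of silently returning partials owned by a different operator.
```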

@simonbyrne (Contributor, Author)

Sorry it's taken me a while to get around to this. Out of curiosity, why the `for R in REAL_TYPES` thing, rather than simply `::Real`?

@jrevels (Member) commented Sep 20, 2017

> Out of curiosity, why the `for R in REAL_TYPES` thing, rather than simply `::Real`?

Because otherwise, we'd introduce a large number of method ambiguities. Traditionally, this multiple-dispatch-centric problem is solved by promoting to single dispatch, but this promotion is too costly for the general case (which is one of the motivations behind Cassette, where the problem is solved via contextual dispatch).
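A minimal illustration of the ambiguity problem (toy type `D`, not ForwardDiff's `Dual`): two symmetric `::Real` fallbacks are equally specific for `D + D`, so that call is ambiguous unless a more specific method is defined, and enumerating concrete types avoids the fallbacks entirely.

```julia
# Toy illustration of why `::Real` fallbacks breed method ambiguities.
struct D <: Real
    val::Float64
end

Base.:+(x::D, y::Real) = D(x.val + y)   # dual + real
Base.:+(x::Real, y::D) = D(x + y.val)   # real + dual

# D(1.0) + D(2.0) would now be ambiguous: both methods above match
# and neither is more specific. Defining the same-type case resolves
# it:
Base.:+(x::D, y::D) = D(x.val + y.val)

# ForwardDiff instead sidesteps the `::Real` fallbacks by generating
# one method per concrete type, roughly:
# for R in REAL_TYPES
#     @eval Base.:+(x::D, y::$R) = D(x.val + y)
# end
```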

@simonbyrne (Contributor, Author)

Tags are process-local (an error will be thrown if you attempt to mix tags from different processes). I also changed how tags are generated: the function is no longer part of the signature.
test/DualTest.jl (outdated)

```
# @test Dual{1}(FDNUM) / FDNUM2 === Dual{1}(FDNUM / FDNUM2)
# @test FDNUM / Dual{1}(FDNUM2) === Dual{1}(FDNUM / FDNUM2)
# @test Dual{1}(FDNUM / PRIMAL, FDNUM2 / PRIMAL) === Dual{1}(FDNUM, FDNUM2) / PRIMAL
# end
```
@simonbyrne (Contributor, Author)

I had to disable these tests since I got rid of some of the binary methods (the dispatch is now handled by the promotion machinery).
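For illustration, here is roughly what "dispatch handled by the promotion machinery" means, using a toy `PD{T}` type rather than ForwardDiff's `Dual`: the mixed-tag binary methods disappear, and a promotion rule plus a single same-type method take their place.

```julia
# Toy illustration: binary ops on mixed-tag duals handled by promotion.
struct PD{T} <: Real
    val::Float64
end

# The "outer" tag (1) wins promotion; the other operand is converted.
Base.promote_rule(::Type{PD{1}}, ::Type{PD{2}}) = PD{1}
Base.convert(::Type{PD{1}}, x::PD{2}) = PD{1}(x.val)
Base.convert(::Type{PD{T}}, x::PD{T}) where {T} = x

# One same-type method; mixed-type calls go through promote first.
Base.:+(a::PD{T}, b::PD{T}) where {T} = PD{T}(a.val + b.val)
Base.:+(a::PD, b::PD) = +(promote(a, b)...)

PD{1}(1.0) + PD{2}(2.0)   # promoted to PD{1}, then added
```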

@jrevels (Member)

Those tests are pretty important to keep working - they're one of the few places where we stress nested semantics directly. Assuming this PR is correct, it should be possible to modify these tests to pass without violating their original intent.

@simonbyrne (Contributor, Author)

Okay, hopefully this should now work. I removed the function type from the `Tag`, so I had to modify some of the methods, but it does make the code a bit cleaner.

@simonbyrne (Contributor, Author)

Any suggestions about the AppVeyor failure?

@jrevels (Member) commented Sep 21, 2017

> I removed the function type from the Tag, so had to modify some of the methods, but it does make the code a bit cleaner.

We still actually need the function type in the tag: it prevents people from accidentally reusing tagged `<:AbstractConfig` objects with the wrong functions. Relatedly, it seems like you have removed the explicit opt-out mechanism (where I was using `Void` as the tag before). After adding back the function type, you'll need to restore the explicit opt-out mechanism so that we can support "unsafe" `<:AbstractConfig` preallocation for reuse across multiple target functions.

@simonbyrne (Contributor, Author)

Okay, this has been a bit of a reworking. Now the tag is parametrised by `(function, eltype)`, and a generated function gives each tag a unique sequence number which is used for comparison.

This has the side-benefit that it should now work across processes (since each process should safely generate its own sequence).

Thoughts?
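The scheme described above can be sketched as follows (illustrative names, not the PR's actual code): a `@generated` function's body runs once per distinct concrete signature, so the counter increment happens once per tag type, at compile time, on whichever process compiles it.

```julia
# Illustrative sketch of a per-(function, eltype) tag with a
# compile-time-assigned sequence number.
const NEXT_ID = Ref(0)

struct Tag{F,V} end

@generated function tagid(::Type{Tag{F,V}}) where {F,V}
    id = (NEXT_ID[] += 1)   # runs once per distinct Tag type
    return :($id)
end

a = tagid(Tag{typeof(sin),Float64})
b = tagid(Tag{typeof(cos),Float64})
a != b                                  # distinct tags, distinct ids
tagid(Tag{typeof(sin),Float64}) == a    # stable across repeated calls
```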

@tkoolen commented Sep 25, 2017

We just ran into another perturbation confusion case (one part of JuliaRobotics/RigidBodyDynamics.jl#347) which is fixed by this PR (thanks!).

I submitted a PR against sb/confused that adds a test case distilled from the RigidBodyDynamics issue: simonbyrne#1.

@jrevels (Member) commented Sep 25, 2017

> We just ran into another perturbation confusion case

I'm not sure what the bug actually is (I only skimmed the RigidBodyDynamics issue), but the test you filed doesn't involve any perturbation confusion AFAICT. The error message ForwardDiff is giving shows that it's an API problem; a config was "incorrectly" constructed/applied:

```
The provided configuration (of type [...omitted this for brevity...]) was constructed for a function other than the current target function.
ForwardDiff cannot safely perform differentiation in this context; see the following issue for details: https://github.com/JuliaDiff/ForwardDiff.jl/issues/83.
You can resolve this problem by constructing and using a configuration with the appropriate target function, e.g. `ForwardDiff.GradientConfig(#4, x)`
```

ForwardDiff should've worked on that case without this PR. In fact, it does work if you get rid of the `let` block, which makes me think it's an inference problem of some sort (or possibly that ForwardDiff is incorrectly relying on inference behavior in some way).

I'm going to file a separate issue for figuring out what that bug actually is. Thanks for the test!

@jrevels (Member) commented Sep 26, 2017

@simonbyrne Is this ready to merge? It LGTM - awesome work!

EDIT: Ah, I see, there's still the issue of how custom/opt-out tags promote...

@simonbyrne (Contributor, Author)

Yeah, basically if you use opt-out tags you have to specify your own promotion. Not sure if there's much we can do about that.

@jrevels (Member) commented Sep 26, 2017

Fair enough; the easy way to see whether or not this breaks downstream code will be to merge it 😛

@jrevels jrevels merged commit 292bfdc into JuliaDiff:master Sep 26, 2017
@simonbyrne simonbyrne deleted the sb/confused branch September 26, 2017 19:56