feat: use DifferentiationInterface for sparse AD #468

avik-pal · 2024-10-01T21:13:36Z

The only part where we don't use DI is structured Jacobians. That still uses SparseDiffTools

TODO:

construct a case with structured jacobian that is small enough for testing. something block diagonal is the easiest to generate
merge feat: use DI for structured Jacobians #470
start removing sparsedifftools from the tutorials/docs
- docs
- tests
needs Coloring and decompression for structured matrices gdalle/SparseMatrixColorings.jl#132

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

gdalle

Congrats on the adaptation and thank you for trusting me with this!
I've never used LinearSolve before so take my comments with a grain of salt, but I find it needlessly complicated to mix the SparseDiffTools legacy syntax with the new sparse API of ADTypes. Do you think it is a problem?

docs/Project.toml

docs/src/basics/sparsity_detection.md

src/internal/jacobian.jl

avik-pal · 2024-10-01T23:06:50Z

but I find it needlessly complicated to mix the SparseDiffTools legacy syntax with the new sparse API of ADTypes.

This preserves support for structured matrices. Once the support for that lands in SparseMatrixColorings we can just delete that check and everything will just work.

avik-pal · 2024-10-01T23:13:15Z

To clarify the rationale behind marking AutoSparse as deprecated:

Previously we had sparse backends (simple wrapper over dense backend) and we had sparsity specification inside the problem (sparsity, jac_prototype, colorvec). But now the AutoSparse backend holds the sparsity detection + coloring information. So this creates chances for ambiguity, say user says sparsity = Symbolics..Detector() and AutoSparse(..., Tracer...Detector()) what should take precedence? We could choose to make the autodiff via the solver have higher precedence. But the other way is also perfectly valid. This mostly causes confusion without a clear benefit (at least to me)

Now what does the new-API look like? Users always provide a dense_ad to the solver. If the function has sparsity information in the form of a detector/colorvec/prototype (basically any hint that the jacobian is sparse) we construct a AutoSparse backend using that information. From DI's perspective it always sees the final autodiff that we constructed and not the one user passed in.

gdalle · 2024-10-02T05:33:07Z

Right, so essentially you reconstruct an AutoSparse from

the backend provided to the solver
the sparsity detector / coloring provided to the function

That seems reasonable for now, but it might lead to a breaking change when we swing back to full AutoSparse support?

gdalle · 2024-10-02T05:56:19Z

This preserves support for structured matrices. Once the support for that lands in SparseMatrixColorings we can just delete that check and everything will just work.

Just to clarify, with structured matrices, there are two things we can optimize:

the coloring, for which an optimal solution is often known
the decompression, because storage is e.g. bandwise instead of columnwise

At the moment, if I understand correctly:

the coloring is in ArrayInterface.jl, reused by SparseDiffTools.jl?
the decompression is in these extensions of FiniteDiff.jl, reused by SparseDiffTools.jl

I'll try to come up with a prototype putting everything in SparseMatrixColorings.jl

gdalle · 2024-10-02T12:14:26Z

On this branch you have optimized coloring and decompression for Diagonal, Bidiagonal, Tridiagonal and BandedMatrix: gdalle/SparseMatrixColorings.jl#132. Wanna try it out?

avik-pal · 2024-10-02T13:59:29Z

On this branch you have optimized coloring and decompression for Diagonal, Bidiagonal, Tridiagonal and BandedMatrix: gdalle/SparseMatrixColorings.jl#132. Wanna try it out?

I will stack a PR on top of this and try it out

avik-pal · 2024-10-02T14:06:53Z

That seems reasonable for now, but it might lead to a breaking change when we swing back to full AutoSparse support?

What do you mean by full AutoSparse support?

I am generally against having 2 APIs to accomplish the exact same thing unless each API provides additional disjoint benefits. If you are referring to the coloring_algorithm, we can always rename colorvec to coloring and construct AutoSparse based on the type of coloring.

@ChrisRackauckas what do you think about this new API proposal for v4?

gdalle · 2024-10-02T14:22:07Z

What do you mean by full AutoSparse support?

I meant that for users who pass an AutoSparse backend to NonLinearSolve, it might be surprising to see that the choices of coloring_algorithm / sparsity_detector inside the backend are discarded, in favor of the choices colorvec / sparsity given in the function itself?

If you want to deprecate this way of handling sparsity, I would almost prefer throwing an error if someone gives you an AutoSparse.

avik-pal · 2024-10-02T14:27:38Z

If you want to deprecate this way of handling sparsity, I would almost prefer throwing an error if someone gives you an AutoSparse.

Yes I agree. The function currently throws a depwarn, that will become an error in v4

gdalle · 2024-10-02T15:39:55Z

I added support for BlockBandedMatrix and BandedBlockBandedMatrix to the same PR. Now we have the exact same colorings as ArrayInterface, although decompression could use a little more work (but it's not terrible).

gdalle · 2024-10-02T16:20:09Z

What else would you need to drop SparseDiffTools completely? In terms of functionality it should be all there, performance may still lag a bit behind but honestly I'm not even sure.

avik-pal · 2024-10-02T16:21:07Z

see the other PR. tests pass locally for me, so we can go ahead and remove sparsedifftools. I will make some additional pushes to remove sparse diff tools from the tests and docs

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

avik-pal · 2024-10-02T16:57:16Z

Added a big table to clarify the selection mechanism https://github.com/SciML/NonlinearSolve.jl/blob/ap/sparse_di/docs/src/basics/sparsity_detection.md

oscardssmith · 2024-10-02T16:58:02Z

Is there anything we should be doing to pass down the sparsity info that we get down to LinearSolve? It seems like knowing that should be able to make the solve faster.

avik-pal · 2024-10-02T16:58:54Z

We are giving LinearSolve the exact matrix it needs to solve

avik-pal · 2024-10-02T16:59:33Z

We assume for SparseMatrixCSC jacobians the sparsity struture cannot arbitrarily change over iterations

gdalle · 2024-10-02T17:13:41Z

Yeah that goes without saying for me but if you prepare DI's Jacobian with a given sparsity pattern, you'll get incorrect outputs if the sparsity changes between points

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

gdalle · 2024-10-02T19:32:17Z

Fair warning: you're probably going to see an epic slowdown when computing sparse Jacobians with FiniteDiff. The reason is that, in DI, sparse Jacobians rely on pushforwards, and I don't have a native pushforward operator in FiniteDiff so I make do with a derivative closure.

oscardssmith · 2024-10-02T20:19:05Z

Can a native pushforward be added to FiniteDiff? it seems like it should be pretty straightforward...

gdalle · 2024-10-02T20:37:11Z

Knock yourselves out! I'll happily use it.

gdalle · 2024-10-02T21:21:33Z

In the meantime, maybe I'll try to use the sparse Jacobian inside FiniteDiff as a workaround

oscardssmith · 2024-10-03T01:40:52Z

JuliaDiff/FiniteDiff.jl#191

gdalle · 2024-10-03T11:17:23Z

@avik-pal with the very newest versions of DifferentiationInterface (0.6.4) and SparseMatrixColorings (0.4.4), your structured tests from #470 should also pass.
If you just want the thing to work but you're okay with slightly suboptimal colorings until gdalle/SparseMatrixColorings.jl#139 is merged, then you can remove the dependency on a specific branch and use the latest from the registry. I added tests for structured matrices in gdalle/SparseMatrixColorings.jl#137 and they already succeed with the current version of the package, without the optimized implementation of gdalle/SparseMatrixColorings.jl#139.

avik-pal added 3 commits October 1, 2024 15:27

refactor: reorder imports

5f76313

refactor: remove Symbolics

78e19a9

feat: use DI for sparse jacobians

fc7fcdb

avik-pal requested a review from gdalle October 1, 2024 21:22

chore: apply suggestions from code review

607f5c4

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

avik-pal force-pushed the ap/sparse_di branch from 973dd1f to 607f5c4 Compare October 1, 2024 21:23

gdalle reviewed Oct 1, 2024

View reviewed changes

chore: apply suggestions from code review

35bea3e

avik-pal requested a review from ChrisRackauckas October 1, 2024 23:13

avik-pal mentioned this pull request Oct 2, 2024

perf: testing the newer nonlinear solve versions SciML/SciMLBenchmarks.jl#1077

Merged

test: structured jacobians

fb8b1ee

avik-pal and others added 3 commits October 2, 2024 12:55

feat: using DI for structured Jacobians

294ec09

docs: add a table to guarantee selections

7836f04

chore: apply suggestions from code review

89b679c

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

avik-pal linked an issue Oct 2, 2024 that may be closed by this pull request

Start using DifferentiationInterface.jl #415

Closed

avik-pal and others added 2 commits October 2, 2024 13:16

fix: remove stale load

76fe551

chore: apply formatting suggestion

acf377f

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

avik-pal added 4 commits October 3, 2024 09:37

docs: remove Symbolics and SparseDiffTools

1356cfe

docs: remove unnecessary ADTypes docs

8449406

test: remove sparsedifftools and symbolics from tests

fa575c6

refactor: remove Zygote extension

d6b5536

avik-pal force-pushed the ap/sparse_di branch from 2ad7a1d to d6b5536 Compare October 3, 2024 14:12

docs: add documenter interlinks as a dep

c562fa1

avik-pal force-pushed the ap/sparse_di branch from 4e0b3b8 to c562fa1 Compare October 3, 2024 14:33

docs: fix external references

6dcb0da

avik-pal force-pushed the ap/sparse_di branch from d77da29 to 6dcb0da Compare October 3, 2024 15:13

avik-pal merged commit f1969a2 into master Oct 3, 2024
33 of 36 checks passed

avik-pal deleted the ap/sparse_di branch October 3, 2024 17:07

gdalle mentioned this pull request Oct 3, 2024

NoColoringAlgorithm not supported by DI yet #471

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: use DifferentiationInterface for sparse AD #468

feat: use DifferentiationInterface for sparse AD #468

avik-pal commented Oct 1, 2024 •

edited

Loading

gdalle left a comment

avik-pal commented Oct 1, 2024

avik-pal commented Oct 1, 2024

gdalle commented Oct 2, 2024

gdalle commented Oct 2, 2024

gdalle commented Oct 2, 2024 •

edited

Loading

avik-pal commented Oct 2, 2024

avik-pal commented Oct 2, 2024

gdalle commented Oct 2, 2024 •

edited

Loading

avik-pal commented Oct 2, 2024

gdalle commented Oct 2, 2024

gdalle commented Oct 2, 2024

avik-pal commented Oct 2, 2024

avik-pal commented Oct 2, 2024

oscardssmith commented Oct 2, 2024

avik-pal commented Oct 2, 2024

avik-pal commented Oct 2, 2024

gdalle commented Oct 2, 2024

gdalle commented Oct 2, 2024

oscardssmith commented Oct 2, 2024

gdalle commented Oct 2, 2024

gdalle commented Oct 2, 2024

oscardssmith commented Oct 3, 2024

gdalle commented Oct 3, 2024 •

edited

Loading

feat: use DifferentiationInterface for sparse AD #468

feat: use DifferentiationInterface for sparse AD #468

Conversation

avik-pal commented Oct 1, 2024 • edited Loading

TODO:

gdalle left a comment

Choose a reason for hiding this comment

avik-pal commented Oct 1, 2024

avik-pal commented Oct 1, 2024

gdalle commented Oct 2, 2024

gdalle commented Oct 2, 2024

gdalle commented Oct 2, 2024 • edited Loading

avik-pal commented Oct 2, 2024

avik-pal commented Oct 2, 2024

gdalle commented Oct 2, 2024 • edited Loading

avik-pal commented Oct 2, 2024

gdalle commented Oct 2, 2024

gdalle commented Oct 2, 2024

avik-pal commented Oct 2, 2024

avik-pal commented Oct 2, 2024

oscardssmith commented Oct 2, 2024

avik-pal commented Oct 2, 2024

avik-pal commented Oct 2, 2024

gdalle commented Oct 2, 2024

gdalle commented Oct 2, 2024

oscardssmith commented Oct 2, 2024

gdalle commented Oct 2, 2024

gdalle commented Oct 2, 2024

oscardssmith commented Oct 3, 2024

gdalle commented Oct 3, 2024 • edited Loading

avik-pal commented Oct 1, 2024 •

edited

Loading

gdalle commented Oct 2, 2024 •

edited

Loading

gdalle commented Oct 2, 2024 •

edited

Loading

gdalle commented Oct 3, 2024 •

edited

Loading