Add quadratic layers and enhance ICNNs, update tutorial #477
Conversation
Check out this pull request on ReviewNB to see visual diffs & provide feedback on Jupyter Notebooks.
Amazing Nina! @michalk8 can we wait with this PR, and then update the ICNN such that it inherits from the new base classes?
Codecov Report

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #477      +/-   ##
==========================================
+ Coverage   90.43%   90.53%   +0.09%
==========================================
  Files          60       60
  Lines        6546     6551       +5
  Branches      930      933       +3
==========================================
+ Hits         5920     5931      +11
+ Misses        480      472       -8
- Partials      146      148       +2
thanks Nina, starting with a minor review.
I am wondering whether we may want to provide more flexibility when creating the ICNN: not only a sequence of dimensions, but also a sequence of what you called rank (i.e. the rank of the linear operator that is "squared" to obtain a PSD operator on top of the diagonal). I would also suggest calling rank something a bit different (it is not the rank of the PSD operator, since that takes the diagonal element into account as well); same for d_i. Maybe:
d_i -> diagonal
A_i -> matrix or lr_matrix
rank -> rank_matrix or rank_lr_matrix
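For readers following along, here is a minimal, purely illustrative Flax sketch of the parameterization discussed above: a PSD operator built as a diagonal term plus a low-rank factor "squared", i.e. D + A Aᵀ. This is not the PR's implementation; the module name and the `rank`, `diag_init`, `matrix_init` parameters, as well as the softplus constraint on the diagonal, are assumptions made for the example.

```python
# Illustrative sketch only (not the PR's implementation): a quadratic layer
# whose PSD operator is parameterized as D + A A^T, with D a non-negative
# diagonal and A a (dim x rank) low-rank factor -- "rank" as discussed above.
from typing import Callable

import jax
import jax.numpy as jnp
import flax.linen as nn


class QuadraticPotentialSketch(nn.Module):
  """Evaluates 0.5 * x^T (D + A A^T) x for a batch of points x."""
  dim: int    # input dimension
  rank: int   # width of the low-rank factor A
  diag_init: Callable = nn.initializers.ones
  matrix_init: Callable = nn.initializers.lecun_normal()

  @nn.compact
  def __call__(self, x: jnp.ndarray) -> jnp.ndarray:
    # Non-negative diagonal part D (softplus keeps entries positive).
    raw_diag = self.param("raw_diag", self.diag_init, (self.dim,))
    diag = jax.nn.softplus(raw_diag)
    # Low-rank factor A; A @ A.T is PSD by construction.
    a = self.param("low_rank", self.matrix_init, (self.dim, self.rank))
    quad_diag = 0.5 * jnp.sum(diag * x ** 2, axis=-1)
    quad_lr = 0.5 * jnp.sum((x @ a) ** 2, axis=-1)
    return quad_diag + quad_lr


# Usage: potentials for 8 points in R^5, with a rank-2 low-rank factor.
key = jax.random.PRNGKey(0)
layer = QuadraticPotentialSketch(dim=5, rank=2)
x = jax.random.normal(key, (8, 5))
params = layer.init(key, x)
values = layer.apply(params, x)  # shape (8,)
```

Under such a parameterization, rank only controls the width of A, not the rank of the full operator D + A Aᵀ, which is the distinction the renaming suggestion above is getting at.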
thanks so much Michal! minor comments, LGTM
precision: Any = None
kernel_init: Optional[Callable[[PRNGKey, Shape, Dtype], Array]] = None,
bias_init: Callable[[PRNGKey, Shape, Dtype], Array] = nn.initializers.zeros
kernel_init: Callable[[PRNGKey, Shape, Dtype], Array] = DEFAULT_KERNEL_INIT
I think it is counterproductive to use, for this layer, a default initializer whose values are symmetric around 0. Here this will result in half of the entries being below 0, and their gradients will likely vanish quite quickly. See e.g. https://openreview.net/pdf?id=pWZ97hUQtQ. Although I am not sure what we could use, initializing by default with the absolute value of the default initializer seems more appropriate. Another legitimate option would be to normalize any kernel matrix so that its row values sum to 1.
How about, for simplicity, a truncated normal with low=0.0?
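To make the options above concrete, here is a hedged sketch (assumed names and values, not the initializer the PR finally adopts) of two non-negative kernel initializers in JAX: the absolute value of the default LeCun-normal initializer, and a normal truncated to [0, ∞).

```python
# Illustrative non-negative kernel initializers (sketches, not the PR's final
# choice): (1) absolute value of the default LeCun-normal initializer,
# (2) a normal truncated to non-negative values.
import jax
import jax.numpy as jnp
import flax.linen as nn


def abs_lecun_normal(key, shape, dtype=jnp.float32):
  """Absolute value of a standard (LeCun-normal) initializer."""
  return jnp.abs(nn.initializers.lecun_normal()(key, shape, dtype))


def nonneg_truncated_normal(stddev: float = 0.1):
  """Truncated normal with lower bound 0.0, as suggested above."""

  def init(key, shape, dtype=jnp.float32):
    # jax.random.truncated_normal samples a standard normal restricted to
    # [lower, upper]; rescale by stddev afterwards.
    samples = jax.random.truncated_normal(
        key, lower=0.0, upper=10.0, shape=shape, dtype=dtype
    )
    return stddev * samples

  return init


key = jax.random.PRNGKey(0)
w1 = abs_lecun_normal(key, (4, 3))
w2 = nonneg_truncated_normal()(key, (4, 3))
assert bool((w1 >= 0).all()) and bool((w2 >= 0).all())
```

The third option mentioned above, normalizing each kernel row to sum to 1, could be built the same way by dividing any non-negative draw by its row sums.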
z = self.act_fn(z)
z += self.pos_def_potential(x)
return z.squeeze()
source, target = self.gaussian_map_samples
Something I am a bit unsure about: in the current setup, the Gaussian quadratic potential (i.e. just one) is added at the very end, regardless of all the rest. Is there a way to ensure its scale is reasonable / meaningful / large enough compared to the rest of the initialization?
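One illustrative way to address this concern (purely an assumption for the sake of example, not what the PR implements) would be to blend the network term and the quadratic potential with a learnable weight so that neither dominates at initialization:

```python
# Sketch only (an assumption, not the PR's implementation): blend the ICNN
# output and the quadratic potential with a learnable weight so that neither
# term dominates at initialization.
from typing import Callable

import jax
import jax.numpy as jnp
import flax.linen as nn


class BlendedPotentialSketch(nn.Module):
  icnn_part: Callable[[jnp.ndarray], jnp.ndarray]       # e.g. the ICNN body
  quadratic_part: Callable[[jnp.ndarray], jnp.ndarray]  # e.g. the quadratic potential

  @nn.compact
  def __call__(self, x: jnp.ndarray) -> jnp.ndarray:
    # sigmoid(0) = 0.5, so both terms start with equal weight.
    logit = self.param("blend_logit", nn.initializers.zeros, ())
    alpha = jax.nn.sigmoid(logit)
    return alpha * self.icnn_part(x) + (1.0 - alpha) * self.quadratic_part(x)


# Usage with toy potentials standing in for the real modules.
key = jax.random.PRNGKey(0)
x = jax.random.normal(key, (8, 3))
model = BlendedPotentialSketch(
    icnn_part=lambda v: jnp.sum(jax.nn.relu(v), axis=-1),
    quadratic_part=lambda v: 0.5 * jnp.sum(v ** 2, axis=-1),
)
params = model.init(key, x)
values = model.apply(params, x)  # shape (8,)
```

Starting the blend at 0.5 gives both terms equal weight at initialization; rescaling the quadratic potential by a learned positive scalar would be a similar alternative.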
Good point, will check #90 for answers
Added this
I will need to regenerate the notebooks, will merge after, thanks @nvesseron for the additions and @marcocuturi for the review.
Merging now, let's move the discussion on initialization to another issue/PR.
* fix a bug when bias is False
* update the PosDefPotentials class
* added icnn adjustments
* neuraldual fix freeze weights
* use relu by default as activation function and rectifier_fn
* updates
* Update neural layers
* Clean ICNN impl.
* Revert changes in the potentials
* Fix D102
* Fix indentation
* Remove `;`
* Use tensordot
* Update docs
* First round of test fixing
* Fix rest of the tests
* Revert assertion
* Polish more docs
* Fix docs linter
* Fix links in neuraldual notebook
* Fix links in the rest of the neural docs
* Update docs
* Allow ranks to be a tuple
* Remove note
* Fix MetaMLP
* Rerun neural notebooks
* Fix rendering

Co-authored-by: lucaeyring <[email protected]>
Co-authored-by: Michal Klein <[email protected]>
As discussed during the hackathon, we updated the code of the quadratic layers and fixed the bug described in issue #463. We added these layers to the ICNN class and updated the neural_dual tutorial.