Force ICNN to adopt default initialization of its own layers #551

Merged
merged 1 commit into main from fixdefaultinit on Jun 25, 2024

Conversation

Algue-Rythme
Collaborator

The ICNN used to rely on initialization with normal matrices. Now it falls back to the default behavior of its own layers, i.e. `lecun_normal`, which scales the standard deviation of the weights with 1/sqrt(fan_in). This std is much smaller for wide networks.
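For reference, a minimal sketch (not the ICNN code itself) contrasting a plain normal initializer with `lecun_normal` from `jax.nn.initializers`; the layer shapes below are illustrative:

```python
import jax
import jax.numpy as jnp
from jax.nn import initializers

# Illustrative shapes: a "wide" layer with a large fan_in.
fan_in, fan_out = 1024, 128
key = jax.random.PRNGKey(0)

# Plain normal init: std stays fixed regardless of layer width.
w_normal = initializers.normal(stddev=1.0)(key, (fan_in, fan_out))

# lecun_normal: std ~ 1 / sqrt(fan_in), so it shrinks for wide layers.
w_lecun = initializers.lecun_normal()(key, (fan_in, fan_out))

print(jnp.std(w_normal))  # ≈ 1.0
print(jnp.std(w_lecun))   # ≈ 1 / sqrt(1024) ≈ 0.031
```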


codecov bot commented Jun 21, 2024

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 91.39%. Comparing base (787d4a9) to head (27c582a).
Report is 39 commits behind head on main.

Additional details and impacted files


@@            Coverage Diff             @@
##             main     #551      +/-   ##
==========================================
+ Coverage   91.38%   91.39%   +0.01%     
==========================================
  Files          69       69              
  Lines        7242     7242              
  Branches     1019     1018       -1     
==========================================
+ Hits         6618     6619       +1     
  Misses        472      472              
+ Partials      152      151       -1     
Files with missing lines | Coverage Δ
src/ott/neural/networks/icnn.py | 94.54% <100.00%> (ø)

... and 1 file with indirect coverage changes

@michalk8 michalk8 self-requested a review June 25, 2024 09:26
Collaborator

@michalk8 michalk8 left a comment


Thanks @Algue-Rythme, LGTM!

@michalk8 michalk8 merged commit d3b6c40 into main Jun 25, 2024
13 checks passed
@michalk8 michalk8 deleted the fixdefaultinit branch June 25, 2024 09:26
michalk8 added a commit that referenced this pull request Oct 16, 2024
* Start batched vmap

* Initial `batched_vmap` impl

* Nicer formatting

* Fix getting shape

* Remove private API usage

* Fix new args

* Add a TODO

* Canonicalize axes

* Add `batched_vmap` to docs

* Removed batched transport functions

* Remove `_norm_{x,y}` from `CostFn`

* Implement `apply_lse_kernel`

* Implement `apply_kernel`

* Implement `apply_cost`

* Remove old functions

* Make function private

* Refactor `apply_cost` to have consistent shapes

* Use `_apply_cost_to_vec` in `PointCloud`

* Remove TODO

* Formatting

* Simplify `_apply_sqeucl_cost`

* Fix `RecursionError`

* Remove docstring of a private method

* Fix `apply_lse_kernel`

* Squeeze only 1 axis of the cost

* Add TODO

* Rename function, make a property

* Remove unused helper function

* Compute mean summary online

* Compute mean online

* Compute max cost matrix

* Update error message

* Remove TODO

* Flatten out axes

* Fix missing cross terms in the costs

* Fix geom tests

* Fix dtype

* Start implementing transport functions

* Implement online transport functions

* Fix solver tests

* Fix Bures test

* Don't use `pairwise` in tests

* Update notebook that uses `norm`

* Fix bug in `UnbalancedBures`

* Rename `pairwise -> __call__`

* Remove old shape code

* Always instantiate the cost for online

* Remove old TODO

* Extract `_apply_cost_to_vec_fast`

* Update max cost in LRCGeom

* Fix test, use more `multi_dot`

* Remove `batch_size` from `LRCGeometry`

* Add better warning error

* Reorder properties

* Add docs to `batched_vmap`

* Start adding tests

* Reorder functions in test

* Fix axes, add a test

* Update test fn

* Move out assert

* Don't canonicalize `out_axes`

* Check max traces

* Test memory of batched vmap

* Install `typing_extensions`

* Remove `.` from description

* Add more `out_axes` tests

* Add `in_axes` test

* Fix negative axes

* Increase memory limit in the test

* Add in_axes pytree test

* Remove old warnings filters

* Update fixtures

* Update SqEucl cost.

* Update docstrings

* Remove unused imports from the docs

* Revert test pre-commits

* Fix ICNN init notebook

Was broken by #551

* Improve error message
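The squashed commit above centers on a `batched_vmap` utility, a memory-bounded variant of `jax.vmap` that processes inputs in chunks. Below is a minimal, hypothetical sketch of the chunking idea only; the actual utility also handles `in_axes`/`out_axes` and pytrees, which this illustration omits:

```python
import jax
import jax.numpy as jnp

def chunked_vmap(fn, batch_size):
    """Hypothetical helper: map `fn` over the leading axis in fixed-size chunks.

    Peak memory is bounded by one chunk instead of the full batch; this sketch
    assumes the leading dimension is divisible by `batch_size`.
    """
    def wrapped(xs):
        n = xs.shape[0]
        chunks = xs.reshape(n // batch_size, batch_size, *xs.shape[1:])
        # vmap within a chunk, lax.map (sequential) across chunks.
        out = jax.lax.map(jax.vmap(fn), chunks)
        return out.reshape(n, *out.shape[2:])
    return wrapped

# Example: squared norms of 10_000 points, materializing 100 rows at a time.
x = jnp.ones((10_000, 3))
norms = chunked_vmap(lambda v: jnp.sum(v ** 2), batch_size=100)(x)
print(norms.shape)  # (10000,)
```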