introduce multivariate cdf / quantiles #447

marcocuturi · 2023-10-25T10:01:30Z

The idea comes from the great work of Marc Hallin and colleagues.

The implementation essentially relies on entropic maps (Pooladian and Niles-Weed) to approximate both the CDF and the quantile Monge maps, i.e. the forward and backward maps between the input measure and a reference measure (uniform by default).

We can expect performance to degrade with dimension, and the results are sensitive to epsilon. Both effects can be observed in the tests, hence the fairly loose atol values.
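As a rough illustration of the two paragraphs above, here is a minimal NumPy sketch, not the OTT-JAX code from this PR. It runs a log-domain Sinkhorn loop between skewed 1D inputs and uniform reference samples, builds the forward (CDF) and backward (quantile) entropic maps from the dual potentials, and compares the CDF error for a small and a large epsilon. The helper names (`sinkhorn_potentials`, `entropic_map`, `cdf_error`), the sample sizes, and the epsilon values are all assumptions chosen for the example.

```python
import numpy as np


def logsumexp(a, axis):
    """Numerically stable log-sum-exp along `axis`."""
    a_max = np.max(a, axis=axis, keepdims=True)
    return np.squeeze(a_max, axis=axis) + np.log(np.sum(np.exp(a - a_max), axis=axis))


def sq_cost(x, y):
    """Pairwise squared Euclidean cost between rows of x [n, d] and y [m, d]."""
    return np.sum((x[:, None, :] - y[None, :, :]) ** 2, axis=-1)


def sinkhorn_potentials(x, y, epsilon, num_iters=500):
    """Log-domain Sinkhorn with uniform weights; returns dual potentials (f, g)."""
    n, m = x.shape[0], y.shape[0]
    cost = sq_cost(x, y)
    log_a, log_b = np.full(n, -np.log(n)), np.full(m, -np.log(m))
    f, g = np.zeros(n), np.zeros(m)
    for _ in range(num_iters):
        f = -epsilon * logsumexp((g[None, :] - cost) / epsilon + log_b[None, :], axis=1)
        g = -epsilon * logsumexp((f[:, None] - cost) / epsilon + log_a[:, None], axis=0)
    return f, g


def entropic_map(points, support, potential, epsilon):
    """Entropic map: softmax-weighted average of `support` for each point."""
    logits = (potential[None, :] - sq_cost(points, support)) / epsilon
    logits -= logits.max(axis=1, keepdims=True)  # stabilise the softmax
    weights = np.exp(logits)
    weights /= weights.sum(axis=1, keepdims=True)
    return weights @ support


rng = np.random.default_rng(0)
u = np.sort(rng.uniform(size=128))
inputs = (u ** 2)[:, None]            # skewed samples on [0, 1]; true CDF is sqrt(x)
targets = rng.uniform(size=(128, 1))  # uniform reference samples


def cdf_error(epsilon):
    """Mean absolute gap between the entropic CDF map and the true CDF."""
    _, g = sinkhorn_potentials(inputs, targets, epsilon)
    cdf_vals = entropic_map(inputs, targets, g, epsilon)[:, 0]
    return np.mean(np.abs(cdf_vals - np.sqrt(inputs[:, 0])))


err_small, err_large = cdf_error(1e-2), cdf_error(10.0)

# Backward (quantile) map: reference samples are sent back to the input space.
f, _ = sinkhorn_potentials(inputs, targets, 1e-2)
quantiles = entropic_map(targets, inputs, f, 1e-2)
print(err_small, err_large, quantiles.shape)
```

With a large epsilon the forward map collapses toward the mean of the reference samples, which is one way to see why the estimates are sensitive to epsilon and why the tests use loose tolerances.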

codecov · 2023-10-25T11:08:18Z

Codecov Report

Merging #447 (82f700a) into main (369db8c) will increase coverage by 0.02%.
The diff coverage is 100.00%.

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #447      +/-   ##
==========================================
+ Coverage   90.56%   90.59%   +0.02%     
==========================================
  Files          57       57              
  Lines        6256     6274      +18     
  Branches      884      888       +4     
==========================================
+ Hits         5666     5684      +18     
  Misses        448      448              
  Partials      142      142

| Files | Coverage Δ | |
|---|---|---|
| src/ott/tools/soft_sort.py | `96.07% <100.00%> (+0.52%)` | ⬆️ |

src/ott/tools/soft_sort.py

tests/tools/soft_sort_test.py

marcocuturi · 2023-10-25T20:20:57Z

src/ott/tools/soft_sort.py

@@ -499,31 +500,33 @@ def multivariate_cdf_quantile_maps(
      to be uniform by default.
    kwargs: keyword arguments passed on to the :func:`~ott.solvers.linear.solve`
      function, which solves the OT problem between ``inputs`` and ``targets``
-      using the Sinkhorn algorithm.
+      using the :class:`~ott.solvers.linear.sinkhorn.Sinkhorn` algorithm.


Here I wasn't sure about the reference, because we use `solve`... That being said, it is probably not a good idea to use LR (low-rank) on this, because it would crash :) So this may be an instance where we should force kwargs to refer only to Sinkhorn...
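One way the "force kwargs to only refer to Sinkhorn" idea could look is sketched below. This is a hypothetical guard, not code from this PR: the name `sinkhorn_only` is invented for illustration, and it assumes that a positive `rank` keyword is what would select a low-rank solver in `solve`, with `-1` meaning plain Sinkhorn.

```python
def sinkhorn_only(kwargs):
    """Hypothetical guard: refuse kwargs that would select a low-rank solver.

    Assumption: a positive `rank` requests a low-rank solver, while -1
    (taken here as the default) means the regular Sinkhorn solver.
    """
    rank = kwargs.get("rank", -1)
    if rank > 0:
        raise ValueError(f"Low-rank solvers are not supported here, got rank={rank}.")
    return dict(kwargs)


# Regular Sinkhorn keyword arguments pass through unchanged.
ok = sinkhorn_only({"max_iterations": 100})

# A low-rank request is rejected up front instead of crashing later.
try:
    sinkhorn_only({"rank": 5})
    rejected = False
except ValueError:
    rejected = True
```

Failing early with a clear message is a design choice; the alternative of silently dropping `rank` would hide the mismatch from the caller.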

marcocuturi · 2023-10-25T20:21:16Z

src/ott/tools/soft_sort.py

@@ -479,12 +480,12 @@ def multivariate_cdf_quantile_maps(

  Args:
    inputs: 2D array of ``[n, d]`` vectors.
-    target_sampler: Callable that takes a ``key`` and ``[m,d]`` shape.
+    target_sampler: Callable that takes a ``rng`` and ``[m, d]`` shape.


sorry about this one!!!
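For readers unfamiliar with the `target_sampler` contract in the docstring above (a callable taking an `rng` and an `[m, d]` shape), here is a minimal sketch. It uses a NumPy `Generator` as a stand-in for the random state, which is an assumption made to keep the example dependency-light; in the JAX setting of this PR the `rng` would be a JAX PRNG key and the sampler would call something like `jax.random.uniform(rng, shape)` instead. The function names are hypothetical.

```python
import numpy as np


def uniform_sampler(rng, shape):
    """Reference samples on the unit hypercube; `shape` is (m, d)."""
    return rng.uniform(size=shape)


def gaussian_sampler(rng, shape):
    """Alternative reference measure: standard Gaussian samples of shape (m, d)."""
    return rng.standard_normal(size=shape)


rng = np.random.default_rng(0)
targets = uniform_sampler(rng, (16, 3))        # m = 16 samples in d = 3
gauss_targets = gaussian_sampler(rng, (16, 3))
```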

marcocuturi · 2023-10-25T20:22:33Z

thanks a lot for the last fixes Michal!

* introduce multiv cdf / quantiles
* fix online memory
* incorporate feedback from review
* various fixes
* remove doi
* adding jittability
* adding docs
* Fix docstrings
* Update test
* Fix spellchecker

---------

Co-authored-by: Michal Klein <[email protected]>

* Bump `jax>=0.4`
* Update Docker image
* Change GPU test name
* Fix typo
* Don't pre-allocate memory on GPU
* Update step name
* Fix GPU device number and jax installation
* introduce multivariate cdf / quantiles (#447) (#449)
  * introduce multiv cdf / quantiles
  * fix online memory
  * incorporate feedback from review
  * various fixes
  * remove doi
  * adding jittability
  * adding docs
  * Fix docstrings
  * Update test
  * Fix spellchecker

---------

Co-authored-by: Michal Klein <[email protected]>

introduce multiv cdf / quantiles

71bd777

fix online memory

52571b2

michalk8 self-requested a review October 25, 2023 11:10

michalk8 assigned marcocuturi Oct 25, 2023

michalk8 added the enhancement New feature or request label Oct 25, 2023

michalk8 requested changes Oct 25, 2023

incorporate feedback from review

10967a5

michalk8 requested changes Oct 25, 2023

marcocuturi and others added 7 commits October 25, 2023 15:55

various fixes

272a20a

remove doi

8ef9593

adding jittability

6a3dbdc

adding docs

0d50f93

Fix docstrings

9959613

Update test

95b5940

Fix spellchecker

82f700a

marcocuturi commented Oct 25, 2023

marcocuturi merged commit 3706511 into main Oct 25, 2023

marcocuturi deleted the multivq branch October 31, 2023 11:03
