Tree-based Diffusion Schrödinger Bridge

This is the official code for the paper 'Tree-Based Diffusion Schrödinger Bridge with Applications to Wasserstein Barycenters'. It extends the framework of Diffusion Schrödinger Bridge [1] to any tree-structured joint distribution with known marginals on the leaves (thus including the classical Schrödinger Bridge problem). By considering star-shaped trees, it enables to compute regularized Wasserstein-2 barycenters for high-dimensional empirical probability distributions, which is of main interest in Optimal Transport (OT). Our method is competitive with respect to state-of-the-art regularized algorithms from [2] and [3] in high-dimensional settings.

In our setting, each edge of the tree is parameterized by two neural networks, which model the forward and backward drifts of the diffusion processes. In theory, this requires to consider 2M neural networks, where M stands for the number of edges in the tree. To avoid any memory issue in practice, our code only requires to consider 2 active neural networks at each stage of the training process.

Illustration (2D)

The following plots were obtained by considering the dataset 'Swiss Roll' as the first root in the training process. The corresponding models are saved in the directory ./checkpoints_model.

Swiss Roll	Circle	Moons	Setting
			Estimation of the leaves (OT reg=0.1, 50 mIPF cycles).
			Estimation of the barycenter (OT reg=0.05, 60 mIPF cycles).
			Estimation of the barycenter (OT reg=0.1, 50 mIPF cycles).
			Estimation of the barycenter (OT reg=0.2, 50 mIPF cycles).

We provide below barycenter plots obtained by other methods.

Free-support exact barycenter [2]	Free-support regularized barycenter [2]	Convolutional regularized barycenter [4]

Illustration (MNIST)

The following plots were obtained by considering the dataset 'MNIST 6' as the first root in the training process.

MNIST 2	MNIST 4	MNIST 6	Setting
			Estimation of the leaves (OT reg=0.5, 10 mIPF cycles).
			Estimation of the barycenter (OT reg=0.5, 10 mIPF cycles).

Contributors

Maxence Noble
Valentin De Bortoli

Installation

This project can be installed from its git repository.

Obtain the sources by:

git clone [email protected]:maxencenoble/tree-diffusion-schrodinger-bridge.git

You may modify requirements.txt according to your CUDA version.

Install the packages via an Anaconda environment:

conda create -n tree_dsb python=3.8
conda activate tree_dsb
pip install -r requirements.txt

How to use this code?

For CELEBA, make sure that you already have in the path ./data/celeba the 6 required files:

list_landmarks_align_celeba.txt
list_eval_partition.txt,
list_bbox_celeba.txt,
list_attr_celeba.txt,
img_align_celeba.zip,
identity_CelebA.txt.

You can find them at https://drive.google.com/drive/folders/0B7EVK8r0v71pWEZsZE9oNnFzTm8?resourcekey=0-5BR16BdXnb8hVj6CNHKzLg.

Download datasets:

MNIST: python data.py --data mnist
CELEBA: python data.py --data celeba
Posterior aggregation (already available): python data_posterior.py --data wine --splitting hom/het

Change the configuration files:

./config/config.yaml: SDE/ODE settings for plots, initialisation setting, corrector setting
./config/dataset/: specific settings for each dataset (OT regularization, starting root, training parameters, checkpoints...)
./config/model/: specific setting for each model (fully connected neural network 'Basic' or UNET)

The size of the cache dataset used to obtain samples in the training stage is given by : cache_npar x num_cache_batches x num_steps x SHAPE, where SHAPE is the shape of the samples. Make sure that the parameter num_workers fits on your machine. If GPU has insufficient memory, then reduce the cache size.

Train models and save plots:

2d, 2 datasets - Bridge (CPU): python train_model.py dataset=2d_bridge model=Basic tree=Bridge
2d, 3 datasets - Barycenter (CPU): python train_model.py dataset=2d_3datasets model=Basic tree=Barycenter
Gaussian, 3 datasets - Barycenter (CPU, no plot): python train_model.py dataset=gaussian_3datasets model=Basic tree=Barycenter
Posterior, 3 datasets - Barycenter (CPU, no plot): python train_model.py dataset=posterior_3datasets model=Basic tree=Barycenter
MNIST, 2 datasets - Barycenter (GPU):python train_model.py dataset=stackedmnist_2datasets model=UNET tree=Barycenter
MNIST, 3 datasets - Barycenter (GPU):python train_model.py dataset=stackedmnist_3datasets model=UNET tree=Barycenter
CELEBA, 2 datasets - Barycenter (GPU):python train_model.py dataset=celeba_2datasets model=UNET tree=Barycenter

Checkpoints and sampled images will be saved to a newly created directory named experiments.

Use checkpoint models:

Make sure that the pretrained models are saved according to the structure of the tree you are considering (ie, the directory of checkpoints for this experiment has local directories source=...,dest=.../, each one containing networks for the forward and the backward sampling directions that match datasets).
Set checkpoint_run to True in the dataset configuration file.

In this repository, there are 3 sets of pretrained models for the setting 2d_3datasets, staring from the root swiss, with equal barycenter weights, each one corresponding to a certain level of OT regularization (epsilon=0.2, 0.1, 0.05). To use them, make sure that you modify the following parameters in the dataset configuration file:

epsilon
checkpoints_dir
checkpoints_f, checkpoint_b

Train models from pretrained models:

Follow Step 4.
Set start_n_ipf to the mIPF cycle corresponding to the pretrained models.

Checkpoints and sampled images will be saved to a newly created directory named experiments.

Test pretrained models:

Follow Step 4.
2d, 2 datasets - Bridge (CPU): python test_model.py dataset=2d_bridge model=Basic tree=Bridge
2d, 3 datasets - Barycenter (CPU): python test_model.py dataset=2d_3datasets model=Basic tree=Barycenter
Gaussian, 3 datasets - Barycenter (CPU, no plot): python test_model.py dataset=gaussian_3datasets model=Basic tree=Barycenter
Posterior, 3 datasets - Barycenter (CPU, no plot): python test_model.py dataset=posterior_3datasets model=Basic tree=Barycenter
MNIST, 2 datasets - Barycenter (GPU):python test_model.py dataset=stackedmnist_2datasets model=UNET tree=Barycenter
MNIST, 3 datasets - Barycenter (GPU):python test_model.py dataset=stackedmnist_3datasets model=UNET tree=Barycenter
CELEBA, 2 datasets - Barycenter (GPU):python test_model.py dataset=celeba_2datasets model=UNET tree=Barycenter

Checkpoints and sampled images will be saved to a newly created directory named experiments.

Check the setting in run_free_support_barycenter.py and compare with the method from [2]:

2d, 3 datasets - Barycenter (CPU): python run_free_support_barycenter.py --data 2d
Gaussian, 3 datasets - Barycenter (CPU, no plot): python run_free_support_barycenter.py --data gaussian
Posterior, 3 datasets - Barycenter (CPU, no plot): python run_free_support_barycenter.py --data posterior

Citation

If you use this code, please cite the following (BibTex format):

@article{noble2024tree,
  title={Tree-Based Diffusion Schr{\"o}dinger Bridge with Applications to Wasserstein Barycenters},
  author={Noble, Maxence and De Bortoli, Valentin and Doucet, Arnaud and Durmus, Alain},
  journal={Advances in Neural Information Processing Systems},
  volume={36},
  year={2024}
}

References

[1] V. De Bortoli, J. Thornton, J. Heng & A. Doucet, Diffusion Schrödinger bridge with applications to score-based generative modeling, Advances in Neural Information Processing Systems, 2021.

[2] M. Cuturi & A. Doucet, Fast computation of Wasserstein barycenters, International conference on machine learning, 2014.

[3] L. Li, A. Genevay, M. Yurochkin & J. Solomon, Continuous regularized Wasserstein barycenters, Advances in Neural Information Processing Systems, 2020.

[4] J. Solomon, F. De Goes, G. Peyré, M. Cuturi, A. Butscher, A. Nguyen, & L. Guibas Convolutional wasserstein distances: Efficient optimal transportation on geometric domains, ACM Transactions on Graphics, 2015.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Tree-based Diffusion Schrödinger Bridge

Illustration (2D)

Illustration (MNIST)

Contributors

Installation

How to use this code?

Citation

References

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 23 Commits
.idea		.idea
bridge		bridge
checkpoints_data		checkpoints_data
checkpoints_model		checkpoints_model
conf		conf
data		data
images		images
posterior_utils		posterior_utils
.DS_Store		.DS_Store
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
data.py		data.py
data_posterior.py		data_posterior.py
requirements.txt		requirements.txt
run_free_support_barycenter.py		run_free_support_barycenter.py
test_model.py		test_model.py
train_model.py		train_model.py

License

maxencenoble/tree-diffusion-schrodinger-bridge

Folders and files

Latest commit

History

Repository files navigation

Tree-based Diffusion Schrödinger Bridge

Illustration (2D)

Illustration (MNIST)

Contributors

Installation

How to use this code?

Citation

References

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages