A TensorFlow (incomplete) replication of the paper Variational Dropout Sparsifies Deep Neural Networks, based on the authors' code. Written to be relatively easy to apply.
This only implements two types of layers at the moment: fully connected and 2D convolutional. Example usage is in `mnist.py`. We are following the TensorFlow docs on variable reuse, so individual layers must have their own `variable_scope`. So, from the `mnist.py` script:
```python
import variational_dropout as vd

with tf.variable_scope('fc2'):
    y_conv = vd.fully_connected(h_fc1, phase, 10)
```
The `phase` variable is used to switch between training and test time behaviours, typically using a placeholder. `True` is training time, and the noise variables will be sampled based on the current variational parameters. `False` is test time, and the weights will be masked based on the current variational parameters. Training time is stochastic, while test time is deterministic.
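
As a rough illustration (not taken verbatim from `mnist.py`; the names `x`, `y_`, `train_step` and `accuracy` are stand-ins for whatever your own graph defines), the `phase` placeholder is typically handled like this:

```python
import tensorflow as tf

# Boolean placeholder controlling train/test behaviour of the vd layers.
phase = tf.placeholder(tf.bool, name='phase')

# ... build the network, passing `phase` into every vd layer ...

# Training: feed True so the layers sample noise from the variational posterior.
# sess.run(train_step, feed_dict={x: batch_xs, y_: batch_ys, phase: True})

# Evaluation: feed False so the layers use deterministic, masked weights.
# sess.run(accuracy, feed_dict={x: test_xs, y_: test_ys, phase: False})
```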
To train with variational dropout, the loss function must also include the KL divergence between the approximate posterior and the prior. You can think of this as a (kind of) theoretically justified regulariser. There is a function to gather the `log_alpha` variables that parameterise the approximate posterior (`vd.gather_logalphas()`) and another to estimate this KL divergence (`vd.dkl_qp()`). A typical way to calculate it and add it to the loss is given in the `mnist.py` script:
```python
# prior DKL part of the ELBO
log_alphas = vd.gather_logalphas(tf.get_default_graph())
divergences = [vd.dkl_qp(la) for la in log_alphas]
# combine to form the ELBO
N = float(mnist.train.images.shape[0])
dkl = tf.reduce_sum(tf.stack(divergences))
elbo = cross_entropy + (1. / N) * dkl
```
This is not scaled correctly to be a true ELBO, but that's not really relevant given the arbitrary choice of learning rate.
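
For reference, `vd.dkl_qp` presumably implements the paper's approximation to the per-weight negative KL divergence, k1·sigmoid(k2 + k3·log_alpha) − 0.5·log(1 + 1/alpha) − k1 with k1 = 0.63576, k2 = 1.87320, k3 = 1.48695. Below is a minimal sketch of what that looks like in TensorFlow; the function name and the sum reduction here are assumptions, so check `variational_dropout.py` for the real implementation.

```python
import tensorflow as tf

def dkl_qp_sketch(log_alpha):
    """Approximate KL(q || p) for one layer, summed over its weights."""
    # Constants from the paper's sigmoid approximation of -KL.
    k1, k2, k3 = 0.63576, 1.87320, 1.48695
    # -KL per weight; softplus(-log_alpha) == log(1 + 1/alpha).
    neg_dkl = (k1 * tf.nn.sigmoid(k2 + k3 * log_alpha)
               - 0.5 * tf.nn.softplus(-log_alpha) - k1)
    # Return the positive KL contribution to add to the loss.
    return -tf.reduce_sum(neg_dkl)
```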