/vae_ssl

Scalable semi-supervised learning with deep variational autoencoders

Primary LanguageJupyter NotebookMIT LicenseMIT

Scalable Semi-Supervised Learning with Variational Autoencoders

This repo implements a series of semi-supervised variational autoencoding models in Theano. It uses theanomodels as an experimental framework.

Setup

First, make sure pip and setuptools are up to date:

pip install -U pip setuptools

Next, setup theanomodels:

git clone https://github.com/clinicalml/theanomodels
cd theanomodels
git checkout 9326bf996b7e2cf622d44ed75c62a5cd7c0dea5a
pip install -e .

Finally, setup vae_ssl:

cd ../
git clone https://github.com/clinicalml/vae_ssl
cd vae_ssl
pip install -r requirements.txt

Training

As an example, to train a Gumbel Softmax model conceptually similar to the architecture in [2], use the command:

python train.py -model GumbelSoftmaxM2 

This will train a semi-supervised variational autoencoder on MNIST using 10 random training samples from each digit class as labeled examples, for a total of 100 labeled samples, and all other training samples as unlabeled samples.

See results.ipynb for an example of how to use output results.

Example Results

All models, except MLP, are semi-supervised variational autoencoders. The autoencoders are trained on MNIST using 10 random training samples from each digit class as labeled examples, for a total of 100 labeled samples, and all other training samples as unlabeled samples. The MLP is trained on just the 100 labeled samples.

Validation Accuracy

Each line represents results for a given model averaged over 10 random seeds across validation epochs.

Samples

Samples from both p(x) and p(x,y) of the LogisticNormalM2 model at the end of training:

References

[1] Kingma, Diederik P., et al. "Semi-supervised learning with deep generative models." Advances in Neural Information Processing Systems. 2014.

[2] Jang, Eric, Shixiang Gu, and Ben Poole. "Categorical reparameterization with gumbel-softmax." arXiv preprint arXiv:1611.01144 (2016).