This repository is a reproduction of the paper Label-Free Explainability for Unsupervised Models. It is heavily based on the authors' original code, which is available here.
We have saved the conda environment we used, so it can be recreated with
conda env create --file environment.yml
While some of the training runs require more GPU RAM, the inference runs described below can be run on a 10GB GPU. The repository contains most of our pretrained models and therefore needs approximately 400MB of disk space.
The structure of this repository can best be understood in relation to our accompanying report.
Below, we indicate how each experiment in the Methodology (Section 3.1 of the report) can be run using this codebase.
By default, each training script saves the relevant figures and models into a directory under results/. When the same scripts are run in inference mode, they instead use these existing models to quickly regenerate the plots used in our paper. These plots appear under the figures/ folder.
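As an illustration of this pattern, the experiment scripts roughly follow the sketch below (a simplified, hypothetical reconstruction; the argument names other than `--name` and `--inference` and the helper functions are illustrative, not the actual code):

```python
import argparse
from pathlib import Path

def parse_args(argv=None):
    # Sketch of the shared command-line interface of the experiment scripts.
    parser = argparse.ArgumentParser()
    parser.add_argument("--name", default="consistency_features")
    parser.add_argument("--inference", action="store_true",
                        help="reuse saved models instead of retraining")
    return parser.parse_args(argv)

def run(args):
    # Hypothetical dispatch: train from scratch, or reuse saved models.
    model_dir = Path("results") / args.name
    if args.inference:
        return f"loading pretrained models from {model_dir}"
    return f"training from scratch, saving to {model_dir}"

message = run(parse_args(["--inference"]))
```

The `action="store_true"` flag is what makes a bare `--inference` on the command line toggle inference mode.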
These experiments produce Figure 1 in the report, which shows how masking important features affects representation shift in a number of datasets.
# MNIST
python experiments/mnist.py --name consistency_features --inference
# ECG5000
python experiments/ecg5000.py --name consistency_features --inference
# CIFAR10
python experiments/cifar100_from_mnist.py --name consistency_features --data cifar10 --inference
# CIFAR100
python experiments/cifar100_from_mnist.py --name consistency_features --data cifar100 --inference
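The quantity behind Figure 1 can be sketched as follows. This is a toy illustration only: it uses a random linear map as the "encoder", mean imputation as the mask, and a closed-form importance score that holds for linear maps; the repository's actual models and feature-importance methods differ.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy stand-in for a trained encoder: a fixed random linear map.
W = rng.normal(size=(8, 20))
encode = lambda x: W @ x

x = rng.normal(size=20)
baseline = np.full_like(x, x.mean())  # mask value (mean imputation)

# Toy feature-importance scores: for a linear encoder, masking feature i
# moves the representation by |x_i - baseline_i| * ||column i of W||.
importance = np.abs(x - baseline) * np.linalg.norm(W, axis=0)

def shift_after_masking(k):
    """Representation shift when the top-k most important features are masked."""
    masked = x.copy()
    top = np.argsort(importance)[::-1][:k]
    masked[top] = baseline[top]
    return float(np.linalg.norm(encode(x) - encode(masked)))
```

Figure 1 plots this kind of shift as the number of masked features grows, for each dataset and importance method.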
These experiments produce Figure 2 in the report, which shows how similarity rates are affected by the choice of example importance metric. Note, however, that to save approximately 99% of the computation in inference mode, we have removed the gradient-based methods here and kept only the nearest neighbor approaches. For a full run, make sure you have enough space on your GPU and remove the --inference flag.
# MNIST
python experiments/mnist.py --name consistency_examples --inference
# ECG5000
python experiments/ecg5000.py --name consistency_examples --inference
# CIFAR10
python experiments/cifar100_from_mnist.py --name consistency_examples --data cifar10 --inference
# CIFAR100
python experiments/cifar100_from_mnist.py --name consistency_examples --data cifar100 --inference
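A minimal sketch of what these experiments measure, under toy assumptions (random latent vectors, a uniform-weight nearest-neighbor score standing in for the label-free DKNN-style importance, and a simple top-k overlap as the similarity rate; the codebase's actual definitions follow the original paper):

```python
import numpy as np

rng = np.random.default_rng(1)

# Toy latent representations: 50 "training" examples and one "test" example.
train_latents = rng.normal(size=(50, 8))
test_latent = rng.normal(size=8)

def knn_example_importance(train, query, n_neighbors=5):
    """Give weight 1/n to each of the query's n nearest training examples
    (a simplified stand-in for a DKNN-style example-importance score)."""
    dists = np.linalg.norm(train - query, axis=1)
    scores = np.zeros(len(train))
    scores[np.argsort(dists)[:n_neighbors]] = 1.0 / n_neighbors
    return scores

def similarity_rate(scores_a, scores_b, k=5):
    """Fraction of top-k examples that two importance metrics agree on."""
    top_a = set(np.argsort(scores_a)[-k:])
    top_b = set(np.argsort(scores_b)[-k:])
    return len(top_a & top_b) / k

scores = knn_example_importance(train_latents, test_latent)
```

Figure 2 reports such similarity rates between pairs of example-importance metrics across datasets.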
These scripts produce Tables 2, 3 and 4 from the report, which show the Pearson correlation of feature importance and example importance scores when latent representations are trained under different pretext tasks.
# MNIST
python experiments/mnist.py --name pretext --inference
# CIFAR10
python experiments/cifar100_from_mnist.py --name pretext --data cifar10 --inference
# CIFAR100
python experiments/cifar100_from_mnist.py --name pretext --data cifar100 --inference
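The comparison behind these tables reduces to correlating two importance-score vectors. A sketch with synthetic scores (the pretext tasks and the mixing used to fabricate correlated scores are purely illustrative):

```python
import numpy as np

rng = np.random.default_rng(2)

# Toy per-feature importance scores from encoders trained under two
# hypothetical pretext tasks, plus an unrelated control vector.
scores_pretext_a = rng.random(100)
scores_pretext_b = 0.8 * scores_pretext_a + 0.2 * rng.random(100)
scores_unrelated = rng.random(100)

def pearson(a, b):
    """Pearson correlation between two flattened importance-score vectors."""
    return float(np.corrcoef(a, b)[0, 1])

r_related = pearson(scores_pretext_a, scores_pretext_b)
r_unrelated = pearson(scores_pretext_a, scores_unrelated)
```

In the tables, one such correlation is computed for every pair of pretext tasks, for both feature-importance and example-importance scores.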
These scripts produce Figure 3, which shows the Pearson correlation of feature importance scores between the latent units of disentangled VAEs.
We have additionally implemented this analysis on CIFAR10 and CIFAR100, although we did not have room to include this in our final report.
# MNIST
python experiments/mnist.py --name disvae --inference
# Dsprites
python experiments/dsprites.py --inference
# CIFAR10
python experiments/cifar100_from_mnist.py --name disvae --data cifar10 --inference
# CIFAR100
python experiments/cifar100_from_mnist.py --name disvae --data cifar100 --inference
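The Figure 3 quantity is, for each pair of latent units, the Pearson correlation between their feature-importance (saliency) maps. A toy sketch, with random maps standing in for real saliency maps and a hypothetical 6-unit VAE over 28x28 inputs:

```python
import numpy as np

rng = np.random.default_rng(3)

# Toy saliency maps: one per latent unit of a hypothetical 6-unit VAE,
# each a flattened importance map over 28*28 pixels.
saliency = np.abs(rng.normal(size=(6, 28 * 28)))

# Pairwise Pearson correlations between the units' saliency maps;
# for a well-disentangled VAE the off-diagonal entries should be small.
corr = np.corrcoef(saliency)
off_diag = corr[~np.eye(6, dtype=bool)]
```

Here the independent random maps produce near-zero off-diagonal correlations, which is the pattern one would hope to see for a well-disentangled model.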
To run lucid visualisation on the trained VAE networks, first run either experiment from (Claim 3) Disentanglement. Then you can visualise run1 with lucid by pointing the script at the result folder, where it will also save its output.
Note that for this experiment you first need to install the torch-lucent module:
pip install torch-lucent
Then change a line in the module to make it work with MNIST and our VAE: this if branch needs to be commented out. Once that is done, you can run the lucid experiment with
# Choose data from mnist, cifar10, cifar100
python experiments/lucid.py path/to/vae/folder --data mnist
This code is used to produce Figure 5, which shows how feature importance correlates between latent encoders and full models.
The analysis is best run using the notebook found in extensions/encoder_decoder_correlations/first_class_encoder_vs_decoder_correlations.ipynb.
This notebook imports its codebase from extensions/encoder_decoder_correlations/encoder_decoder_correlations.py.
@inproceedings{anonymous2023reproducibility,
title={Reproducibility Study of {\textquotedblleft}Label-Free Explainability for Unsupervised Models{\textquotedblright}},
author={Papp, Gergely and Wagenbach, Julius and Jans de Vries, Laurens and Mather, Niklas},
booktitle={ML Reproducibility Challenge 2022},
year={2023},
url={https://openreview.net/forum?id=n2qXFXiMsAM}
}