
ViV1T: dynamic mouse V1 response prediction using a factorized spatiotemporal Transformer

Codebase for the ViV1T (team dunedin) submission to the NeurIPS Sensorium 2023 challenge, which placed 🥉.

Figure: ViV1T architecture

Contributors: Bryan M. Li, Wolf De Wulf, Nina Kudryashova, Matthias Hennig, Nathalie L. Rochefort, Arno Onken.

Acknowledgments

We sincerely thank Turishcheva et al. for organizing the Sensorium 2023 challenge and for making their high-quality large-scale mouse V1 recordings publicly available. The structure of this codebase is based on and inspired by bryanlimy/V1T, ecker-lab/sensorium_2023, sinzlab/neuralpredictors and sinzlab/nnfabrik.

File structure

The repository has the following structure.

  • Check data/README.md for more information about the dataset and how to store it.
  • runs/ contains model checkpoints and their training logs.
.
├── LICENSE
├── README.md
├── assets
│   └── viv1t.jpg
├── data
│   ├── README.md
│   └── sensorium
│       ├── dynamic29156-11-10-Video-8744edeac3b4d1ce16b680916b5267ce
│       ├── dynamic29228-2-10-Video-8744edeac3b4d1ce16b680916b5267ce
│       ├── dynamic29234-6-9-Video-8744edeac3b4d1ce16b680916b5267ce
│       ├── dynamic29513-3-5-Video-8744edeac3b4d1ce16b680916b5267ce
│       ├── dynamic29514-2-9-Video-8744edeac3b4d1ce16b680916b5267ce
│       ├── dynamic29515-10-12-Video-9b4f6a1a067fe51e15306b9628efea20
│       ├── dynamic29623-4-9-Video-9b4f6a1a067fe51e15306b9628efea20
│       ├── dynamic29647-19-8-Video-9b4f6a1a067fe51e15306b9628efea20
│       ├── dynamic29712-5-9-Video-9b4f6a1a067fe51e15306b9628efea20
│       └── dynamic29755-2-8-Video-9b4f6a1a067fe51e15306b9628efea20
├── demo.ipynb
├── pyproject.toml
├── requirements.txt
├── runs
│   ├── viv1t_001
│   │   ├── args.yaml
│   │   ├── ckpt
│   │   │   └── model_state.pt
│   │   ├── evaluation.yaml
│   │   ├── model.txt
│   │   └── output.log
│   ├── viv1t_002
│   ├── viv1t_003
│   ├── viv1t_004
│   └── viv1t_005
├── src
│   └── viv1t
│       ├── __init__.py
│       ├── criterions.py
│       ├── data
│       │   ├── __init__.py
│       │   ├── constants.py
│       │   ├── cycle_ds.py
│       │   ├── data.py
│       │   ├── statistics.py
│       │   └── utils.py
│       ├── metrics.py
│       ├── model
│       │   ├── __init__.py
│       │   ├── core
│       │   │   ├── __init__.py
│       │   │   ├── core.py
│       │   │   ├── factorized_baseline.py
│       │   │   └── vivit.py
│       │   ├── critic.py
│       │   ├── helper.py
│       │   ├── model.py
│       │   ├── modulators
│       │   │   ├── __init__.py
│       │   │   ├── gru.py
│       │   │   ├── mlp.py
│       │   │   ├── mlp_v2.py
│       │   │   ├── mlp_v3.py
│       │   │   └── modulator.py
│       │   ├── readout
│       │   │   ├── __init__.py
│       │   │   ├── factorized.py
│       │   │   ├── gaussian2d.py
│       │   │   ├── random.py
│       │   │   └── readout.py
│       │   └── shifter.py
│       ├── scheduler.py
│       └── utils
│           ├── __init__.py
│           ├── bufferdict.py
│           ├── estimate_batch_size.py
│           ├── logger.py
│           ├── utils.py
│           └── yaml.py
└── train.py

Installation

Create a conda environment viv1t with Python 3.11, then install PyTorch 2.1 and the viv1t package:

conda create -n viv1t python=3.11
conda activate viv1t
pip install torch==2.1 torchvision torchaudio
# conda install pytorch=2.1 torchvision torchaudio pytorch-cuda=12.1 -c pytorch -c nvidia
pip install -e .
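
To verify the installation, a quick sanity check (this only uses the standard PyTorch API):

import torch

# Confirm the installed PyTorch version and whether a CUDA device is visible.
print(torch.__version__)          # expect 2.1.x
print(torch.cuda.is_available())  # True on a CUDA machine, e.g. the A100 used for training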

Demo

demo.ipynb is a demo notebook showing how to initialize the model, restore a checkpoint and run inference, as well as how to generate the parquet files used in the challenge submission.
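
Outside the notebook, the trained weights of a run can be inspected directly with PyTorch. This is a minimal sketch that assumes ckpt/model_state.pt is loadable with torch.load; its exact contents (plain state dict vs. a dict with extra training metadata) are an assumption here, which is why the sketch just lists the top-level keys:

import torch

# Load the checkpoint of run viv1t_001 on CPU and list its top-level keys.
# Whether this is a plain state dict or a dict with extra metadata is an
# assumption; printing the keys shows which.
ckpt = torch.load("runs/viv1t_001/ckpt/model_state.pt", map_location="cpu")
if isinstance(ckpt, dict):
    for key in list(ckpt)[:10]:
        print(key)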

Train model

  • The following command trains the ViV1T core and Gaussian2d readout model on all 10 mice and saves results to --output_dir=runs/vivit/test:
    python train.py --data=data/sensorium --output_dir=runs/vivit/test --transform_mode=2 --crop_frame=140 --ds_mode=3 --core=vivit --core_parallel_attention --grad_checkpointing=0 --output_mode=1 --readout=gaussian2d --batch_size=6 --clear_output_dir
  • Training progress is printed to the console and recorded in <output_dir>/output.log, and the model checkpoint is saved periodically to <output_dir>/ckpt/model_state.pt; a sketch for inspecting a finished run directory is shown after the argument list below.
  • A single NVIDIA A100 40GB was used to train the model.
  • Check --help for all available arguments:
    > python train.py --help
    
    usage: train.py [-h] [--data DATA] --output_dir OUTPUT_DIR --ds_mode {0,1,2,3} [--stat_mode {0,1}] --transform_mode
                    {0,1,2,3,4} [--center_crop CENTER_CROP] [--mouse_ids MOUSE_IDS [MOUSE_IDS ...]] [--limit_data LIMIT_DATA]
                    [--cache_data] [--num_workers NUM_WORKERS] [--epochs EPOCHS] [--batch_size BATCH_SIZE]
                    [--micro_batch_size MICRO_BATCH_SIZE] [--crop_frame CROP_FRAME] [--device {cpu,cuda,mps}] [--seed SEED]
                    [--deterministic] [--precision {32,bf16}] [--grad_checkpointing {0,1}] [--restore RESTORE]
                    [--adam_beta1 ADAM_BETA1] [--adam_beta2 ADAM_BETA2] [--adam_eps ADAM_EPS] [--criterion CRITERION]
                    [--ds_scale {0,1}] [--grad_norm GRAD_NORM] [--save_plots] [--dpi DPI] [--format {pdf,svg,png}]
                    [--wandb WANDB] [--wandb_id WANDB_ID] [--clear_output_dir] [--verbose {0,1,2,3}] --core CORE
                    [--core_compile] --readout READOUT [--shifter_mode {0,1,2}] [--modulator_mode {0,1,2,3,4}]
                    [--critic_mode {0,1}] [--output_mode {0,1,2}]
    
    options:
      -h, --help            show this help message and exit
      --data DATA           path to directory where the dataset is stored.
      --output_dir OUTPUT_DIR
                            path to directory to log training performance and model checkpoint.
      --ds_mode {0,1,2,3}   0: train on the 5 original mice
                            1: train on the 5 new mice
                            2: train on all 10 mice jointly
                            3: train on all 10 mice with all tiers from the 5 original mice
      --stat_mode {0,1}     data statistics to use:
                            0: use the provided statistics
                            1: compute statistics from the training set
      --transform_mode {0,1,2,3,4}
                            data transformation and preprocessing
                            0: apply no transformation
                            1: standardize response using statistics over trial
                            2: normalize response using statistics over trial
                            3: standardize response using statistics over trial and time
                            4: normalize response using statistics over trial and time
      --center_crop CENTER_CROP
                            center crop the video frame to center_crop percentage.
      --mouse_ids MOUSE_IDS [MOUSE_IDS ...]
                            Mouse to use for training.
      --limit_data LIMIT_DATA
                            limit the number of training samples.
      --cache_data          cache data in memory in MovieDataset.
      --num_workers NUM_WORKERS
                            number of workers for DataLoader.
      --epochs EPOCHS       maximum epochs to train the model.
      --batch_size BATCH_SIZE
      --micro_batch_size MICRO_BATCH_SIZE
                            micro batch size to train the model. if the model is being trained on CUDA device and micro batch size 0 is provided, then automatically increase micro batch size until OOM.
      --crop_frame CROP_FRAME
                            number of frames to take from each trial.
      --device {cpu,cuda,mps}
                            Device to use for computation. use the best available device if --device is not specified.
      --seed SEED
      --deterministic       use deterministic algorithms in PyTorch
      --precision {32,bf16}
      --grad_checkpointing {0,1}
                            Enable gradient checkpointing for supported models if set to 1. If None is provided, then enable by default if CUDA is detected.
      --restore RESTORE     pretrained model to restore from before training begins.
      --adam_beta1 ADAM_BETA1
      --adam_beta2 ADAM_BETA2
      --adam_eps ADAM_EPS
      --criterion CRITERION
                            criterion (loss function) to use.
      --ds_scale {0,1}      scale loss by the size of the dataset
      --grad_norm GRAD_NORM
                            max value for gradient norm clipping. set None to disable
      --save_plots          save plots to --output_dir
      --dpi DPI             matplotlib figure DPI
      --format {pdf,svg,png}
                            file format when --save_plots
      --wandb WANDB         wandb group name, disable wandb logging if not provided.
      --wandb_id WANDB_ID   wandb run ID to resume from.
      --clear_output_dir    overwrite content in --output_dir
      --verbose {0,1,2,3}
      --core CORE           The core module to use.
      --core_compile        compile core module with inductor backend via torch.compile
      --readout READOUT     The readout module to use.
      --shifter_mode {0,1,2}
                            0: disable shifter
                            1: learn shift from pupil center
                            2: learn shift from pupil center and behavior variables
      --modulator_mode {0,1,2,3,4}
                            0: disable modulator
                            1: MLP Modulator
                            2: GRU Modulator
                            3: MLP-v2 Modulator
                            4: MLP-v3 Modulator
      --critic_mode {0,1}
      --output_mode {0,1,2}
                            Output activation:
                            0: ELU + 1 activation
                            1: Exponential activation
                            2: SoftPlus activation
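
Each run directory keeps its full configuration in args.yaml and its final scores in evaluation.yaml (see the file structure above). A minimal sketch for inspecting a finished run, assuming both files are plain YAML and that keys such as core and readout mirror the train.py flags:

import yaml

# Read the stored configuration and evaluation results of a finished run.
run_dir = "runs/viv1t_001"
with open(f"{run_dir}/args.yaml") as f:
    args = yaml.safe_load(f)
with open(f"{run_dir}/evaluation.yaml") as f:
    evaluation = yaml.safe_load(f)

# 'core' and 'readout' are assumed to be stored under the same names as the
# train.py flags; adjust if the actual keys differ.
print(args.get("core"), args.get("readout"))
print(evaluation)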