These instructions describe how to run a lightly-modified version of the Pacemakers Kaggle kernel on Imperial College's HPC system.
Platforms such as Kaggle, Colab and Azure Notebooks are great for sharing notebooks but there are advantages to using the RCS Compute Service for your research:
- Your data remains inside the College, via the RDS
- You can run long, non-interactive and/or parallel jobs (see below)
- You have access to multi-GPU nodes and several models of GPU (details)
- Clone or download this repository to the HPC system
- Download the data to
Train
andTest
folders in the same directory - Create a conda environment with the required dependencies:
conda env create --file environment.yml
To run the notebook in Jupyter (P1000):
- Visit the RCS Jupyter Service
- Create a new server (GPU recommended)
- Open
pacemakers.ipynb
and run the notebook
To run the notebook as a job (P100):
qsub pacemakers.pbs.sh
- On job completion visit the RCS Jupyter Service
- Open
pacemakers.ipynb
and review the outputs
This repository has two branches. The master
branch (the default) targets a single GPU. The multi-gpu
branch uses DataParallel
to target two GPUs. You can see the relevant modifications by comparing the branches.
Many thanks to James Howard for sharing his work and reviewing these instructions.