A code repository containing a potential framework for running experiments on demultiplexing data from pooled experimental designs, without any genotype information about the pooled samples.
To be filled
All models are implemented in PyTorch, along with a number of smaller packages. Models can be trained on a CPU, but a GPU is recommended. To set everything up at once, we recommend the conda package manager, specifically miniconda3, as it is lightweight and fast to install. If you already have a miniconda3 installation, skip the download and installation steps below. To install both conda and the required packages, run:
wget https://repo.continuum.io/miniconda/Miniconda3-latest-Linux-x86_64.sh
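- Run the downloaded installer (`bash Miniconda3-latest-Linux-x86_64.sh`) and go through the installation.
- Activate conda.
- Create an environment: `conda create -n <your_environment_name> python=3.6`
- Activate it: `conda activate <your_environment_name>`
- Then run: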
bash install.sh
The install.sh script installs all the required packages. You can modify it to install the PyTorch version of your choice.
All models take N x M NumPy arrays as input, where N is the number of cells and M is the number of variants.
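As an illustration of the expected layout, here is a minimal sketch that builds a toy cells-by-variants array and checks its shape before it is handed to a model. The helper name `as_model_input`, the toy values, and the cast to `float32` are assumptions made for this example and are not part of the repository.

```python
import numpy as np


def as_model_input(matrix, n_cells, n_variants):
    """Hypothetical helper (not part of this repo): coerce a
    cells-by-variants matrix to a 2-D float32 NumPy array and
    verify that its shape is (n_cells, n_variants)."""
    # float32 is an assumption; it matches PyTorch's default float dtype.
    arr = np.asarray(matrix, dtype=np.float32)
    if arr.ndim != 2:
        raise ValueError(f"expected a 2-D array, got {arr.ndim} dimension(s)")
    if arr.shape != (n_cells, n_variants):
        raise ValueError(
            f"expected shape {(n_cells, n_variants)}, got {arr.shape}"
        )
    return arr


# Toy data: 4 cells (rows) x 3 variants (columns); the values are made up.
toy = [
    [0, 1, 0],
    [2, 0, 1],
    [0, 0, 3],
    [1, 1, 0],
]

x = as_model_input(toy, n_cells=4, n_variants=3)
print(x.shape, x.dtype)  # (4, 3) float32
```

Rows index cells and columns index variants, so a transposed matrix (variants by cells) would fail the shape check instead of being silently misread.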
- Datasets.py: Contains a function that builds a Dataset compatible with PyTorch's DataLoader.
- utils.py: Contains utility functions for the analysis.
- Networks: Different architectures tested for demultiplexing scRNA-seq. More information can be found in the following papers: