- Folder containing config file and launcher scripts for IMP
# install miniconda3
wget https://repo.anaconda.com/miniconda/Miniconda3-latest-Linux-x86_64.sh
chmod u+x Miniconda3-latest-Linux-x86_64.sh
./Miniconda3-latest-Linux-x86_64.sh # follow the instructions
Getting the repository including sub-modules
git clone git@github.com:susheelbhanu/imp_docs.git
Create the main snakemake
environment
- Note: This will create an environment called
snakemake
. - If you already have a
snakemake conda env
, this is not required.
# create venv
conda env create -f envs/requirements.yaml"
Description
- Below is a list of the files
- The changes that need to be made are indicated for each file
Edit or run
each file in the order listed below, i.e. after #4-editing samples, run #5-launching jobs
-
IMP_config.yaml
: config file with necessary paths, databases, and parameters for snakemake -
launchIMP.sh
: SLURM launcher script forrunIMP.sh
-
runIMP.sh
: bash script to launch IMP using snakemake -
edit_samples.sh
: bash script to make sample folder, add config.yaml, launchIMP and runIMP scripts and simultaneously edit them -
launch_jobs.sh
: bash script to launch multiple samples
Changes to be made
- Below files need to be edited or checked for file paths and database locations
-- IMP_config.yaml
forward/reverse
paths inline 14
formetaG/metaT
outputdir
: path inline 23
summarydir
: path nline 24
filtering
: select fromhg38/PhiX,PhiX174
db_path
: ensure access to dbs inline 30
exist
-- launchIMP.sh
- Edit the
time
,N#
,n#
, andqos
as needed - For jobs estimated to be longer than
2 days
use the-qos long
option
-- runIMP.sh
- Check if access to the
/work/projects/ecosystem_biology/local_tools
is available
-- edit_samples.sh
- Note: Run this script from the directory where the individual sample folders will exist
-- launch_jobs.sh
- Prior to running do:
chmod +x launch_jobs.sh