This is a dockerized version of TimeLapse pipeline. Work is still in progress.
To build the image from scratch, install docker and clone the repository. Then run:
cd TimeLapse-docker
docker build --platform=linux/amd64 -t timelapse . --build-arg GIT_NAME={NET_ID} --build-arg GIT_PWD={NET_ID_PASSWORD}
Where {NET_ID}
and {NET_ID_PASSWORD}
are your credetials to log in into TimeLapse Git repository.
To run the image, first endure that all .fastq files are in their separate directories named after the respecitve sample and they are all placed in ./fastq
directory. In addition, required are: 1) genome sequence in .fasta file format, 2) genome annotation as .gtf file, 3) hisat-3n genome index files.
Image can be then run as docker run -it <parameters> -v ${PWD}:/data timelapse
Or from prebuit image from Docker Hub docker run -it <parameters> -v ${PWD}:/data machyna/timelapse
SAMPLES - comma-separated list of all sample and control names
CONTROLS - comma-separated list of -s4U control name(s)
PREFIX - avoid typing repetitive part of sample names e.g. \"Sample_*\" (default: none)
GENOME - species [Hs, Mm, Dm] (default: Mm)
FORMAT - reads format [PE, SE, NU] (default: PE)
READS - reads strandness [FR, RF, F] (default: RF)
MUT_TRACKS - Mutation type(s) to analyze [TC, GA, TC,GA] (default: TC)
STL - experiment is STL (default: off)
MUT_POS - call mutations by absolute genomic location (default: TRUE)
BIGWIG - Make additional .bigWig track files (default: TRUE)
CHR_TAG - add chr prefix to chromosome names during alignment (default: FALSE)
SPIKEIN_NAME - common string identifying spikein names in .gtf file
FASTA - path to genome fasta file
GTF - path to gtf genome annotation file
INDEX - path to aligner genome index (HISAT-3N)
CPUS - number of cpus used by pipeline or Slurm jobs [int] (default: 10)
MEM - memory used by Slurm jobs [intG](default: 20G)
e.g.
docker run -it -e SAMPLES=LC210501_1in,LC210501_2A \
-e CONTROLS=LC210501_1in \
-e SPECIES=Hs\
-e CPUS=10 \
-e MEM=20G \
-e FORMAT=PE \
-e MUT_TRACKS=TC \
-e FASTA=genome/GRCh38_dm6_dm.fa \
-e GTF=genome/GRCh38_dm6_dm.gtf \
-e INDEX=genome/GRCh38_dm6_dm \
-v ${PWD}:/data \
timelapse