/Ericscript

ChimERIC TranSCRIPT detection

Primary LanguageRGNU General Public License v3.0GPL-3.0

EricScript Readme v2.1 (Feb 2016)

Fork created by mmterpstra@github.com due to inactivity on the original forum. The version the files from is v0.5.5. This goals: run a minimal install/test on travis if possible. Feel free to submit a pull request if you think you can fix it faster and better.

Please feel free to email the author if you have any questions or issues. matteo.benelli AT gmail.com

INFORMATION

EricScript is a software package developed in R, perl and bash scripts. EricScript uses the BWA aligner to perform the mapping on the transcriptome reference and samtools to handle with SAM/BAM files. Recalibration of the exon junction reference is performed by using BLAT.

REQUIREMENTS

Download and install the following software

Be sure that all of these programs are included in your PATH.

RUNNING ERIC

Once you have downloaded EricScript, extract the package

tar -xjf ericscript.tar.bz2

Make a copy of the program folder to your favorite location. Before running for the first time EricScript, you need to make ericscript.pl executable:

chmod +x /PATH/TO/ERIC/ericscript.pl

To get information about running EricScript, digit:

/PATH/TO/ERIC/ericscript.pl --help

In order to perform chimeric transcript detection, you need to download and build the Ensembl Database of a genome. To list the available genomes, digit:

/PATH/TO/ERIC/ericscript.pl --printdb -db /PATH/TO/YOUR/DBFOLDER

After a reference id is selected, you need to download and build the corresponding Ensembl Database. In the example below, it's shown how to prepare the database for saccharomyces cerevisiae.

/PATH/TO/ERIC/ericscript.pl --downdb --refid saccharomyces_cerevisiae -db /PATH/TO/YOUR/DBFOLDER

You can also select a specific ensembl release (>= 70 <=82) to download

/PATH/TO/ERIC/ericscript.pl --downdb --refid saccharomyces_cerevisiae -db /PATH/TO/YOUR/DBFOLDER --ensversion 74

To run EricScript with default parameters (if parameter "refid" is not specified the analysis takes the homo sapiens species as default):

/PATH/TO/ERIC/ericscript.pl -db /PATH/TO/YOUR/DBFOLDER --refid saccharomyces_cerevisiae -name SAMPLENAME -o /PATH/TO/OUTPUT/ YOUR_FASTQ_1 YOUR_FASTQ_2 

You can check if your database is up-to-date by the following:

/PATH/TO/ERIC/ericscript.pl --checkdb -db /PATH/TO/YOUR/DBFOLDER

OUTPUT FILES

The /PATH/TO/OUTPUT/ folder contains the results of the analysis. Predicted gene fusion products are reported in 2 files:

  • samplename.results.total.csv: contains all the predicted gene fusions.
  • samplename.results.filtered.csv: contains the predicted gene fusions with EricScore > 0.50.