genSetAnnotation

genSetAnnotation main repository

The genSetAnnotation is a pipeline (written in Java) dedicated to the annotation of human gene sets using the Gene Ontology (GO). The pipeline combines ontology methods, data mining approaches and combinatory algorithms to provide a representative annotation for a given gene set.

The following components are included in the repository:

Maven source
[resources] Two lists of gene modules: BTM and V2_Trial_8
[resources] OWL of Gene Ontology: version 07/05/2018
[resources] Gene Ontology Annotation is provided in gaf 2.0 format: version 21/05/2018
[resources] R scripts
[results] All analysis results
Runnable jars necessary to execute the pipeline

Running analysis

First of all, you need to run the Rscript DavidAnalysis.R to get the enrichment annotations for the BTM and V2_Trial_8 modules.

Rscript resources/Rscript/DavidAnalysis.R Note: Verify if the first line of the script is uncommented

To run the analysis using the BTM and V2_Trial_8 modules, you have to launch in your terminal the following command:

java -jar -Xmx20g -Xms10g geneSetAnnotationPipeline.jar

The -Xmx and -Xms are adapted to a computer with at least 25 Go of RAM.

Then the R script can be run from the terminal, but before doing this, you need to uncomment the first line (corresponding to setwd(dirname(rstudioapi::getSourceEditorContext()$path))). The recommended order to run the different R scripts is:

ClusterQuality.R
zindex.R
Visualization.R
supplementaryVis.R

Rscript resources/Rscript/.R

Analysis of a gene set

To analyse exclusively one gene set (to recover its representative annotation), you need to execute the following line command:

java -jar geneSetAnnotationAnalysis.jar <list_genes>

The list of genes must be provided as symbol ids and each id has to be separated by coma (without space). An example is given:

java -jar geneSetAnnotationAnalysis.jar LY6E,IFIT1,OAS1,IFIT1,IFIT3,OAS3,IFIT3,OAS1,OASL,LOC129607,ISG15,HERC5,OAS1,MX1,BATF2,LAMP3,IFI44L,XAF1,OASL,IFI44,OAS2,TRIM6,HES4,OTOF,FLJ20035,IFITM3,IFIT3,CXCL10,EPSTI1,SERPING1,LOC26010,OAS2,RSAD2,RTP4

Ayllonbe/geneSetAnnotation

genSetAnnotation