AUC calculation

This script calculates the area-under-the-curve of user-defined gene sets for a given dataset
Author: Mudra Hegde
Email: mhegde@broadinstitute.org

Inputs

  1. Input File: .txt file with list of sgRNAs in the first column and log-fold changes for every sample in the following columns. Column names of log-fold changes will be sample name in the output file.
  2. Chip File: .txt file to map sgRNAs to gene symbols;First column should be the list of sgRNAs and second column should be the corresponding gene symbols
  3. Gene sets folder: Folder with .txt files of each gene set for AUC calculation. Column name for the gene set in the output file will be the file name of the gene set. Gene set file should be a .txt file with one column of gene symbols and no header.
  4. Output File

To run this script, type the following on the terminal:
python calc_auc_v1.1.py --input-file <Path to inputfile> --chip-file <Path to chip file> --gene-set-folder <Path to folder> --outputfile <Path to output file>

To view the available options in the script, type the following on the terminal:
python calc_auc_v1.1.py -h

Sets of essential and non-essential genes from Hart et al.,2015 and Hart et al.,2014, respectively, are included along with this script.