This the CREPE GitHub repository.
- Open the App.R script in RStudio
- Click on the Run App icon located at the top of the app
- An html window will pop up with CREPE running
We provide tutorial files under the MinimalReproducibleExample/ directory.
This function runs a domain analysis to find protein sequences that posess TF DNA-binding Domains (DBDs), and parses these results to provide a TF catalogue.
- Run CREPE (see section above)
- Open the sidebar menu (Three Parallel Bars Button in the top left)
- Under 'Choose Analysis To Run' click on TF Cataloguing.
- Under 'Protein Fasta' click on the 'Browse' button to search for the MRE fasta file
CREPE/MinimalReproducibleExample/TF_Cataloguing/Human_MRE.fasta
- In the 'Species Name' text box write Human, or any other desired identifier.
- An indicator will pop-up in the bottom right of the screen indicating progress; may take several minutes to complete.
- A summary figure of the TF catalogue will be displayed after the analysis is complete.
- Click on the 'Download TF Catalogue' to download cataloguing results.
This function parses gene trees (in Newick format) to assign putative TFs the name of its nearest neighbor. The intention is to provide a name to TFs originating from non-model organisms.
- Run CREPE (see section above)
- Open the sidebar menu (Three Parallel Bars Button in the top left)
- Under 'Choose Analysis To Run' click on TF Annotation.
- Under 'Phylogenetic Trees' click on the 'Select Folder' button to search for the MRE trees
CREPE/MinimalReproducibleExample/TF_Annotation/Trees/
- Under Metadata click on the 'Browse' button to search for the MRE metadata file.
CREPE/MinimalReproducibleExample/TF_Annotation/metadata.csv
- An indicator will pop-up in the bottom right of the screen indicating progress; may take several minutes to complete.
- A table displaying the results will appear after analysis is complete.
- To download the results, choose the species to map to under the 'Pick Mapping to Download' menu; the current options are human, fly or nearest species. Afterwards, click on the 'Download TF Annotation' button.
This function is for those who would like to perform the domain analysis separately using other software, or using different DBD models.
- Run CREPE (see section above)
- Open the sidebar menu (Three Parallel Bars Button in the top left)
- Under 'Choose Analysis To Run' click on Custom.
- Under 'Protein Fasta' click on the 'Browse' button to search for the MRE fasta file
CREPE/MinimalReproducibleExample/TF_Cataloguing/Human_MRE.fasta
- Under 'Tabular Domain Outfile' click on the 'Browse' button to search for the tabular domain analysis run separately by the user. In this case we added the 'Myc_N' domain as an example. **PLEASE NOTE: This is not a sequence specific TF DBD, it is only shown here as an example. **
CREPE/MinimalReproducibleExample/Custom/TabDomainOutfile.tsv
- In the 'Custom Accension' text box write the accession for the additional DBDs used to make the Tabular Domain Outfile. In this case the accession is
PF01056
- In the 'Species Name' text box write Human, or any other desired identifier.
- An indicator will pop-up in the bottom right of the screen indicating progress; may take several minutes to complete.
- A summary figure of the TF catalogue will be displayed after the analysis is complete.
- Click on the 'Download TF Catalogue' to download cataloguing results.
There are several tools available to perform this task. However, to be compatible with CREPE the Tabular Domain Outfile must contain the following information in order:
DBD_Name | Accession | Query_Name | Domain_Start | Domain_Start |
---|
Additionally, the Accensions of the DBD models referenced in CREPE originate from the PFAM database. If the analysis is performed using DBD models from other databases they will not be identified by CREPE.