Unable to obtain list of significant genes for each GO term in GO enrichment test
Opened this issue · 9 comments
Hi there,
I would like to obtain the list of genes that are in each enriched GO term. It doesn't seem as though the results or even the IDs of the significant genes are provided in the results table, but only the number of genes in the GO term reference list. Is there a way I can obtain these using the API?
Thanks for your time,
AT
Currently the API does not output the genes associated with each term. However, you can obtain the associated genes for each term by running the analysis from the website and exporting the results to JSON or XML.
- You would start from http://pantherdb.org/
- Click on the browse button to select a file with the gene ids from the comparison list
- Select the organism
- Select Statistical enrichment test and the annotation data set (Example Full GO BP, PANTHER MF, Reactome...)
- Click submit
- The system will prompt you to select a reference gene list. Here you can upload a custom gene list on the right hand side or select a gene list with all genes for the organism
- The system will now prompt with a selection summary. Here you can add more comparison lists and also update the analysis type and correction.
- Clicking on Launch Analysis will result in running the analysis.
- From here you can export the results as a table or to get associated input genes use the XML or JSON option.
Okay, thanks for the instructions and quick response. I think that would be a nice new feature to have. I am using the API within an R Shiny application and making the parameters for the request interactive for the user. So I would have to run it manually for every possible combination which is not feasible.
@mugitty I have just noticed there is a POST request "geneinfo" which returns associated GO terms with a given gene. Could I use this with the same gene list I input for the GO enrichment and take the intersection of the GO terms to get the associated genes? Only issue I see with this is that there is no gene ID in the results column but a "persistent_id". Can I assume this new ID has the same order as the input gene list?
@ayeTown the "geneinfo" POST request does not return the results in the same order as the request. Furthermore, not all genes have an associated "presistent_id".
@ayeTown After getting the results from the analysis, the user should be able to narrow down the lists with results that require further investigation. This number is usually very small.
Okay, thanks for your help!
@mugitty is there a way to obtain the associated genes (in reference) with each go term? I would like a static data frame of the genes associated with each go term
@ayeTown, The output is large when the gene ids are added for each of the enriched terms. We will let you know if there is another way to obtain the mappings
Currently the API does not output the genes associated with each term
By all means, this is the biggest limitation of the API (read deal-breaking) since its inception. I'd like let the PANTHER team know about it, where do I send my feedback?