WGLab/phenolyzer

Recommendations for maintaining external databases?

Closed this issue · 9 comments

Hello,

I would like to use this tool but noticed the external databases have not been updated in a while. Are there any recommendations for maintaining these?

If we would like to use this tool but would like to use it with regularly updated databases, what would you recommend? Should we figure out how to do the updates ourselves (hopefully with your help)? or is it possible for your team to update regularly? Thank you so much for making this fantastic tool available.

Hello, just checking in on this. Thanks.

I started a related issue in EHR-phenolyzer
WGLab/EHR-Phenolyzer#14

Thanks!

I have tentatively pushed a updatedb_20230107 branch with some testing. The number of records in the old version and new version is given below. Additionally, the hpo.obo and doid.obo files are updated to 2023 version. The formats and certain keywords for some of the original databases have been changed; to be backwards compatible with old versions of Phenolyzer, appropriate care has been taken to modify some of the DB files below.

61370 207792 DB_HUMAN_GENE_ID
36558 75784 DB_GWAS_GENE_DISEASE
6179 7135 DB_CLINVAR_GENE_DISEASE
7273 8059 DB_OMIM_GENE_DISEASE
5714 8248 DB_ORPHANET_GENE_DISEASE
55219 92845 DB_COMPILED_GENE_DISEASE_SCORE
113724 252457 DB_HPO_ANNOTATION

Down the road, I will try to clean up the file preparation scripts so that users can use them to update databases, as it currently involves a lot of manual steps. Additionally, some of the databases probably should be switched to other choices, for example, biosystems pathway database has not been changed since 2017 in NIH website.

Please test the new version and see if there is any issues. Thank you very much for your interests in the tool. If there is no major issues, I will merge this into master branch soon, and then update Phen2Gene as well.

Thanks so much. We'll take a look.

Very helpful, we will test it and let you know.

Thank you! We tested the new version and do not see any issues.

Thank you! This is greatly appreciated.