Resources to reproduce the comparison of ontology mappings generated by text2term with:
- UK Biobank to EFO phenotype mappings
- Biomappings collection of community curated mappings
- Ontology Lookup Service (OLS) mappings of biomedical terms to EFO
The mapping comparison can be executed by running the Python script:
python compare_ontology_mappings.py
The inputs for the comparison are:
data/UK_Biobank_master_file.tsv
: UK Biobank phenotype mappings, downloaded on 2024-03-14 (URL).data/biomappings.tsv
: Biomappings table, downloaded on 2024-04-08 (URL).data/efo.ols.sssom.tsv
: mappings of various ontologies to EFO, downloaded on 2024-04-10 (URL).
data/efo_edges.tsv
: table containing all pairs of asserted subsumptions between terms in EFO.data/efo_entailed_edges.tsv
: table containing all pairs of entailed subsumptions between terms in EFO.
The outputs of the comparison are, for each benchmark mapping set:
output/{dataset}_t2t_mappings.csv
: table containing the generated text2term mappings.output/{dataset}_mappings.tsv
: table containing mappings extracted from the original dataset.output/{dataset}_results.tsv
: table containing the results of the mappings' comparison.