/OWL2Vec-Star

Embedding OWL ontologies

Primary LanguagePython

Dependences

Our codes in this package have been tested with

  1. Python 3.7
  2. RDFLib 4.2.2
  3. gensim 3.8.0 (Note that gensim 4.x brings important changes, see documention here)
  4. scikit-learn 0.21.2
  5. nltk 3.5
  6. OWLready2 0.25

Acknowledgement: Codes under rdf2vec/, which mainly implement walking strategies over RDF graph, come from pyRDF2Vec (version 0.0.3, accessed in 03/2020), with some updates made.

Standalone Application

The standalone application (v0.1.0) can be run by the two main programs.

  1. OWL2Vec_Standalone.py

    This program will embed one ontology. It can be configured by the configuration file default.cfg. See the examples and comments in default.cfg for the usage.

    Running example: python --config_file default.cfg

    Note: Different from the experimental codes, the standalone program has implemented all OWL ontology relevant procedures in python with Owlready, but it also allows the user to use pre-calculated annotations/axioms/entities/projection for generating the corpus.

  2. OWL2Vec_Standalone_Multi.py

    This program will embed multiple ontologies into one language model, where the documents from multiple ontologies will be merged. One use case example is embedding all the conference relevant ontologies of the OAEI conference track by once.

    Running example: python --config_file default_multi.cfg

    Note: Different from OWL2Vec_Standalone.py, this program for multiple ontologies does NOT allow the pre-calculated or external annotations/axioms/entities/projection.

Publications

Main Reference

  • Jiaoyan Chen, Pan Hu, Ernesto Jimenez-Ruiz, Ole Magnus Holter, Denvar Antonyrajah, and Ian Horrocks. OWL2Vec*: Embedding of OWL ontologies. Machine Learning, Springer, 2021. (accepted). Codes for the computed results.

Applications with OWL2Vec*

  • Jiaoyan Chen, Ernesto Jimenez-Ruiz, Ian Horrocks, Denvar Antonyrajah, Ali Hadian, Jaehun Lee. Augmenting Ontology Alignment by Semantic Embedding and Distant Supervision. European Semantic Web Conference, ESWC 2021. (PDF)
  • Ashley Ritchie, Jiaoyan Chen, Leyla Jael Castro, Dietrich Rebholz-Schuhmann, Ernesto JimĂ©nez-Ruiz. Ontology Clustering with OWL2Vec*. DeepOntonNLP ESWC Workshop 2021. (PDF)

Preliminary Publications

  • Ole Magnus Holter, Erik Bryhn Myklebust, Jiaoyan Chen and Ernesto Jimenez-Ruiz. Embedding OWL ontologies with OWL2Vec. International Semantic Web Conference. Poster & Demos. 2019. (PDF)
  • Ole Magnus Holter. Semantic Embeddings for OWL 2 Ontologies. MSc thesis, University of Oslo. 2019. (PDF)

Case Studies

Data and codes for class membership prediction on the Healthy Lifestyles (HeLis) ontology, and class subsumption prediction on the food ontology FoodOn and the Gene Ontology (GO), are under the folder case_studies/.