The goal of this project is to compare the content and knowledge of different Wikimedia projects. In particular, we are interested in the different language editions of Wikipedia and in Wikidata.
For example, consider the pages for the University of Amsterdam (UvA):
UvA (Dutch) | UvA (English) | UvA (Wikidata) |
---|---|---|
Each of these pages has different content. The goal of this project is to create quantitative measures of the differences.
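
As a minimal sketch of what one such measure could look like, the snippet below computes the Jaccard overlap between two sets of facts. This is only an illustrative assumption, not the measure used by this project: the `jaccard` helper is hypothetical, and the `fact_*` labels are placeholders rather than real data extracted from any page.

```python
from __future__ import annotations


def jaccard(a: set[str], b: set[str]) -> float:
    """Return |a ∩ b| / |a ∪ b|; two empty sets count as identical (1.0)."""
    union = a | b
    if not union:
        return 1.0
    return len(a & b) / len(union)


# Placeholder fact identifiers (hypothetical, not taken from Wikipedia or Wikidata).
dutch_facts = {"fact_a", "fact_b", "fact_c"}
english_facts = {"fact_b", "fact_c", "fact_d", "fact_e"}

overlap = jaccard(dutch_facts, english_facts)
print(f"Overlap between the two editions: {overlap:.2f}")
```

A value of 1.0 would mean both editions carry exactly the same facts, while a value near 0 would mean they share almost nothing. How facts are extracted from each page and matched across languages is left open here.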
This is useful in the context of projects we work on at indelab.org, which focus on adding knowledge to knowledge bases like Wikidata.
See for example:
- Prompting as Probing: Using Language Models for Knowledge Base Construction, by Dimitrios Alivanistos, Selene Báez Santamaría, Michael Cochez, Jan-Christoph Kalo, Emile van Krieken, and Thiviyan Thanapalasingam. GitHub
- Inductive Entity Representations from Text via Link Prediction, by Daniel Daza, Michael Cochez, and Paul Groth, in The Web Conference 2021. GitHub