This repository contains the source code for the NER baseline presented in the following research publication (PDF)
Abbas Ghaddar and Philippe Langlais
"WiNER: A Wikipedia annotated corpus for Named Entity Recognition",
In Proceedings of the 8th International Joint Conference on Natural Language Processing (IJCNLP 2017).
- python >= 2.7
- scikit-learn
- KenLM
- gensim
$ python winer.py