/WiNER

Primary LanguagePythonMIT LicenseMIT

WiNER

WiNER: A Wikipedia Annotated Corpus for Named Entity Recognition

This repository contains the source code for the NER baseline presented in the following research publication (PDF)

Abbas Ghaddar and Philippe Langlais 
"WiNER: A Wikipedia annotated corpus for Named Entity Recognition",
In Proceedings of the 8th International Joint Conference on Natural Language Processing (IJCNLP 2017).

Requirements

  • python >= 2.7
  • scikit-learn
  • KenLM
  • gensim

Demo

$ python winer.py