Apache Pig utilities to build training corpora for machine learning / NLP out of public Wikipedia and DBpedia dumps.
Primary LanguageJava
No one’s star this repository yet.