Hadoop jobs for WikiReverse project. Parses Common Crawl data for links to Wikipedia articles.
Primary LanguageJavaMIT LicenseMIT