/wikihadoop

Stream-based InputFormat for processing the compressed XML dumps of Wikipedia with Hadoop
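
To illustrate what "stream-based" processing of a compressed XML dump means, here is a minimal sketch using only the Python standard library. It is not this project's API or its Hadoop InputFormat: the function name `iter_page_titles`, the in-memory `SAMPLE` dump, and the choice of bz2 compression are all assumptions made for the example. The point is that pages are decompressed and parsed incrementally, so the whole dump never has to fit in memory.

```python
import bz2
import io
import xml.etree.ElementTree as ET


def local_name(tag):
    """Drop an XML namespace prefix such as '{http://...}page' -> 'page'."""
    return tag.rsplit("}", 1)[-1]


def iter_page_titles(stream):
    """Yield the <title> of each <page> element read from a file-like object.

    Hypothetical helper for illustration; it parses incrementally and frees
    each page element once it has been handled.
    """
    for _event, elem in ET.iterparse(stream, events=("end",)):
        if local_name(elem.tag) == "page":
            title = None
            for child in elem:
                if local_name(child.tag) == "title":
                    title = child.text
            yield title
            elem.clear()  # release the page's children to keep memory flat


# Tiny stand-in for a real dump (assumed structure: <page> with a <title>).
SAMPLE = (
    b"<mediawiki>"
    b"<page><title>A</title></page>"
    b"<page><title>B</title></page>"
    b"</mediawiki>"
)
compressed = bz2.compress(SAMPLE)

# bz2.open accepts an existing file object and decompresses on the fly.
with bz2.open(io.BytesIO(compressed), "rb") as f:
    titles = list(iter_page_titles(f))

print(titles)
```

With a real dump, the `io.BytesIO` object would be replaced by the path to the compressed file. A Hadoop InputFormat additionally has to split the input across mappers, which this sketch does not attempt.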

Primary Language: Python
