/wikipedia-extractor

Extracts and cleans text from Wikipedia database dump and stores output in a number of files of similar size in a given directory. This fork is merely a wrapper around bwbaugh/wikipedia-extractor that allows it to be installed with pip.

Primary LanguagePython

No issues in this repository yet.