/wiki2text

Extract a plain text corpus from MediaWiki XML dumps, such as Wikipedia.
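To illustrate the kind of input involved, here is a minimal Python sketch that streams pages out of a MediaWiki XML export. It is not wiki2text's own code (the tool is written in Nim), and the names `iter_pages`, `local_name`, and `SAMPLE` are hypothetical. It assumes the usual export layout, in which each `<page>` holds a `<title>` and a `<revision>` with a `<text>` element. It yields raw wikitext only; stripping the markup down to plain text is not shown.

```python
import io
import xml.etree.ElementTree as ET


def local_name(tag):
    """Drop the '{namespace}' prefix that ElementTree puts on tags."""
    return tag.rsplit("}", 1)[-1]


def iter_pages(source):
    """Yield (title, wikitext) for each <page> in a MediaWiki XML export.

    `source` is a filename or a binary file object. Pages are parsed as a
    stream and each finished page is cleared to limit memory use.
    """
    title, text = None, ""
    for _event, elem in ET.iterparse(source, events=("end",)):
        name = local_name(elem.tag)
        if name == "title":
            title = elem.text or ""
        elif name == "text":
            text = elem.text or ""
        elif name == "page":
            yield title, text
            title, text = None, ""
            elem.clear()  # release the finished page's subtree


# Tiny hand-written sample (an assumption, not taken from a real dump).
SAMPLE = b"""<mediawiki xmlns="http://www.mediawiki.org/xml/export-0.10/">
  <page>
    <title>Example</title>
    <revision><text>Hello '''world'''.</text></revision>
  </page>
</mediawiki>"""

pages = list(iter_pages(io.BytesIO(SAMPLE)))
```

Real dumps are typically distributed compressed, so in practice the file object passed in would come from something like `bz2.open(path, "rb")`.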

Primary language: Nim. License: MIT.
