/wikipedia2corpus

Wikipedia text corpus for self-supervised NLP model training

Primary language: Python
License: MIT
