gaohuan2015/openwebtext
Open clone of OpenAI's unreleased WebText dataset scraper. This version uses pushshift.io files instead of the API for speed.
PythonGPL-3.0
No issues in this repository yet.
Open clone of OpenAI's unreleased WebText dataset scraper. This version uses pushshift.io files instead of the API for speed.
PythonGPL-3.0
No issues in this repository yet.