/resh-edu

Scripts used to parse and process the resh.edu.ru dataset

Primary LanguagePythonCreative Commons Zero v1.0 UniversalCC0-1.0

resh-edu

This repository contains scripts used to scrape and process data from resh.edu.ru. More information and the dataset can be found here.

A version of the dataset for causal language modeling will be coming in a few days, but only the raw version is available for now.