EleutherAI/stackexchange-dataset
Python tools for processing the stackexchange data dumps into a text dataset for Language Models
PythonMIT
Stargazers
- aieveryday
- albertqjiangUniversity of Cambridge
- AnthonyDawsonThailand
- apheon-terra
- bdqnghiHuawei Research
- cahya-wirawanVienna, Austria
- DevSinghSachanMontreal
- erayhamza
- fly51flyPRIS
- franciszzjKing's College London
- gamcohGIC
- jeromeku
- jiteshpubrejaPatiala, Punjab, India
- konformal
- Ldpe2GSun Yat-sen University
- Life-0-1
- mallamanisUK
- mingwei-liu
- moisestohias
- nitishkthakurChennai, India
- prompteus
- SandalotsVolcanak
- slowwavesleep
- SophieCai
- stjordanisGreece
- sxnjeyNull
- TechnologyClassroom
- ukaserge
- utensil
- vampire-droid
- voidfulTaiwan
- wonderseen
- xzyaoiETH Zurich / @eth-easl
- yulunduCarnegie Mellon University
- ZanejinsPeking University
- zhiyueGuangzhou