huggingface/datatrove
Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
PythonApache-2.0
Stargazers
- 3outeilleHuggingFace
- abacajsoftware eng building things
- aflah02Indraprastha Institute of Information Technology Delhi
- alexchapeaux
- anas-awadallaSeattle, Washington
- anton-lHugging Face
- baggipontextreamsrl
- casper-hansenCopenhagen, Denmark
- DanielHesslowAdaptive
- evdcush
- fakerybakeryFull-Time Open-Source Contributor
- flozi00A\\Ware
- flrngel@Ainbr
- flukeskywalkerNNAISENSE
- guipenedoHuggingFace
- hoagy-davis-digges
- jramapuram
- Kreshnik@SPRIGS
- ksindi@runwayml
- mc0ps
- menegazzi
- MuhtashamTU Munich
- osansevieroHugging Face
- pavelklymenkoSan Francisco Bay Area
- pharringtonp19
- philschmid@huggingface
- plaggy
- rohan-paulhttps://www.linkedin.com/in/rohan-paul-ai
- rwightmanVancouver, BC
- saforem2@argonne-lcf
- seshurajup@dolcera
- sodabeta7Apple
- tensorboyTikTok Inc
- thomwolf@huggingface
- TJ-SolergibertLausanne
- WissamAntounInria-ALMAnaCH