commoncrawl/cc-mrjob
Demonstration of using Python to process the Common Crawl dataset with the mrjob framework
Python · MIT
Stargazers
- anonymuse (NYC)
- aspotton (@AdamSpotton)
- avinashkoyyana (hyderabad)
- berkantaydin (Turkey)
- BHX2 (Morgantown, WV)
- bmccarthy (Computer Solutions, Inc.)
- charlesxiong (Beijing, China)
- dclambert
- dizcology
- ikuyamada (Studio Ousia & RIKEN)
- janhoy (Cominvent AS)
- jasonchaw ("Mathematical Thinking LLC")
- johnsonleee
- jsanch (@CartoDB)
- JunHuang01
- kalefranz (Anaconda, Inc.)
- kindy (Beijing, China)
- ldave
- leifulstrup
- luav
- marimuraki (Palo Alto, CA)
- minhhahl (Adelaide, SA, Australia)
- mrt (Wild West on the Left Coast)
- msjgriffiths
- MVilstrup (Lego)
- pdenya
- pmatev (@isomorphiclabs)
- pyghassen
- renq (HIT)
- Rlover
- sillyer
- trypy (Washington D.C. Metro Area)
- wangst321
- Will-So (Amazon)
- yanwang10
- yask123 (@Spotify)