whym/wikihadoop
Stream-based InputFormat for processing the compressed XML dumps of Wikipedia with Hadoop
Java
Stargazers
- adriaantAmstelveen, The Netherlands
- anoiaqueFrance
- antoine-tranMeta | Bosch
- Arkanosisdnalyze.me
- bjzu
- ChadFulton
- clmb@FUB-HCC
- colinpollockSan Mateo, CA
- dahaSSAB Oxelösund
- dataartisanChicago, IL
- ddanielsNew York, NY
- drdee@wealthsimple
- EphorusTechniekEphorus BV
- estebanAcryl Data, Inc.
- fengzanfeng
- griggheoLos Angeles
- harit-sunrunCA
- igrigorikShopify
- jbensleyWhitesboro, Texas
- jmarizgitxchema
- lancejpollardCalifornia
- lpm11
- muehlburgerAI-Trust ZT Gmbh
- nellaivijayDell
- noianoPrometeia
- peterstylesMelbourne
- sbeckerivDeath By Escalator
- srikanthlogicHyderabad
- swallingSan Francisco
- tepie
- tommorrisLondon
- uiltondutra@prodeal360
- valpackettArgentina ⭐⭐⭐
- velniukas10xEngineer
- whym
- yourabiSeattle, WA