
A place for me to learn how to use various tools in the Hadoop ecosystem

Primary language: Java. License: MIT.

HadoopCode

A small place for small Hadoop code

All of the net.kwaz.chicago code is written against Weather Underground's Chicago weather data from opensciencedatacloud.org. I chose this dataset only because it was small enough to process reasonably on my multi-VM, single-machine grid without being trivially small or fake.

Here is a tarball of the input files for the raw parser MR job in this codebase. I combined all of the files for a given zip code into a single larger file so that HDFS would not have to cope with a large number of small files.
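For anyone who wants to rebuild the combined inputs from the raw files, the merge step could look roughly like the sketch below. It is not the code used to produce the tarball. The class name ZipCodeFileCombiner and the file naming convention `<zip>_<anything>.txt` are assumptions made for illustration; adjust zipCodeOf to match however the raw files are actually named.

```java
import java.io.IOException;
import java.io.OutputStream;
import java.nio.file.DirectoryStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

public class ZipCodeFileCombiner {

    // ASSUMPTION: raw files are named "<zip>_<anything>.txt", e.g. "60601_2012-01-01.txt".
    // Returns null for files that do not follow that (hypothetical) convention.
    static String zipCodeOf(Path file) {
        String name = file.getFileName().toString();
        int sep = name.indexOf('_');
        return sep > 0 ? name.substring(0, sep) : null;
    }

    // Groups the .txt files directly inside inputDir by zip code,
    // with each group sorted by path so the merge order is deterministic.
    static Map<String, List<Path>> groupByZipCode(Path inputDir) throws IOException {
        Map<String, List<Path>> groups = new TreeMap<>();
        try (DirectoryStream<Path> stream = Files.newDirectoryStream(inputDir, "*.txt")) {
            for (Path file : stream) {
                String zip = zipCodeOf(file);
                if (zip != null) {
                    groups.computeIfAbsent(zip, k -> new ArrayList<>()).add(file);
                }
            }
        }
        for (List<Path> files : groups.values()) {
            Collections.sort(files);
        }
        return groups;
    }

    // Writes one "<zip>.txt" per zip code into outputDir by concatenating the
    // group's files byte for byte. Assumes each raw file ends with a newline,
    // otherwise the last record of one file runs into the first of the next.
    static void combine(Path inputDir, Path outputDir) throws IOException {
        Files.createDirectories(outputDir);
        for (Map.Entry<String, List<Path>> entry : groupByZipCode(inputDir).entrySet()) {
            Path target = outputDir.resolve(entry.getKey() + ".txt");
            try (OutputStream out = Files.newOutputStream(target)) {
                for (Path part : entry.getValue()) {
                    Files.copy(part, out);
                }
            }
        }
    }

    public static void main(String[] args) throws IOException {
        if (args.length != 2) {
            System.err.println("usage: ZipCodeFileCombiner <inputDir> <outputDir>");
            return;
        }
        combine(Paths.get(args[0]), Paths.get(args[1]));
    }
}
```

Use a separate output directory so the combined files are not picked up as inputs on a second run. The combined files can then be copied into HDFS as usual.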