/mrjob_course

mrjob course for Big Dive

Primary LanguagePython

Setup for Big Dive

  1. Have a Github account (email your username to steve+bigdive@steveasleep.com
  2. Clone this repository
  3. cd to your local copy
  4. Optional: virtualenv --no-site-packages --distribute env && env/bin/activate
  5. pip install -r requirements.txt
  6. Visit Yelp Dataset Challenge and get a copy of the Yelp data set
  7. Put it in mrjob_course/data/yelp
  8. Run make

Organization

  • data/: All data to be used in exercises
  • exercises/: Test cases and stub files for exercises
  • scripts/: Some code I used to generate the data, ignore this
  • solutions/: No peeking! Completed exercise code that can be copied over the stub files to work correctly.
  • syllabus.txt: Overview of course order

Running exercises

python exercises/<exercise>/test.py