Code and report for our project in TDT4305 Big Data Architecture at NTNU, spring 2016.

Primary language: Python. License: MIT.

TDT4305 Big Data Architecture

Setup

  1. Install and set up Python 2.7, Spark, and PySpark.
  2. Unpack the ZIP files containing the data.
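
If you prefer to script step 2, the following is a minimal sketch using only the Python standard library. It assumes the archives sit together in one directory; the function name `unpack_archives` and both directory arguments are hypothetical and not part of this repository.

```python
import glob
import os
import zipfile


def unpack_archives(source_dir, target_dir):
    """Extract every .zip file found in source_dir into target_dir.

    Returns the names of all archive members that were extracted.
    """
    # Create the target directory if it does not exist yet.
    if not os.path.isdir(target_dir):
        os.makedirs(target_dir)

    extracted = []
    # Sort the archive paths so the extraction order is deterministic.
    for archive_path in sorted(glob.glob(os.path.join(source_dir, "*.zip"))):
        with zipfile.ZipFile(archive_path) as archive:
            archive.extractall(target_dir)
            extracted.extend(archive.namelist())
    return extracted
```

For example, `unpack_archives(".", "data")` would extract every archive in the current directory into a `data` folder (both paths are placeholders).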

Run

  1. spark-submit task1/foursquare.py input_data_path
  2. spark-submit task2/twitter.py input_data_file output_data_file
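
For readers unfamiliar with how `spark-submit` passes arguments, the sketch below shows one way a script could pick up an input and an output path from the command line. It is an illustration only and does not reproduce `foursquare.py` or `twitter.py`; the names `parse_paths`, `main`, and the app name `argument-sketch` are hypothetical, and counting lines is just a placeholder job.

```python
import sys


def parse_paths(argv):
    """Return (input_path, output_path) from an argv-style list.

    argv[0] is the script name, as in sys.argv.
    """
    if len(argv) != 3:
        raise ValueError("usage: script.py input_data_file output_data_file")
    return argv[1], argv[2]


def main(argv):
    # Imported here so parse_paths can be used without Spark installed.
    from pyspark import SparkContext

    input_path, output_path = parse_paths(argv)
    sc = SparkContext(appName="argument-sketch")  # hypothetical app name
    try:
        # Placeholder job: count the lines of the input file.
        line_count = sc.textFile(input_path).count()
    finally:
        sc.stop()

    # Write the result as plain text to the requested output file.
    with open(output_path, "w") as out:
        out.write("%d\n" % line_count)
```

Adding a call to `main(sys.argv)` at the bottom of such a file would let it be launched with `spark-submit` in the same way as the commands above.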