This repo contains all of the work for the CS 598 Cloud Computing Capstone.
The capstone was to build two versions of an analytics pipeline that analyzes and answers questions about flight data. The first version (Task 1) used batch jobs, and the second version (Task 2) used streaming with Kafka.
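
To illustrate the difference between the two tasks, here is a minimal sketch in plain Python. It is not the code from this repo: the record shape `(carrier, delay_minutes)`, the function names, and the sample data are all assumptions made for illustration, and an in-memory list stands in for both the batch input and the Kafka topic. The batch version reads the whole dataset and returns one final answer, while the streaming version updates its answer after every record.

```python
from collections import defaultdict
from typing import Dict, Iterable, Iterator, Tuple

# Hypothetical record shape: (carrier code, arrival delay in minutes).
Record = Tuple[str, float]


def batch_average_delay(records: Iterable[Record]) -> Dict[str, float]:
    """Batch style: consume the full dataset, then emit one result per carrier."""
    totals: Dict[str, float] = defaultdict(float)
    counts: Dict[str, int] = defaultdict(int)
    for carrier, delay in records:
        totals[carrier] += delay
        counts[carrier] += 1
    return {carrier: totals[carrier] / counts[carrier] for carrier in totals}


def streaming_average_delay(records: Iterable[Record]) -> Iterator[Tuple[str, float]]:
    """Streaming style: keep running state and emit an updated average per record."""
    totals: Dict[str, float] = defaultdict(float)
    counts: Dict[str, int] = defaultdict(int)
    for carrier, delay in records:
        totals[carrier] += delay
        counts[carrier] += 1
        yield carrier, totals[carrier] / counts[carrier]


# Made-up sample data; in a streaming setup these would arrive one at a time.
SAMPLE = [("AA", 10.0), ("UA", 4.0), ("AA", 20.0)]

if __name__ == "__main__":
    print(batch_average_delay(SAMPLE))
    for update in streaming_average_delay(SAMPLE):
        print(update)
```

Once the stream has delivered every record, the last value emitted for each carrier matches the batch result; the streaming version simply makes intermediate answers available earlier.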