Big_Data_Processing_Projects

This repository contains the coursework for the Big Data course, part of the Master's in Data Science program at UMBC.

The repository consists of several projects that work with big data using a variety of big data processing tools.

The technologies and tools include:

  • Apache Spark
    • Spark SQL
    • Spark MLlib (machine learning library)
    • Spark Streaming
  • Python
    • Pandas
    • Matplotlib
  • Hadoop
    • HDFS