/Big_Data

My courses and activities in Big Data

Primary LanguageJupyter NotebookMIT LicenseMIT

Learning Big Data

Resource attributes

Since resources across the internet vary in terms of their pre-requisites and general accessibility, it is useful to give attributes to them so that it is easy to understand where a resource fits into the wider machine learning scope. Below is a few suggested attributes (please extend):

  • 📘 = Doing
  • ✔️ = Completed
  • 🌈 = creative
  • :bowtie: = beginner
  • 😅 = intermediate, some pre-requisites
  • :godmode: = advanced, many pre-requisites

Tools Used

Hadoop, Hive, HBase, ZooKeeper, Oozie, Sorl, Kafka, Pig, MapReduce, YARN, Spark, Scala and Python.

Accelerated Learning Techniques

  • Watch videos at 2x or 3x speed using a browser extension
  • Handwrite notes as you watch for memory retention
  • Immerse yourself in the community

Real-World Tools

Big Data Fundamentals

Hadoop

Scala

Data Storytelling

Spark