/MSBX5420Spring2019

Course material for MSBX5420 at CU-Boulder

Primary LanguageJupyter Notebook

MSBX 5420 - Spring 2019

Unstructured and Distributed Data Modeling and Analysis

Leeds School of Business, University of Colorado Boulder

Instructor: Dr. Spencer Stirling

Co-Instructor: Marilyn Waldman

Contact us

Schedule (subject to change)

Date Topic
January 14 VM Installation
Course Overview
What is Virtualization?
HOMEWORK Linux Basics and Bash
January 21 MLK NO CLASS
HOMEWORK Intro to Python
January 28 Virtualization-lite: Docker
Install HDFS and Spark cluster
HOMEWORK Manage source code with git
February 4 Spark 2
February 11 Exam 1
February 18 Spark 3
February 25 Spark 4
March 4 Hive 1
March 11 Hive 2
March 18 Exam 2
March 25 Spring Break NO CLASS
April 1 Kafka 1
April 8 Kafka 2
April 15 Kafka 3
April 22 Guest Lecture: Tim Berglund from Confluent
April 29 Elasticsearch
Final To be announced