- Fall 2016: Graduate Business, Leavey School of Business
- Course MSIS 2627: Big Data Modeling & Analytics
- Big-Data/MapReduce Course @ Santa Clara University
- Class duration: September 19 - December 8, 2016
- Class hours:
- Monday 5:45pm - 7:00pm PST
- Wednesday 5:45pm - 7:00pm PST
- Class room: Lucas Hall 208
- Office: 321 T, Lucas Hall
- Required books and papers (all resources are online):
The main focus of this class is to cover the following concepts:
- Concepts of Big Data
- Distributed File Systems
- Distributed Computing
- Distributed and Parallel Algorithms
- MapReduce Paradigm
- Scale-out Architectures (using Hadoop, Spark, PySpark)
- Apache Spark: http://spark.apache.org/
- Use Spark, Py-Spark, Hadoop, and Java to teach MapReduce and distributed computing
My latest book:
Data Algorithms: Recipes for Scaling up with Hadoop and Spark