(C) 2016 Steve Phelps
This is a collection of IPython notebooks that I use to teach topics relating to Data Science and Big Data.
- Programming in Python
- Numerical Computing
- Relational data
- Analysing structured data using pandas
- Map-Reduce programming and Apache Spark
- Column-oriented databases with HBase and HappyBase
A virtual-machine containing all of the prerequisite software can be downloaded here. This can be imported as a virtual appliance using VirtualBox.
This work is licensed under the Creative Commons Attribution 4.0 International license agreement.