Description
This hands-on course teaches the tools and methods used by data scientists, from researching solutions to scaling prototypes up to Spark clusters. It exposes students to the entire data science pipeline, from data acquisition to extracting valuable insights, applied to real-world problems.
Questions
Public questions and discussions about the course are gathered in the course repository.
Virtual Machine
Lab Sessions
Week 1 - 21.02.2018 - Module 1 - Python for data scientists 1/4
- Slides: week 1
- Python Quick Reference: notebook
- Exercises: download - view on GitHub
Week 2 - 28.02.2018 - Module 1 - Python for data scientists 2/4
- Slides: week 2
- Solutions to last week's exercises: download - view on GitHub
- Exercises - Set #1: download - view on GitHub
- Exercises - Set #2: download - view on GitHub