Completed assignments and coding challenges from the Lambda School Data Science program.
The majority of the work from the first half of the curriculum is done within IPython notebooks run in Google's Colaboratory environment. No dependencies need to be installed on a local machine to save and work with these notebooks in Google Drive.
For assignments and coding challenges requiring installations on the local machine, instructions are provided with the specific week, or assignment. Generally, most programming is done in Python 3. I used Anaconda to create virtual environments for each new assignment in order to keep my root installation clean.
- The Feature Scaling Terminology Zoo
- Revisiting the Mathematical Foundations of Data Science
- Black Magic and Hyperparameter Tuning
- Generating Matplotlib Subplots Programmatically
- Lessons and Mistakes from my First Reinforcement Learning StarCraft Agent
- Implementing a DeepMind Baseline StarCraft Reinforcement Learning Agent
Lambda School's instruction is divided into 5-day sprints. Each sprint has an overall topic, and will contain one or more "modules" - more specific subtopics. For the first four days, there are short code challenges meant to introduce the material, as well as more in-depth assignments. Occasionally, some days will have no code challenge in favor of some other assignment or reading.
Each sprint is capped off with a comprehensive challenge that covers the breadth of the material, but not in as much depth as the weekly assignments. The material of the Sprint Challenge corresponds directly to the "learning objectives" of each module. The code challenges and assignments for a particular week can be found in the directory for that week.
GitHub's rendering of notebooks does not include animations rendered in the notebook outputs as JavaScript widgets. To view these, and certain other interactive elements, use nbviewer
.
- Functions and Optima
- Linear Algebra
- Data Preparation: An Overview
- Data Visualization
- High Dimensionality
- Dimensionality Reduction
- Presenting for the Public: LaTeX and d3
- Building an ML Portfolio
- Quantitative Data Analysis
- Graphical Data Analysis
- Statistical Techniques
- Linear Regression
- Logistic Regression
- Model Tuning
- Supervised Learning
- Clustering
- Association Rule Learning
- Collaborative Filtering
- Neural Networks
- Computer Vision
- Deep CNNs
- Natural Language Processing - Introduction
- Comparing Documents or Words
- Sentiment Analysis
- Accessing and Building Corpuses
- Spark
- SQL
- Flask
- Microsoft Azure ML Studio
- Docker
- Reinforcement Learning with OpenAI Gym
- Object Detection
- Data Structures
- Algorithms Overview
- Introduction to Graphs and Bokeh
- Connected Components
- Search
- Introduction to C
- Operating Systems