/lsml20

Practical sessions for the 2020 Large Scale Machine Learning course at MINES ParisTech

Primary LanguageJupyter Notebook

lsml20

Practical sessions for the 2020 Large Scale Machine Learning course at MINES ParisTech.

2020-03-23 Introduction to Machine Learning with Python

This repository contains Jupyter notebooks for the first afternoon of practical sessions of the LSML 2020 course.

To run the notebooks, you need to start Jupyter by typing

jupyter notebook

in a terminal. This will open a file navigator in your web browser. Navigate to this repository. You can then open any notebook by clicking on it.

If you are already familiar with Python and numerical Python (numpy) as well as matplotlib, you can start with Notebook 3. We recommend, however, doing the small problems in Notebooks 1 and 2 to familiarize yourselves with Jupyter and refresh your knowledge of these tools.

The focus of our first afternoon of practicals is going to be Notebooks 3 and 4, which introduce data manipulation and machine learning with scikit-learn. If you still have time, you can get started on Notebook 5.

2020-03-24 Feature engineering with Python

We have so far assumed that our data is represented by a well-behaved matrix X with n rows/samples and p columns, each corresponding to a feature. An important part of data science, however, is to transform raw data in such features.

Start with Notebook 5 (from the previous day) and move on to Notebook 6, where you'll explore data pre-processing in more details.

2020-03-24 Convolutional Neural Networks

If you have never trained a ConvNet before (in particular, if you did not take the "Apprentissage Artificiel" course by Fabien Moutarde), start with CNN_Introduction.

Then you can move on to transfer learning with deep networks at Transfer_Learning

2020-03-25 Stochastic Gradient Descent

A large part of machine learning rests on convex optimization and gradient descent techniques. How does one perform gradient descent with large data? This is what you'll explore in Notebook 7.