Data science bootcamp at Sciences Po
- lead : Sylvain ;
- format : Xaringan ;
- presentation : link
Colab Notebook Python basics
Source : https://github.com/moreymat/scpo-data-science-bootcamp/blob/main/notebooks/1_python.ipynb
Goals :
- use a notebook
- declare a variable
- execute a statement
- print a message
- wrap-up
Goals :
- import a package (NumPy)
- manipulate single numbers
- create a one-dimensional data structure
- select elements from a one-dimensional data structure
- perform operations on elements of a one-dimensional data structure
- create a two-dimensional data structure
- select elements from a two-dimensional data structure
- perform operations on elements of a two-dimensional data structure
- wrap-up
- lead : Sylvain ;
- format : Xaringan ;
- presentation : link
Program : ()
Colab Notebook Tabular data analysis 1 : Loading Open Food Facts data with pandas
Source : https://github.com/moreymat/scpo-data-science-bootcamp/blob/main/notebooks/2_pandas.ipynb
- load the data from a tabular format
- store data in a variable
- print data from the variable
- identify the dimensions from a dataset
- identify variable names
- select elements from a dataset
- subset a dataset
- write an object
- wrap-up
- load the OFF dataset
- filter the data by a specific variable
- lead : Sylvain ;
- format : Xaringan ;
- presentation : link
Program : (brands are fighting on the method of calculation of the nutrition score, let's make it ourselves)
Colab Notebook Data visualization
Source : https://github.com/moreymat/scpo-data-science-bootcamp/blob/main/notebooks/3_dataviz.ipynb
You should be able to access the OFF data files from the notebook but just in case, here are the direct links to the Google Drive (access restricted to Sciences Po) :
The first notebook is adapted from https://colab.research.google.com/github/data-psl/lectures2020/blob/master/notebooks/01_python_basics.ipynb