Collection of useful and heterogeneous data.
-
All data
make data_all
-
INSEE data
- population counts
- household confidence (confiance des ménages)
- business climate (climat des affaires)
- couple-famille-menages (France, e.g. 2016)
- diplome-formation (France, e.g. 2016)
make data_insee
-
CAF data
make data_caf
-
LA POSTE data
make data_laposte
-
Miscellaneous data
make data_misc
-
World Bank data
make data_world_bank
-
European Central Bank data
make data_ecb
- Sourcing / Put the URL in data/<data_source_name>/source_config.py (URL, raw and processed file)
- Downloading / Add the function to download the data in data/<data_source_name>/get_data.py *
- Formatting final data / Add the function to format and write the final data in data/<data_source_name>/make_data.py *
- Automating / Add the automation lines in the data/make_dataset.py script and in the Makefile
- Documentation and usage / Add the Makefile command in the Usage section of the README.md file, with the URL of the source
- Don't forget to document your functions!
- And you're done!
* It is good practice to rely on existing functions from data/utils.py and data/helpers.py to download and write files.
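
The sourcing, downloading and formatting steps could look roughly like the sketch below for a made-up source named "example". Every name in it (SOURCE_CONFIG, download_raw, make_processed, the file paths and the placeholder URL) is an assumption made for illustration, not taken from this repository. It uses only the Python standard library so that it stands alone; in the real project the helpers in data/utils.py and data/helpers.py would likely replace the download and write logic.

```python
import csv
import urllib.request
from pathlib import Path

# --- data/example/source_config.py (hypothetical file and names) ---
SOURCE_CONFIG = {
    "url": "https://example.org/dataset.csv",  # placeholder URL, not a real source
    "raw_file": Path("data/example/raw/dataset.csv"),
    "processed_file": Path("data/example/processed/dataset.csv"),
}


# --- data/example/get_data.py (hypothetical file and names) ---
def download_raw(config):
    """Download config["url"] to config["raw_file"] and return that path."""
    raw_file = Path(config["raw_file"])
    raw_file.parent.mkdir(parents=True, exist_ok=True)
    urllib.request.urlretrieve(config["url"], str(raw_file))
    return raw_file


# --- data/example/make_data.py (hypothetical file and names) ---
def make_processed(config):
    """Read the raw CSV, normalise its header names, write the processed CSV."""
    raw_file = Path(config["raw_file"])
    processed_file = Path(config["processed_file"])
    processed_file.parent.mkdir(parents=True, exist_ok=True)

    with raw_file.open(newline="", encoding="utf-8") as src:
        rows = list(csv.reader(src))
    if not rows:
        raise ValueError(f"{raw_file} is empty")

    # Example formatting step: lower-case, underscore-separated column names.
    header = [name.strip().lower().replace(" ", "_") for name in rows[0]]

    with processed_file.open("w", newline="", encoding="utf-8") as dst:
        writer = csv.writer(dst)
        writer.writerow(header)
        writer.writerows(rows[1:])
    return processed_file
```

With something like this in place, the automation step would call download_raw(SOURCE_CONFIG) and then make_processed(SOURCE_CONFIG) from data/make_dataset.py, and a matching Makefile target (for instance a hypothetical data_example) would trigger that call.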