Data literacy and accountability, delivering data-products
From data in a spreadsheet to data as a service!
Thomas Roca, PhD, Economist & Data Strategist - Microsoft - January 29-31 - Hanoi
Stay in touch via Twitter, Github or LinkedIn
Dataviz workshop: slides
- Day 1 - Dataviz and Datastory Workshop
- Day 2 - Data Training Hanoi
- Day 3 - The data Product infrastructure
Participants' projects:
- Erik: Changing farms and changing farmers in Southeast Asia
- Maria Angela Barua
- Louis Parker
- Thuy Khuong: Infant mortality rate
- Trong Phan
- Thuy Nguyen
- The Farmer and the Smart Carabao
- Nghiem Le
- Maureen Gregorio
- LNhu
- James: Disaster risk reduction strategy through Climate Smart Villages in Pakistan
- Brice: Watch out! Your cocoa's urea is producing a lot of CO2!
Tools we are going to use:
For this workshop, you will need to install a code editor:
We are also going to use:
- JSFiddle, an online JavaScript editor: sign up here (free)
- Carto, an online map editor: sign up here (free)
- Power BI, Microsoft's data visualization dashboard tool: sign up here
- Power Map for Excel: you can read about it here
For those who want to go further and try cloud services:
- Azure, Microsoft's cloud services: try a month for free
To save time during the workshop, please register for JSFiddle and Carto beforehand!
Introduction: from people to data... to people; from data in a spreadsheet to data as a service
- Data accountability
- e.g. data.afd.fr; OPAL
I. Basic introduction to data representation [15min]
- What is dataviz, and why use it?
- Better understanding
- Communication
- Interactivity and storytelling
- Using packages with Stata, Python and R
- GIS using cloud-based software: Microsoft Office 365's Power BI and Carto
Practical: Create a heatmap with Carto [30min]
- Map Kenya health sites using the CSV file from the HDX platform
- Map Cameroon's population in 2015 using the data in the GitHub folder (CMR sub)
Part II. When statistician met the web
II. Basic introduction to the web languages: HTML, CSS, JavaScript [20min]
JavaScript libraries & dataviz
- Dataviz: the example of Highcharts
- GIS using Highmaps and Leaflet
- Using folium and Python
III. Data and programming for the web: [30min]
A tiny bit of code [20min]
- Quick intro to JavaScript
When data comes in [10min]
- Data are not only stored in XLS files!
- Text format to store data: CSVs, TSVs, etc.
- Structured text data:
- Intro to XML
- Intro to JSON
- Intro to GeoJSON and shapefiles
- Useful link: https://mygeodata.cloud/
- StatSilk Shapefiles
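As a rough illustration of the text formats listed above, the sketch below writes one record as CSV, as JSON and as a GeoJSON Feature, using only Python's standard library. The record (`Example clinic` and its coordinates) is an invented placeholder, not workshop data, and the helper names are hypothetical.

```python
import csv
import io
import json

# Hypothetical record: the name and coordinates are placeholders, not real data.
record = {"name": "Example clinic", "lat": -1.0, "lon": 36.0}


def to_csv(rows):
    """Serialize a list of flat dicts as CSV text, header row first."""
    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer, fieldnames=list(rows[0].keys()), lineterminator="\n"
    )
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


def to_geojson_feature(row):
    """Wrap a row as a GeoJSON Point Feature (coordinates are [lon, lat])."""
    properties = {k: v for k, v in row.items() if k not in ("lat", "lon")}
    return {
        "type": "Feature",
        "geometry": {"type": "Point", "coordinates": [row["lon"], row["lat"]]},
        "properties": properties,
    }


csv_text = to_csv([record])
json_text = json.dumps(record)
geojson_text = json.dumps(to_geojson_feature(record))

# Reading back: CSV returns every value as a string,
# whereas JSON keeps numbers as numbers.
rows_from_csv = list(csv.DictReader(io.StringIO(csv_text)))
row_from_json = json.loads(json_text)
```

One practical difference to keep in mind: values read from CSV need explicit type conversion, while JSON and GeoJSON carry basic types and nesting with them.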
Practical: Option A. Create a DataStory with Highcharts and Carto
- Use the Data Story Template in the GitHub repository. It can be about:
- Health
- Education
- Socio-economic conditions, etc.
You can use data sources such as the World Bank indicators, the UN OCHA HDX platform, etc.
Part III. When data meet the cloud
IV. Getting real-time data: from scraper to API, the data revolution: [20min]
- Introduction: what is an API?
- Use cases: an application leveraging the World Bank API
- Example: DHS API app: http://data.afd.fr/DataTools/DHS/DHS+browser.html
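To make the "scraper" side of this contrast concrete, here is a minimal sketch of a scraper built on Python's standard `html.parser`. It pulls table cells out of HTML that was written for human readers, which is exactly the fragile step an API removes by returning structured data directly. The `TableScraper` class and the page fragment are hypothetical, and the cell contents are placeholders.

```python
from html.parser import HTMLParser


class TableScraper(HTMLParser):
    """Collect the text of every <td>/<th> cell, grouped by table row."""

    def __init__(self):
        super().__init__()
        self.rows = []
        self._row = None       # cells of the row currently being read
        self._in_cell = False  # True while inside a <td> or <th>

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._in_cell = True
            self._row.append("")

    def handle_endtag(self, tag):
        if tag in ("td", "th"):
            self._in_cell = False
        elif tag == "tr" and self._row is not None:
            self.rows.append(self._row)
            self._row = None
            self._in_cell = False

    def handle_data(self, data):
        if self._in_cell:
            self._row[-1] += data.strip()


# Hypothetical page fragment; a real scraper would download the HTML first.
page = """
<table>
  <tr><th>Country</th><th>Value</th></tr>
  <tr><td>Kenya</td><td>placeholder</td></tr>
</table>
"""

scraper = TableScraper()
scraper.feed(page)
table = scraper.rows
```

A scraper like this breaks as soon as the page layout changes, which is one reason to prefer an API whenever the data provider offers one.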
Practical: Option B. Use the World Bank API & the DHS application
- World Bank API
- GDP growth (annual %) of Kenya (WDI)
- Last 20 observations
- Using Highcharts
- DHS API
- Create a heatmap using Leaflet
- Display the age-specific literacy rate (15-19)
- Subnational level in Kenya
- for the last DHS available
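The World Bank part of this practical could be sketched as follows in Python, using only the standard library. It assumes the World Bank v2 endpoint layout, the `mrv` ("most recent values") parameter, and the indicator code `NY.GDP.MKTP.KD.ZG` for GDP growth (annual %). The function names are hypothetical, and the sample payload holds placeholder values, not real observations.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

WB_BASE = "https://api.worldbank.org/v2"


def wb_url(country, indicator, last_n):
    """Build the query URL; `mrv` asks for the most recent `last_n` values."""
    query = urlencode({"format": "json", "mrv": last_n})
    return f"{WB_BASE}/country/{country}/indicator/{indicator}?{query}"


def parse_wb(payload):
    """Turn a [metadata, observations] response into sorted (year, value) pairs."""
    observations = payload[1] if len(payload) > 1 and payload[1] else []
    points = [
        (int(obs["date"]), obs["value"])
        for obs in observations
        if obs["value"] is not None  # missing years come back as null
    ]
    return sorted(points)


def fetch_series(country, indicator, last_n=20):
    """Download and parse a series (needs network access; not called here)."""
    with urlopen(wb_url(country, indicator, last_n)) as response:
        return parse_wb(json.load(response))


# Offline check with a hand-made payload: placeholder values, not real data.
sample = [
    {"page": 1, "pages": 1, "total": 3},
    [
        {"date": "2002", "value": 2.0},
        {"date": "2001", "value": None},
        {"date": "2000", "value": 1.0},
    ],
]
url = wb_url("KE", "NY.GDP.MKTP.KD.ZG", 20)
series = parse_wb(sample)
```

The resulting list of (year, value) pairs has the same shape as the `[x, y]` point arrays that a Highcharts line series accepts, so it can be dumped to JSON and passed to the chart.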
V. Leveraging cloud computing and data science algorithms as a service: [20min]
- How do APIs work? Creating your first API
VI. Deploying data products on the cloud
- Introduction to Flask
- A simple scraping algorithm on Azure
- Automating data tasks on Azure
- Deploying a machine learning algorithm on the web with a user interface
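As a hedged sketch of what "creating your first API" can look like, the example below serves a small JSON endpoint. It deliberately swaps Flask for the standard library's WSGI interface (`wsgiref`), the protocol Flask applications are built on, so that it runs without installing anything. The route `/data/<code>`, the `DATA` dictionary and the helper names are all hypothetical, and the stored value is a placeholder.

```python
import json
from wsgiref.simple_server import make_server

# Hypothetical in-memory "database"; the value is a placeholder.
DATA = {"KE": {"country": "Kenya", "value": 1.0}}


def app(environ, start_response):
    """Answer /data/<code> with JSON, or with a 404 JSON error."""
    parts = environ.get("PATH_INFO", "").strip("/").split("/")
    if len(parts) == 2 and parts[0] == "data" and parts[1] in DATA:
        status, body = "200 OK", DATA[parts[1]]
    else:
        status, body = "404 Not Found", {"error": "unknown resource"}
    payload = json.dumps(body).encode("utf-8")
    start_response(status, [
        ("Content-Type", "application/json"),
        ("Content-Length", str(len(payload))),
    ])
    return [payload]


def call(path):
    """Invoke the app directly, without a network socket, for quick checks."""
    captured = {}

    def start_response(status, headers):
        captured["status"] = status

    environ = {"PATH_INFO": path, "REQUEST_METHOD": "GET"}
    body = b"".join(app(environ, start_response))
    return captured["status"], json.loads(body)


def serve(port=8000):
    """Serve the app locally until interrupted (not called here)."""
    with make_server("", port, app) as server:
        server.serve_forever()
```

Calling `serve()` and opening `http://localhost:8000/data/KE` in a browser should return the JSON record. In Flask the same idea is expressed with a route decorator instead of manual path parsing.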
Further reading:
- Friendly, M., 1999, "Re-Visions of Minard", link
- Friendly, M., 2008, "A Brief History of Data Visualization", link
- Lupi, G., Posavec, S., 2016, "Dear Data", http://www.dear-data.com/
- Yau, N., 2011, "Visualize This: The FlowingData Guide to Design, Visualization, and Statistics", link
- Smithsonian Magazine, "The Revolutionary Infographics of W.E.B. Du Bois And Booker T. Washington", link
- Toucan Toco, "Infographie vs Dataviz: Faites la différence", link
- Tufte, E. R., 1983, "The Visual Display of Quantitative Information", Graphics Press, Cheshire, CT.
- Hagley, J., "What is the difference between infographic and dataviz?", link
Tutorials
Online tools
- Pick your colors with ColorBrewer
- Read your JSON with jsoneditoronline.org
- Store your JSON with myjson.com
- Convert shapefiles to GeoJSON with mygeodata
Code samples we saw on JSFiddle:
- SVG
- Highcharts
- Leaflet
- JavaScript
- Interaction
- Parse JSON data
- How to make an API call
- Draw a graph from API data
- James' graph! API call, user interface and graph: link
- DHS API with Leaflet
Highcharts:
Carto:
- Video tutorials for Carto: in French, in Spanish
- Carto workshop tutorial: https://github.com/CartoDB/carto-workshop
HTML, CSS and JS:
DHS metadata:
Metadata as presented on: http://spatialdata.dhsprogram.com/data/#/common/download
Countries: AL, AO, AM, AZ, BD, BJ, BO, BT, BR, BF, BU, KH, CM, CF, TD, CO, KM, CG, CD, CI, DR, EC, EG, ES, ER, ET, GA, GH, GU, GN, GY, HT, HN, IA, ID, JO, KK, KE, KY, LS, LB, MD, MW, MV, ML, MR, MX, MB, MA, MZ, NM, NP, NC, NI, NG, PK, PY, PE, PH, RW, ST, SN, SL, ZA, LK, SD, SZ, TJ, TZ, TH, GM, TL, TG, TT, TN, TR, TM, UG, UA, UZ, VN, YE, ZM, ZW
Indicators:
[List of DHS indicators](https://api.dhsprogram.com/rest/dhs/indicators?f=html)
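The country codes and indicator list above can be combined into a query against the DHS API. The sketch below only builds the URL. It assumes the `data` endpoint under the same base address as the indicator list, and the query parameters `countryIds`, `indicatorIds`, `breakdown` and `f`, which should be checked against the DHS API documentation. `INDICATOR_ID` is a hypothetical placeholder to be replaced by a real ID from the indicator list, and `KNOWN_CODES` holds only a handful of the codes listed above.

```python
from urllib.parse import urlencode

DHS_BASE = "https://api.dhsprogram.com/rest/dhs"

# A few of the DHS country codes listed above; extend as needed.
KNOWN_CODES = {"KE", "CM", "VN", "PH", "PK"}


def dhs_data_url(country, indicator_id, breakdown="subnational"):
    """Build a DHS data query URL (parameter names are assumptions)."""
    if country not in KNOWN_CODES:
        raise ValueError(f"unknown DHS country code: {country!r}")
    query = urlencode({
        "countryIds": country,
        "indicatorIds": indicator_id,
        "breakdown": breakdown,
        "f": "json",  # ask for JSON instead of the HTML view
    })
    return f"{DHS_BASE}/data?{query}"


# INDICATOR_ID is a placeholder, not a real DHS indicator ID.
url = dhs_data_url("KE", "INDICATOR_ID")
```

Validating the code before building the URL catches a common slip: several DHS country codes differ from the ISO two-letter codes, so an ISO code may silently return no data.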