Hop Suisse !

Repository for ADA project. The goal of this project is data scraping, analysis and visualization from the Datasport website, focusing on the "running" data.

A more detailed description of the steps followed can be found below.

Team

NB : The project is run by a new team, obtained by merging two ADA teams

Working steps

In this section we describe our steps, (more or less) chronologically.

project proposal :

Describes guideline, goals and objectives of the project.

global parsing :

From datasport main page, make requests to extract all the names, dates and places of every running competition, and the url links where to find the results.

Results :

links2runs.csv

ranking parsing :

From every url found in links2runs.csv, get all the information about every specific race, that is all the information about every runner : name, age, category, ranking, pace, etc. Note that given the way Datasport displays things, this is not an easy step 😁

Results :

full_database.csv (temporarily hosted on Dropox)

weather :

From links2runs.csv consider every date and place and (try to) find the corresponding weather and temperature in order to do performance analysis with respect to the weather/temperature. Due to the API used, such information for races older than July 2008 is not available.

Results :

races-information-weather.csv

gathering information :

Extra steps to build, on top of links2runs.csv, a more complete table containing the scraped information plus the weather information and GPS coordinates for each location when available.

Results :

races-information.csv

data analysis :

Data analysis on both particular cases like Lausanne Marathon and on the global dataset.

visualization :

Our goal is to display the gathered data and the analysis on a website, in a more "user-friendly" way than Datasport. The website pointed by the title above is an early attempt at mastering GitHub Pages, Jekyll, D3.js, Leaflet, etc... 😉

maximepeschard/hop_suisse

Hop Suisse !

Team

Working steps