GraphHack - Graph Connect Europe 2016

It’s Graph Connect time again and for this year’s hackathon we’ve gone with a transport theme.

The folllowing are some datasets to get you started:

Transport for London

runtimes.csv contains underground stations, distances between adjacent stations and the run time between stations on different lines.

You can use this Cypher script to load the data into Neo4j.

Alternatively you could download data from the TFL unified API. You can get accident stats, train and bus routes, disruptions and a few other things as well. You can also see the docs page for this API.

Roads

Traffic-major-roads-miles.csv contains 250,000 of the major roads in the UK, how they’re connected to each other and the traffic volume by vehicle type.

This document explains the dataset in more detail.

Jacqui Reed also has a graphgist showing different queries around the major roads in Staines!

Road Safety

Road Safety Data contains information on road accidents stretching back to 1979.

You can also download accidents for single years if you want to work with a smaller dataset.

Airlines

UK Crime Data - data.police.uk

Street-level crime, outcome, and stop and search information, broken down by police force.

Skytrax Air Travel Reviews

Reviews of airlines, airports, seats and lounges from Skytrax. Possibly a great match for using AlchemyAPI sentiment analysis tool (see below).

AlchemyAPI

AlchemyAPI provides semantic text analysis tools, such as sentiment analysis. The folks from IBM have gracious donated an API for use during this event.

None of the above

If none of those appeal then you can find plenty of other ones on the Department of Transport page on data.gov.uk.

Helpful Info


Graph Connect Europe 2016

April 25th/26th 2016

gc london logo round