It’s Graph Connect time again and for this year’s hackathon we’ve gone with a transport theme.
If you used the installer for Neo4j 3.0, use this tool from Will to load the semicolon-separated files:
Otherwise if you used the tar.gz Installation, you can use neo4j-shell
$NEO4J_HOME/bin/neo4j-shell -file path/to/file.cypher
The folllowing are some datasets to get you started:
runtimes.csv contains underground stations, distances between adjacent stations and the run time between stations on different lines.
You can use this Cypher script to load the data into Neo4j.
Alternatively you could download data from the TFL unified API. You can get accident stats, train and bus routes, disruptions and a few other things as well. You can also see the docs page for this API.
Traffic-major-roads-miles.csv contains 250,000 of the major roads in the UK, how they’re connected to each other and the traffic volume by vehicle type.
This document explains the dataset in more detail.
Jacqui Reed also has a graphgist showing different queries around the major roads in Staines!
Road Safety Data contains information on road accidents stretching back to 1979.
You can also download accidents for single years if you want to work with a smaller dataset.
Street-level crime, outcome, and stop and search information, broken down by police force.
Reviews of airlines, airports, seats and lounges from Skytrax. Possibly a great match for using AlchemyAPI sentiment analysis tool (see below).
AlchemyAPI provides semantic text analysis tools, such as sentiment analysis. The folks from IBM have gracious donated an API for use during this event.
-
API key: c9d93252138b9fc4516359cf7ad4febf5feb4387
If none of those appeal then you can find plenty of other ones on the Department of Transport page on data.gov.uk.