Download NYC subway turnstile data files from the MTA's website and load them into a database
Hastily put together in March 2020, structured as a Rails app but the only functionality is to download files and load them into a Postgres table called turnstile_observations
The repo does not currently support the MTA file formats for the time period 5/5/2010–10/11/2014
See here for a dashboard with up-to-date graphs. Data updates weekly on Saturday mornings.
bundle exec rake db:setup
lib/stations.csv
includes a manual mapping of station to borough name for every observed unique combination of station
/line_names
/division
in the raw data. It is used to seed the stations
table, which can then be joined to the turnstile_observations
table to calculate aggregates by borough
From the Rails console, import a single file:
TurnstileObservation.import_turnstile_observations(date: "2020-03-21".to_date)
Import all files:
TurnstileObservation.
all_available_dates.
select { |d| d >= TurnstileObservation::FIRST_DATE_WITH_NEW_FORMAT }.
each { |d| TurnstileObservation.import_turnstile_observations(date: d) }
After importing files, you have to run another query to set the net_entries
and net_exits
colums:
TurnstileObservation.set_net_entries_and_net_exits