DC data lives in many different locations in many different formats - it may be on the official DC open data website, (briefly) live on a Dropbox folder for a hackathon, tucked away on an agency website, or an individual's Github. It can be difficult to navigate and even know if the data you want exists. Here's hoping this provides a modicum of clarity, making it easier for people to find the data they need and identify areas where data is needed. This list will also help inform a new Code for DC Open Data Portal.
Please contribute to the list! You can do so by:
- Submitting an issue
- Making a pull request
- E-mailing me at datalensdc@gmail.com
Or, if you there's particular data you want but don't know where it lives, submit an issue or e-mail me! We can find or FOIA it.
Topics | Agency | Summary (and hyperlink) | Data Type | Notes | Added to Code for DC Open Data Portal |
---|---|---|---|---|---|
Multiple | Multiple | official open data site | API; multiple | mostly geo. highlights include 3+ years of crime, ticketing, crashes and business licenses | |
Multiple | Multiple | Code for DC open data catalog | API; multiple | scraped/FOIA data, slightly dated. We're reviving it! | |
Geography | Multiple | All the Maps | GeoJSON | by Ben Balter | |
Demographics | Multiple | Demographics at many different geo levels | HTML/XLS | population,well-being,housing,foreclosures,schools | |
Education | OSSE | Best single resource on school data | html, csv | school profiles and performance.Benjamin Robinson created R package for data | |
Education | OSSE | school enrollment audits | XLSX | X | |
Education | PCSB | Charter performance, enrollment, lotteries | API; multiple | ||
Education | DCPS | DCPS school budgets,budgeted enrollment | XLSX | X | |
Education | DCPS | enrollment, grad rates, test scores | XLSX | X | |
Transportation | WMATA | Metro ridership, survey responses | XLSX | ||
Transportation | WMATA | routes, real time predictions, incidents | API; multiple | ||
Transportation | WMATA | Metro service disruptions | html | opendatadc.org has 2012-15. have scraper | |
Transportation | WMATA | Elevator/escator outage | html | opendatadc.org has a time series | |
Transportation | DDOT | 2006-2013 bike crash data | API; multiple | X | |
Transportation | DDOT | Traffic volume maps, 2002-11 | PDF map with volume notations near street | ||
Transportation | DDOT | DC Bike Count data | XLS | 2002-15 person-led bike counts | |
Transportation | Capital Bikeshare | station feed, trip history, member surveys | XML,csv, pdf | ||
Transportation | Arlington County | automated bike counts in VA, MD, and DC | XML | have scraper, need to productionalize | |
Transportation | DDOT | Permits issued by DDOT for use of public space | searchable database | ||
Food | ABRA | Liquor License Holders | Replaced every 6ish months;have two previous copies | ||
Food | DOH | rolling last 3 years food & hygiene inspections | HTML | have rudimentary scrapper; opendatadc.org has history 2010-2015 | |
Crime | MPDC | Crime 2000-14 | csv | ||
Crime | MPDC | DC Crime Map | csv | searchable database, annual datasets at opendata.dc.gov | |
Crime | MPDC | DC Crime Stats | html, pdf | citywide crime + traffic fatalities | |
City | DHR | DC Employee Salaries | X | ||
City | annual FOIA Report Statistics | annual FOIA request counts by agency | |||
City | Multiple | 311 Requests | multiple | Current 311 requests on the last 30 days map. opendata.dc.org has last 30 days datasets for request types. 2010-13 on opendatadc.org | |
City | Council | Legislative information | JSON | information about bills, resolutions, contracts and reports submitted to the Council | |
Building | DCRA | Certificate of Occupancy | XLSX | released during GS hackathons | |
Building(ish) | AirBnB | CSV | scraped October 3, 2015 | ||
Budget | OCFO | 7 years of DC budget visualized | HTML | maybe if you create an account you can download the raw data? | |
Budget | OCFO | DC Capital Improvement Plan, 2010-15 | XML/JSON | scraped by Chris Given! | |
Budget | DMPED | Great Streets Grantees | csv | ||
Environment | DOEE | Air Quality Data | HTML | ||
Environment | DOEE | Water Quality Data | HTML | River, not drinking, water |
Topics | Agency | Summary (and hyperlink) | Data Type | Notes |
---|---|---|---|---|
Science | NIH | ExPORTER: abstracts of all funded grants | XML,CSV | Patents,publications and clinical studies too, but these are incomplete |