This data lives at https://github.com/vega/vega-datasets
Common repository for example datasets used by vega related projects. Keep changes to this repository minimal as other projects (vega, vega-editor, vega-lite, polestar, voyager) use this data in their tests and for examples.
The list of sources is in sources.md.
Add this to your package.json:
"vega-datasets": "vega/vega-datasets#gh-pages"
You can also get the data directly via HTTP served by Github like:
https://vega.github.io/vega-datasets/data/cars.json
You can use git subtree to add these datasets to a project. Add data git subtree add
like:
git subtree add --prefix path-to-data git@github.com:vega/vega-datasets.git gh-pages
Update to the latest version of vega-data with
git subtree pull --prefix path-to-data git@github.com:vega/vega-datasets.git gh-pages
- Add weather data for Seattle and New York.
- Add income, zipcodes, lookup data, and a dataset with three independent geo variables.
- Remove all tabs in
github.csv
to prevent incorrect field name parsing.
- Dates in
movies.json
are all recognized as date types by datalib. - Dates in
crimea.json
are now in ISO format (YYYY-MM-DD).
- Fix
cars.json
date format.
- Add Gapminder Health v.s. Income dataset.
- Add generated Github contributions data for punch card visualization.
- Add Anscombe's Quartet dataset.
- Change date format in weather data so that it can be parsed in all browsers. Apparently YYYY/MM/DD is fine. Can also omit hours now.
- Decode origins in cars dataset.
- Add Unemployment Across Industries in US.
- Fixed the date parsing on the CrossFilter datasets -- an older version of the data was copied over on initial import. A script is now available via
npm run flights N
to re-sampleN
records from the originalflights-3m.csv
dataset.
- Add
seattle-weather
dataset. Transformed with https://gist.github.com/domoritz/acb8c13d5dadeb19636c.
- Initial import from Vega and Vega-Lite.
- Change field names in
cars.json
to be more descriptive (hp
toHorsepower
).