- Kaggle
- Registry of Open Data on AWS
- JSONPlaceholder - Fake Online REST API for Testing and Prototyping
- scikit-learn
- awesomedata/awesome-public-datasets
- Important, commonly-used datasets in high quality, easy-to-use & open form as data packages
- Enron Email Dataset
- Google Books Ngrams
- Million Song Dataset
- London Stock Exchange Group - TRADE REPOSITORY PUBLIC DATA
- cooldatasets.com
- English-language Wikipedia
- UCI Machine Learning Repository
- OpenRefine/OpenRefine
- OpenRefine is a free, open source power tool for working with messy data and improving it.