13 free dataset sources for Machine Learning and Deep Learning applications
- Google Dataset Search – A search engine for datasets: https://datasetsearch.research.google.com/
- IBM’s collection of datasets for enterprise applications: https://developer.ibm.com/exchanges/data/
- Kaggle Datasets: https://www.kaggle.com/datasets
- Huggingface Datasets – A Python library for loading NLP datasets: https://github.com/huggingface/datasets
- A large list organized by application domain: https://github.com/awesomedata/awesome-public-datasets
- Computer Vision Datasets (a really large list): https://homepages.inf.ed.ac.uk/rbf/CVonline/Imagedbase.htm
- Datasetlist – Datasets by domain: https://www.datasetlist.com/
- OpenML – A search engine for curated datasets and workflows: https://www.openml.org/search?type=data
- Papers with Code – Datasets with benchmarks: https://www.paperswithcode.com/datasets
- Penn Machine Learning Benchmarks: https://github.com/EpistasisLab/pmlb/tree/master/datasets
- UCI Machine Learning Repository: https://archive.ics.uci.edu/ml/index.php
- VisualDataDiscovery (for Computer Vision): https://www.visualdata.io/discovery
- Roboflow Public Datasets for computer vision: https://public.roboflow.com/
- 23 Best Free Human Annotated Datasets for Machine Learning https://www.iguazio.com/blog/best-free-human-annotated-datasets-for-ml/