Learn how to use Prisma 2 with these datasets here.   


What is it?

A collection of easily-manageable, interesting, free-to-use datasets for learning and experimentation

Datasets are collections of data that corresponds to the contents of a single database table and relates to a certain subject. They can be leveraged to tell stories, help you investigate an issue, or give you deeper understanding into hypotheses.

A lot of applications and algorithms depend heavily on large samples of data to ensure quality. Generally speaking, the more complicated the task is, the more data is required. A lot of public datasets are available for you to explore, such as Awesome Public Datasets.

At Prisma, we recognize that a lot of open and public datasets are tailored for data science and machine learning purposes and we know that it is valuable to have a smaller batch of interesting and realistic data to work with that is also easy to set up.

Our goal is to curate a collection of datasets that contain smaller batches of interesting data that is also easy to set up with popular databases (i.e. MySQL, PostgreSQL).

Right now, we organize the collection by database type.

Dataset Collection


SQLite

Dataset Description
Climate data Climate data for over 100 cities
Spotify Spotify playlist data
Northwind Northwind (sales data for fictitious foods export-import company)

MySQL

Dataset Description
Climate data Climate data for over 100 cities

PostgreSQL

Dataset Description
Spotify Spotify playlist data
What to brew Beer stouts and how well they go with additions

❤️ We welcome contributions! ❤️

Please see our contribution guide.   

⭐ Star us on GitHub — it helps!   


😎 Engage with our community! 😎

Prisma 2 is not production-ready yet, so we value your feedback! If you run into problems with this tutorial or spot any mistakes, feel free to make a pull request or contact the team directly!


Twitter / Slack