This repository contains the dataset of 250K Scratch projects as described in paper A Dataset of Scratch Programs: Scraped, Shaped and Scored.
The dataset is available as:
- a MySQL database dump, in Gdrive: /MySQL/
- an SQL Server database backup, in Gdrive: /SQLServer/
- CSV files, in folder /Dataset/CSV files/ or in Gdrive: /CSV files/. The version on Gdrive includes headers!
The JSON files of the scraped Scratch projects are available in GDrive: https://drive.google.com/file/d/0B5RLHmerPR2SZ25XMWI5SGxhbTA/view?usp=sharing
The source files of the scraping program that we used for obtaining this information from the Scratch website are in the /Scraper folder.