This repository provides the organ tablature ocr data set used in the paper Automatic Transcription of Organ Tablature Music Notation with Deep Neural Networks published in Transactions of the International Society for Music Information Retrieval 4(1) in 2021 (http://doi.org/10.5334/tismir.77). The data set consists of a training, validation and testing subset.
The data consists on the one hand of real tablature staves extracted from the scans of two organ tablature books:
- "Orgel oder Instrument Tabulaturbuch" (’Organ or InstrumentTablature Book’) from 1583
- "Ein new künstlich Tabulaturbuch" (’A new artificial tablaturebook’) from 1575
Both books were written by German Organist and Composer Elias Nikolaus Ammerbach. We manually annotated 1200 staves from each book with label sequences.
To increase the amount of available data we employed data augmentation and a synthetic data generator that randomly arranges images of single characters into tablature rows.
The whole data set requires almost 90GB of disc space, which is why instead of the whole data set we distribute our generation and augmentation tools along with the required source images. The annotated real tablature staves are made available in a cloud storage and can be downloaded from there.
The datasets.ipynb
ipython notebook is provided for building the data set locally.
It automatically downloads the required data and runs the data generator and data augmentor with the appropriate parameters to create the training, validation and test set.
Further instructions are provided in the notebook itself.
The following packages need to be installed to run the generator:
- Jupyter:
pip install jupyter
- Numpy:
pip install numpy
- Pillow:
pip install Pillow
- Augmentor:
pip install Augmentor
The src
folder contains all the python code of the data generator and data augmentor program.
The data
directory contains all tablature images and is structured as follows:
generatorSources
: Contains source files for the tablature generatorbackgrounds
: source images for backgrounds and image bordersduration
: source images for duration tablature charactersnote
: source images for note pitch tablature charactersrest
: source images for rest tablature charactersspecial
: source images for special tablature characters (measure lines, repetition signs, text blocks, ...)
realdataSources
: Will be created during data set building. Will contain annotated real organ tablature staves from the two tablature booksgeneratorOutput
: Will be created during data set building. Will serve as output directory for the data generatordatasetOutput
: Will be created during data set building. Will serve as output directory for the final data sets