/smude

Sheet music dewarping, primarily for pictures taken with smartphones.

Primary LanguagePythonOtherNOASSERTION

Smude - Sheet Music Dewarping ⚠️ beta ⚠️

Smude is a library dedicated to binarization and dewarping/rectification of sheet music images taken with smartphones:

Installation

Using Conda

Clone this repository and :

$ git clone https://github.com/sonovice/smude.git
$ cd smude
$ conda env create -f environment.yml
$ conda activate smude

Using Python

Clone this repository and install the package using the setup.py:

$ git clone https://github.com/sonovice/smude.git
$ cd smude
$ python3 setup.py install

Usage

Via the command line

Installing the package adds a command-line interface called smude. smude -h provides the instructions.

$ smude -h
usage: smude [-h] [-o OUTFILE] [--no-binarization] [--use-gpu] infile

Dewarp and binarize sheet music images.

positional arguments:
  infile                Specify the input image

optional arguments:
  -h, --help            show this help message and exit
  -o OUTFILE, --outfile OUTFILE
                        Specify the output image (default: result.png)
  --no-binarization     Deactivate binarization
  --use-gpu             use GPU

Using the Library

See example.py for a simple usage example:

from skimage.io import imread, imsave
from smude import Smude

image = imread('images/input_fullsize.jpg')
smude = Smude(use_gpu=False, binarize_output=True)
result = smude.process(image)
imsave('result.png', result)

⚠️ Smude will download a ~348 MB Deep Learning model on the first run!

Approach

Rectification of the pages of sheet music is divided into several steps:

  1. Extraction of the sheet music page from a smartphone image, the so-called "Region of Interest" (ROI)
  2. Adaptive binarization
  3. Pixelwise segmentation into the classes "upper staff line", "lower staff line" and "bar line" using U-Net
  4. Vanishing point estimation
  5. Spline interpolation for staff lines
  6. Dewarping

The Deep Learning model was trained on thousands of public domain scores downloaded from musescore.com, heavily augmented with plain xml foo, rendered with Verovio and artificially warped using code from NVlabs/ocrodec.

The actual dewarping algorithm is loosely based on this paper:

Meng, G. et. al. (2012): Metric Rectification of Curved Document Images. IEEE Transactions on Pattern Analysis and Machine Intelligence, 34(4), p. 707-722.

(DOI: https://doi.org/10.1109/TPAMI.2011.151)

Side Notes

Smude works best under these conditions:

  • The entire page should be covered in the input image plus some extra margins.
  • Make sure the sheet music page is evenly lit.
  • Unsharp/defocused images may work but mostly won't.
  • The dewarping algorithm assumes that the curved page shape is a General Cylindric Surface. In practice, these are usually pages that are bound in a book and thus often exhibit a curvature when opened.

License

This repository is under the "Commons Clause" License Condition v1.0 on top of GNU AGPLv3.