/SMT

Official implementation of the Sheet Music Transformer

Primary LanguagePythonMIT LicenseMIT

PRAIG-logo LITIS-logo

Sheet Music Transformer: End-To-End Optical Music Recognition Beyond Monophonic Transcription

Python PyTorch Lightning License

AboutHow To UseCitationsAcknowledgmentsLicense

About

This GitHub repository contains the implementation of the Sheet Music Transfomrmer (SMT), novel model for Optical Music Recognition (OMR) beyond monophonic level transcription. Unlike traditional approaches that primarily leverage monophonic transcription techniques for complex score layouts, the SMT model overcomes these limitations by offering a robust image-to-sequence solution for transcribing polyphonic musical scores directly from images.

content

How to use

Usage instructions coming soon

Citations

@misc{riosvila2024SMT,
	title        = {Sheet Music Transformer: End-To-End Optical Music Recognition Beyond Monophonic Transcription},
	author       = {Antonio Ríos-Vila and Jorge Calvo-Zaragoza and Thierry Paquet},
	year         = 2024,
	eprint       = {2402.07596},
	archiveprefix = {arXiv},
	primaryclass = {cs.CV}
}

Acknowledgments

This work is part of the I+D+i PID2020-118447RA-I00 (MultiScore) project, funded by MCIN/AEI/10.13039/501100011033. Computational resources were provided by the Valencian Government and FEDER funding through IDIFEDER/2020/003.

License

This work is under a MIT license.