This is a repository accompanying the following paper:
@inproceedings{StrahlOBM25_TextAlignSynth_SMC,
author = {Sebastian Strahl and Yigitcan {\"O}zer and Hans-Ulrich Berendes and Meinard M{\"u}ller},
title = {Hearing Your Way Through Music Recordings: A Text Alignment and Synthesis Approach},
booktitle = {Proceedings of the Sound and Music Computing Conference ({SMC})},
address = {Graz, Austria},
year = {2025}
}This repository contains an implementation of parts of the processing pipeline described in above paper. The implementation comprises text comment generation for the case studies described in the paper, text-to-speech synthesis using the TTS python package, post-processing of the synthesized speech signals, and superposition with the original recording.
For details and references, please see the paper.
We recommend setting up a Python environment including Pytorch before installing textalignsynth. You may use the example environment provided as part of this package:
git clone https://github.com/groupmm/textalignsynth.git
cd textalignsynth
conda env create -f environment.yaml
conda activate textalignsynthpip install "git+https://github.com/groupmm/textalignsynth.git#egg=textalignsynth"git clone https://github.com/groupmm/textalignsynth.git
cd textalignsynth
pip install -e .⚠️ Does not work on Windows machines! Workaround: Use Windows Subsystem for Linux (WSL).⚠️ German TTS model requiresespeak-ngorespeakto be intalled on the machine!
Automated code style checks via pre-commit:
pip install pre-commit
pre-commit installThe code for this toolbox is published under an MIT license. This does not apply to the data files:
- Schubert songs are taken from the Schubert Winterreise Dataset.
- Beethoven pieces are taken from the Beethoven Piano Sonatas Dataset.
- Wagner operas are taken from the Wagner Ring Dataset.
This work was funded by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) Grant No. 500643750 (MU 2686/15-1). The authors are with the International Audio Laboratories Erlangen, a joint institution of the Friedrich-Alexander-Universität Erlangen-Nürnberg (FAU) and Fraunhofer Institute for Integrated Circuits IIS.