/midi2voice

Singing synthesis from MIDI file

Primary LanguagePythonMIT LicenseMIT

Singing Synthesis from MIDI file

This script relies on the sinsy.jp website from the Nagoya Institute of Technology which implements a HMM-based Singing Voice Synthesis System.

You can find a sample merged with the instrumental audio here.

Requirements

  • musescore: It's used to convert midi to musicxml
  • python 3

Installation

pip install git+git://github.com/mathigatti/midi2voice.git

Usage

You can use it running the installed module using python -m midi2voice. It has serveral parameters, most are optional and you need to use them only if you want to specify something different than the default

python -m midi2voice
  -h, --help # show this help message and exit
  -l LYRICS, --lyrics LYRICS # Path to txt file containing the lyrics
  -m MIDI, --midi MIDI # Path to midi file
  -lang {english,japanese,mandarin} # Language of the voice (OPTIONAL / DEFAULT: english)
  -g {female,male} # Gender voice (female/male) (OPTIONAL / DEFAULT: female)
  -i VOICEINDEX # Each language has different voices, for example japanese has 4 different female voices at the moment, mandarin only one. (OPTIONAL / DEFAULT: 0)
  -t TEMPO # Song tempo in BPMs (OPTIONAL / DEFAULT 80)
  -s SYNALPHA # Gender parameter [between -0.8 and 0.8, default: 0.55]
  -v VIBPOWER # Vibrato intensity [between 0 and 2, default: 1]
  -p F0SHIFT # Pitch shift [in half tones, between -24 and 24, default: 0]
  -d DESTINATION_FOLDER # Destination folder

Usage example

Check the midi and text samples here.

# Print help
python3 -m midi2voice -h

# Basic example
python -m midi2voice -l shallow.txt -m shallow.mid

# Generate the voice given a midi file and a text file with the lyrics
python -m midi2voice -l shallow.txt -m shallow.mid -lang english -g female -t 96

Try it on Colab

If you don't have python installed or you just want to check it quickly you can try it online here.

Open In Colab

Voices available

At the moment sinsy.jp provides this voices. You can use the VOICEINDEX parameter to choose them.

  • Japanese

    • Female

      • 0 - f00001j_dnn_beta4 : Yoko : Japanese
      • 1 - f00002j_dnn_beta4 : Xiang-Ling : Japanese
      • 2 - f01018j_dnn_beta4 : undefined : Japanese
      • 3 - f00001j : Yoko : Japanese
      • 4 - f00002j : Xiang-Ling : Japanese
      • 5 - f00004j_beta : Namine Ritsu S : Japanese
      • 6 - f00005j : undefined : Japanese
    • Male

      • 0 - m01083j_dnn_beta4 : undefined : Japanese
      • 1 - m01083j : undefined : Japanese
  • English

    • Female

      • 0 - f00002e_dnn_beta4 : Xiang-Ling : English
      • 1 - f00002e : Xiang-Ling : English
    • Male

      • 0 - m00003e_beta : Matsuo-P : English
  • Mandarin

    • Female

      • 0 - f00002m : Xiang-Ling : Chinese (Mandarin)
    • Male

      • None at the moment

Credits

This source code was developed by Mathias Gatti (@mathigatti) if you use it please remember to cite me. For scientific publications you can use this DOI.

Gatti, M. (2020). mathigatti/midi2voice v1.0.0 (v1.0.0) [Computer software]. Zenodo. https://doi.org/10.5281/ZENODO.3969003

DOI

Support my work

If you want to help me to keep going developing and maintaining open-source projects you can contribute buying me some ko-fi.

License

MIT