/omnizart

Omniscient Mozart, being able to transcribe everything in the music, including vocal, drum, chord, beat, instruments, and more.

Primary LanguagePythonMIT LicenseMIT

OMNIZART

build docs PyPI version PyPI - License Downloads Docker Pulls

Omnizart is a Python library that aims for democratizing automatic music transcription. Given polyphonic music, it is able to transcribe pitched instruments, vocal melody, chords, drum events, and beat. This is powered by the research outcomes from Music and Culture Technology (MCT) Lab.

Transcribe your favorite songs now in Colab! Open In Colab

Quick start

Visit the complete document for detailed guidance.

Pip

# Install omnizart
pip install omnizart

# Download the checkpoints
omnizart download-checkpoints

# Transcribe your songs
omnizart drum transcribe <path/to/audio.wav>
omnizart chord transcribe <path/to/audio.wav>
omnizart music transcribe <path/to/audio.wav>

Docker

docker pull mctlab/omnizart:latest
docker run -it mctlab/omnizart:latest bash

Supported applications

Application Transcription Training Evaluation Description
music ✔️ ✔️ Transcribe musical notes of pitched instruments.
drum ✔️ ⁉️ Transcribe events of percussive instruments.
vocal ✔️ ✔️ Transcribe note-level vocal melody.
vocal-contour ✔️ ✔️ Transcribe frame-level vocal melody (F0).
chord ✔️ ✔️ Transcribe chord progressions.
beat ✔️ ✔️ Transcribe beat position.

NOTES The current implementation for the drum model has unknown bugs, preventing loss convergence when training from scratch. Fortunately, you can still enjoy drum transcription with the provided checkpoints.

Compatibility Issue

Currently, Omnizart is incompatible for ARM-based MacOS system due to the underlying dependencies. More details can be found in the issue #38.