MOS score prediction by fine-tuned wav2vec2.0 model
Keywords: MOS-Net, MB-Net, PESQ, STOI, speech quality
The package installation was tested with python3.9
pip install git+https://github.com/AndreevP/wvmos
from wvmos import get_wvmos
model = get_wvmos(cuda=True)
mos = model.calculate_one("path/to/wav/file") # infer MOS score for one audio
mos = model.calculate_dir("path/to/dir/with/wav/files", mean=True) # infer average MOS score across .wav files in directory
This work was done for the deep learning course in Skolteh university by Pavel Andreev, Nikolay Patakin, Oleg Desheulin, Alexander Kagan and Arthur Bulanbaev. More details are described in paper https://arxiv.org/abs/2203.13086