Minimum Word Error Rate Training for Speech Separation

Minimum word error training approach using VoiceFilter and DeepSpeech.

Result

Result based on 80,000 iteration.
The test dataset is fully mixed, therefore, both SDR and WER of the original mixture is already poor.
To compare to the performance of original, need to use partially mixed dataset.

wget https://github.com/mozilla/DeepSpeech/releases/download/v0.5.0/deepspeech-0.5.0-models.tar.gz
tar xvfz deepspeech-0.5.0-models.tar.gz

  pip install -r requirement.txt

  python client.py 8080
  python client.py 8081
  ...
  python client.py 8087

#In deepspeech-client
(venv) ./generator.sh [PATH_OF_AUDIO_FILE]

Apache License 2.0

This repository contains codes adapted/copied from the followings: