Speech-to-Text for the Ukrainian language based on Silero

This repository contains demonstration code that uses the Silero model for Ukrainian speech-to-text recognition.

Quality:

Results on the Common Voice 7 test set (4300+ samples):

WER: 0.2318 (i.e., word-level accuracy of 76.82%)
CER: 0.0624
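
For readers unfamiliar with these metrics, here is a minimal sketch of how a per-utterance WER and CER can be computed from a reference transcript and a recognized transcript. It is not the evaluation script used to produce the numbers above; the function names `edit_distance`, `wer`, and `cer` are hypothetical, and the sample sentences are made up for illustration. Reported corpus-level scores usually sum the edits over all samples before dividing, which this sketch does not do.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (lists of words or strings)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # deletion
                curr[j - 1] + 1,     # insertion
                prev[j - 1] + cost,  # substitution or match
            ))
        prev = curr
    return prev[-1]


def wer(reference, hypothesis):
    """Word error rate: word-level edits divided by reference word count."""
    ref_words = reference.split()
    # Assumes a non-empty reference; an empty one would divide by zero.
    return edit_distance(ref_words, hypothesis.split()) / len(ref_words)


def cer(reference, hypothesis):
    """Character error rate: character-level edits divided by reference length."""
    # Assumes a non-empty reference; spaces count as characters here.
    return edit_distance(reference, hypothesis) / len(reference)


if __name__ == "__main__":
    reference = "добрий день усім"   # made-up sample
    hypothesis = "добрий день усіх"  # one wrong character in the last word
    print(f"WER: {wer(reference, hypothesis):.4f}")
    print(f"CER: {cer(reference, hypothesis):.4f}")
```

One wrong character makes the whole word count as an error, which is why WER is typically much higher than CER for the same output.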

Install dependencies with pipenv and enter the Python environment:

pipenv install
pipenv shell

Run the demos:

python test_official_demo.py
python own_recodings_demo.py

Alternatively, install PyTorch and other libraries with conda:

conda install pytorch torchvision torchaudio cpuonly -c pytorch

Run the demos:

cd C:\path\to\project

python .\test_official_demo.py
python .\own_recodings_demo.py

This setup was tested on Windows x64 only.

egorsmkv/ua-silero-demo