🇺🇦
Speech Recognition for Ukrainian
The aim of this repository is to collect information and datasets for speech recognition in Ukrainian.
Get in touch with us in our Telegram group: https://t.me/speech_recognition_uk
Implementations
Silero
- Silero v1: https://github.com/snakers4/silero-models (demo code: https://github.com/egorsmkv/ua-silero-demo; a demo is also available as a Telegram bot: https://t.me/ukr_stt_bot)
VOSK
- VOSK v3 nano (with dynamic graph): https://drive.google.com/file/d/1Pwlxmtz7SPPm1DThBPM3u66nH6-Dsb1n/view?usp=sharing (73 MB)
- VOSK v3 small (with dynamic graph): https://drive.google.com/file/d/1Zkambkw2hfpLbMmpq2AR04-I7nhyjqtd/view?usp=sharing (133 MB)
- VOSK v3 (with dynamic graph): https://drive.google.com/file/d/173cqiJUU0GUG4R-T5nziftQSU03Hmi5c/view?usp=sharing (345 MB)
- VOSK v3: https://drive.google.com/file/d/17umTgQuvvWyUiCJXET1OZ3kWNfywPjW2/view?usp=sharing (343 MB)
- VOSK v2: https://drive.google.com/file/d/1MdlN3JWUe8bpCR9A0irEr-Icc1WiPgZs/view?usp=sharing (339 MB, demo code: https://github.com/egorsmkv/vosk-ukrainian-demo)
- VOSK v1: https://drive.google.com/file/d/1nzpXRd4Gtdi0YVxCFYzqtKKtw_tPZQfK/view?usp=sharing (87 MB, an older model trained on less data)
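Any of the VOSK models above can be tried from Python with the `vosk` package. The following is a minimal sketch, not the repository's own demo code: it assumes the chosen archive has already been downloaded and unpacked into a local directory, and that the input is a 16-bit mono PCM WAV file. The function names `join_results` and `transcribe` are made up for this illustration.

```python
import json
import wave


def join_results(json_chunks):
    """Join the "text" fields of Vosk JSON results into one transcript."""
    texts = (json.loads(chunk).get("text", "") for chunk in json_chunks)
    return " ".join(t for t in texts if t)


def transcribe(wav_path, model_dir):
    """Transcribe a 16-bit mono PCM WAV file with an unpacked Vosk model."""
    # Imported lazily so join_results works even without vosk installed.
    from vosk import Model, KaldiRecognizer

    with wave.open(wav_path, "rb") as wav:
        if wav.getnchannels() != 1 or wav.getsampwidth() != 2:
            raise ValueError("expected 16-bit mono PCM audio")
        recognizer = KaldiRecognizer(Model(model_dir), wav.getframerate())
        chunks = []
        while True:
            data = wav.readframes(4000)
            if not data:
                break
            # AcceptWaveform returns True once an utterance is complete.
            if recognizer.AcceptWaveform(data):
                chunks.append(recognizer.Result())
        # Flush whatever audio is still buffered in the recognizer.
        chunks.append(recognizer.FinalResult())
    return join_results(chunks)
```

With hypothetical paths, `transcribe("sample.wav", "vosk-model-uk-v3")` would return the recognized text as a single string. The sample rate is read from the WAV header, so the audio should match the rate the model was trained on.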
DeepSpeech
- DeepSpeech using transfer learning from the English model: https://github.com/robinhad/voice-recognition-ua
Development
- How to train your own model using Kaldi (in Russian): https://github.com/egorsmkv/speech-recognition-uk/blob/master/vosk-model-creation/INSTRUCTION.md
Datasets
💪
Compiled dataset from different open sources + Companies + Community = 135.21 GB / ~1000 hours
- Mega: https://mega.nz/folder/mmQjjaKT#TSzSw7bv97IUGrVD7QfMQw (use MEGAcmd to download; downloading in a browser is speed-limited)
- Torrent file: https://academictorrents.com/details/50f7a8e6157a9c2e38919afee0a11d8145e35556 (121.77 GB)
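The MEGAcmd download can also be scripted. This is a rough sketch that assumes MEGAcmd is installed, that its `mega-get` command is on `PATH`, and that the installed version accepts this folder link. The helper names and the destination directory are hypothetical.

```python
import shutil
import subprocess

# Folder link taken from the list above.
DATASET_URL = "https://mega.nz/folder/mmQjjaKT#TSzSw7bv97IUGrVD7QfMQw"


def build_mega_get_command(url, dest_dir):
    """Return the argument list for MEGAcmd's mega-get."""
    return ["mega-get", url, dest_dir]


def download_dataset(dest_dir, url=DATASET_URL):
    """Download the shared folder into dest_dir via MEGAcmd."""
    if shutil.which("mega-get") is None:
        raise RuntimeError("mega-get not found; install MEGAcmd first")
    # check=True raises CalledProcessError if the download fails.
    subprocess.run(build_mega_get_command(url, dest_dir), check=True)
```

For example, `download_dataset("./ukrainian-speech")` would start the transfer into a local directory. Make sure there is enough free disk space for the full dataset first.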
Companies
- Mozilla Common Voice has a Ukrainian dataset: https://commonvoice.mozilla.org/uk/datasets
- M-AILABS Ukrainian Corpus: http://www.caito.de/data/Training/stt_tts/uk_UK.tgz
Community
- VoxForge Repository: http://www.repository.voxforge1.org/downloads/uk/Trunk/