Pinned Repositories
asr_evaluation_scripts
Scripts for the low-resource ASR shared task
base_rus_whisper_stt
Fine tuning of the base model from OpenAI Whisper in Russian language on the dataset Sber-golos
deep-learning-coursera
Deep Learning Specialization by Andrew Ng on Coursera.
deep_learning_python_intro
Materials for the course on programming deep neural networks in Python (Russian)
diy-alexa
Command recognition research
DL4Img
书籍《深度学习技术图像处理入门》代码环境 Docker 文件
mlcourse.ai
Open Machine Learning Course
Rus-SpeechRecognition-LSTM-CTC-VoxForge
Распознавание речи русского языка используя Tensorflow, обучаясь на базе Voxforge
Speech_38_ru_commands
Recognition of 38 speech commands in russian. Based on Yandex Cup 2021 ML Challenge: ASR
tflite_avto_num_recognation
License plate recognition . Model training and conversion to tflite
sovse's Repositories
sovse/Rus-SpeechRecognition-LSTM-CTC-VoxForge
Распознавание речи русского языка используя Tensorflow, обучаясь на базе Voxforge
sovse/base_rus_whisper_stt
Fine tuning of the base model from OpenAI Whisper in Russian language on the dataset Sber-golos
sovse/tflite_avto_num_recognation
License plate recognition . Model training and conversion to tflite
sovse/Speech_38_ru_commands
Recognition of 38 speech commands in russian. Based on Yandex Cup 2021 ML Challenge: ASR
sovse/mlcourse.ai
Open Machine Learning Course
sovse/asr_evaluation_scripts
Scripts for the low-resource ASR shared task
sovse/deep-learning-coursera
Deep Learning Specialization by Andrew Ng on Coursera.
sovse/deep_learning_python_intro
Materials for the course on programming deep neural networks in Python (Russian)
sovse/diy-alexa
Command recognition research
sovse/DL4Img
书籍《深度学习技术图像处理入门》代码环境 Docker 文件
sovse/dla
Deep learning for audio processing
sovse/FROM-HSE-DM-ML
sovse/INTERSPEECH19_TUTORIAL
Interspeech 2019 tutorial materials
sovse/interspeech2019-tutorial
INTERSPEECH 2019 Tutorial Materials
sovse/jetson-voice
ASR/NLP/TTS deep learning inference library for NVIDIA Jetson using PyTorch and TensorRT
sovse/las
tf 2.0 implementation of Listen, attend and spell
sovse/Mask_RCNN
Mask R-CNN for object detection and instance segmentation on Keras and TensorFlow
sovse/maskcam
Jetson Nano-based smart camera system that measures crowd face mask usage in real-time.
sovse/pycon-speech-recognition-2017
PyCon Russia 2017 speech recognition examples
sovse/python-pesq
A python package for calculating the PESQ.
sovse/RNN-Transducer
MXNet implementation of RNN Transducer (Graves 2012): Sequence Transduction with Recurrent Neural Networks
sovse/rnnt-speech-recognition
End-to-end speech recognition using RNN Transducers in Tensorflow 2.0
sovse/SoundNet-tensorflow
TensorFlow implementation of "SoundNet".
sovse/speech_hackathon_2019
sovse/tensorflow-on-orange-pi
TensorFlow for Orange Pi - In Work
sovse/tutorial_wav2vec2
Tutorial speech recognition fine tuning and inference wav2vec2
sovse/voice_datasets
🔊 A comprehensive list of open-source datasets for voice and sound computing (40+ datasets).
sovse/warp-ctc
Pytorch Bindings for warp-ctc
sovse/WaveNet
Pytorch implement WaveNet