sovse's Stars
mct10/RepCodec
Models and code for RepCodec: A Speech Representation Codec for Speech Tokenization
lifeiteng/naturalspeech3_facodec
FACodec: Speech Codec with Attribute Factorization used for NaturalSpeech 3
alphacep/vosk-tts
Text To Speech Synthesis with Vosk
wenet-e2e/wesubtitle
Extract hard-coded video subtitles using OCR
sovse/Speech_38_ru_commands
Recognition of 38 speech commands in Russian. Based on Yandex Cup 2021 ML Challenge: ASR
alphacep/awesome-russian-speech
Russian speech technology links
dctian/DeepPiCar
Deep Learning Autonomous Car based on Raspberry Pi, SunFounder PiCar-V Kit, TensorFlow, and Google's EdgeTPU Co-Processor
DWCTOD/cv-arxiv-daily
CARNIVAL-IITP/Automatic_gain_control
luiszeni/yolact_onnx
A simple, fully convolutional model for real-time instance segmentation.
jitsi/jiwer
Evaluate your speech-to-text system with similarity measures such as word error rate (WER)
PaddlePaddle/PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
mocleiri/tensorflow-micropython-examples
A custom MicroPython firmware integrating TensorFlow Lite for Microcontrollers and ulab to implement the TensorFlow Micro examples.
NVIDIA/NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
pytorch/audio
Data manipulation and transformation for audio signal processing, powered by PyTorch
dusty-nv/jetson-voice
ASR/NLP/TTS deep learning inference library for NVIDIA Jetson using PyTorch and TensorRT
Koziev/rusyllab
Simple Python package for breaking Russian words into syllables
sooftware/conformer
[Unofficial] PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)
nasir0md/unsupervised-learning-entrainment
This repository contains the scripts for the models of deep unsupervised learning of vocal entrainment
Malkovsky/interactive-visualization
MahmoodGhouri001/deskew-scanned-images
sovse/tflite_avto_num_recognation
License plate recognition. Model training and conversion to TFLite
snakers4/open_stt
Open STT
sovse/Rus-SpeechRecognition-LSTM-CTC-VoxForge
Russian speech recognition using TensorFlow, trained on the VoxForge dataset
vault-42/AIND_DNN_Speech_Recognizer
End-to-end speech to text recognition
ainy/shershe
Speech recognition dataset based on a Russian audiobook, with a sentence-level split
mleimeister/ctc_tensorflow_voxforge
Simple example of how to use TensorFlow's CTC loss with VoxForge speech data
standy66/pycon-speech-recognition-2017
PyCon Russia 2017 speech recognition examples