AbaiCPyJ's Stars
alex/what-happens-when
An attempt to answer the age-old interview question "What happens when you type google.com into your browser and press enter?"
facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
HumanSignal/label-studio
Label Studio is a multi-type data labeling and annotation tool with standardized output format
Hannibal046/Awesome-LLM
Awesome-LLM: a curated list of Large Language Models
bharathgs/Awesome-pytorch-list
A comprehensive list of PyTorch-related content on GitHub, such as different models, implementations, helper libraries, tutorials, etc.
NVIDIA/NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Mooler0410/LLMsPracticalGuide
A curated list of practical guide resources for LLMs (LLMs Tree, Examples, Papers)
speechbrain/speechbrain
A PyTorch-based Speech Toolkit
espnet/espnet
End-to-End Speech Processing Toolkit
triton-inference-server/server
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
pyannote/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
flashlight/wav2letter
Facebook AI Research's Automatic Speech Recognition Toolkit
udacity/deep-learning-v2-pytorch
Projects and exercises for the latest Deep Learning ND program https://www.udacity.com/course/deep-learning-nanodegree--nd101
snakers4/silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
markovka17/dla
Deep learning for audio processing
zycv/awesome-keyword-spotting
This repository is a curated list of awesome Speech Keyword Spotting (Wake-Up Word Detection).
sberdevices/golos
yandexdataschool/speech_course
YSDA course in Speech Processing.
MUSoC/Visualization-of-popular-algorithms-in-Python
Visualization of popular algorithms using the NetworkX graph library
mbinkowski/DeepSpeechDistances
Authors' implementation of DeepSpeech Distances.
IS2AI/Kazakh_TTS
An expanded version of the previously released Kazakh text-to-speech (KazakhTTS) synthesis corpus. In KazakhTTS2, the overall size has increased from 93 hours to 271 hours, the number of speakers has risen from two to five (three females and two males), and the topic coverage has been diversified.
aakashverma1124/Data-Structures-and-Algorithms-for-Interviews
This repository contains code for a webinar on linked lists in Java, Python, and C++.
GeorgeFedoseev/DeepSpeech
Russian Speech Recognition system based on Mozilla's DeepSpeech TensorFlow implementation.
TextDatasetCleaner/TextDatasetCleaner
🔬 Cleaning junk out of datasets (normalization, preprocessing)
vlomme/Bert-Russian-punctuation
A simple BERT-based model for comma placement
Kirili4ik/QuartzNet-ASR-pytorch
Automatic Speech Recognition (ASR) model QuartzNet trained on English CommonVoice. Implemented in PyTorch with CTC loss and beam search.
Mega4alik/Text-Normalization-for-Kazakh-Language
IvanRychkov/toads
Helpers for Data Science projects
AbaiCPyJ/Does-the-student-cheat-
The program uses a webcam to check whether a student looks away during distance learning or an exam.
dangrebenkin/audiocorpusbuilder
Command-line package for automatically creating a Russian-language audio corpus (speech-text pairs) from YouTube audio tracks and subtitles