KirillTaE's Stars
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
KwaiVGI/LivePortrait
Bring portraits to life!
modelscope/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
snakers4/silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
Rikorose/DeepFilterNet
Noise supression using deep filtering
wiseman/py-webrtcvad
Python interface to the WebRTC Voice Activity Detector
DachunKai/EvTexture
[ICML 2024] EvTexture: Event-driven Texture Enhancement for Video Super-Resolution
transcriptionstream/transcriptionstream
turnkey self-hosted offline transcription and diarization service with llm summary
marsbroshok/VAD-python
Voice Activity Detector in Python
hcmlab/vadnet
Real-time Voice Activity Detection in Noisy Eniviroments using Deep Neural Networks
minzwon/sota-music-tagging-models
FUlyankin/matstat-AB
Курс по матстату для онлайна :)
yxlu-0102/MP-SENet
Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement
hamadichihaoui/BIRD
This is the official implementation of "Blind Image Restoration via Fast Diffusion Inversion"
DongKeon/Awesome-Speaker-Diarization
Some comprehensive papers about speaker diarization
FusionBrainLab/OmniFusion
OmniFusion — a multimodal model to communicate using text and images
rishikksh20/hifigan-denoiser
HiFi-GAN: High Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks
qiuqiangkong/panns_inference
ap-atul/Audio-Denoising
Noise removal/ reducer from the audio file in python. De-noising is done using Wavelets and thresholding is done by VISU Shrink thresholding technique
salute-developers/GigaAM
Foundational Model for Speech Recognition Tasks
zhenghuatan/rVADfast
This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsupervised Segment-Based Robust Voice Activity Detection Method.
haidog-yaqub/DiffPitcher
Diffusion-based singing voice pitch correction
mogwai/nanodrz
Speaker Diarization with Transformers
sa-if/Audio-Denoiser
Python based audio denoiser 🔉
ap-atul/wavelets-ext
A re-implementation of the Wavelets package using Cython to improve the speed.
MorenoLaQuatra/vad
Simple voice activity detection (VAD) algorithm in Python
muhd-umer/deep-suppressor
DeepSuppressor: A deep learning-based approach to speech denoising
byramsubramanian/yt-video-summarizer
Video Summarization Experiments with Open LLMs
georgid/vocal-detection
NTUT-LabASPL/FMA-C-DataSet-for-Vocal-Detection
Vocal detection data sets for deep learning.