victorbcyang's Stars
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
xai-org/grok-1
Grok open release
ml-explore/mlx
MLX: An array framework for Apple silicon
kaldi-asr/kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
ml-explore/mlx-examples
Examples in the MLX framework
spotify/pedalboard
🎛 🔊 A Python library for audio.
koute/bytehound
A memory profiler for Linux.
snakers4/silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
wiseman/py-webrtcvad
Python interface to the WebRTC Voice Activity Detector
modelscope/3D-Speaker
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
bbqsrc/cargo-ndk
Compile Rust projects against the Android NDK without hassle
llohse/libnpy
C++ library for reading and writing of numpy's .npy files
csukuangfj/kaldifeat
Kaldi-compatible online & offline feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd - Provide C++ & Python API
Le-Xiaohuai-speech/DPCRN_DNS3
Implementation of paper "DPCRN: Dual-Path Convolution Recurrent Network for Single Channel Speech Enhancement"
GAMMA-UMD/pygsound
Impulse response generation based on state-of-the-art geometric sound propagation engine.
bycloudai/SwapCudaVersionWindows
How to swap/switch CUDA versions on Windows
cpuimage/resampler
A Simple and Efficient Audio Resampler Implementation in C
chirlu/soxr
The SoX resampler library
unilight/seq2seq-vc
A sequence-to-sequence voice conversion toolkit.
sungwon23/BSRNN
csukuangfj/kaldi-native-fbank
Kaldi-compatible online fbank extractor without external dependencies
microsoft/SIG-Challenge
Okrio/CRUSE
a lightweight network for monaural speech enhancement
tencent-ailab/UltraDualPathCompression
A Pytorch-based implementation of the compression and decompression module in "Ultra Dual-Path Compression For Joint Echo Cancellation And Noise Suppression".
echocatzh/GFTNN
Gated Convolutional F-T-LSTM Neural Network
enhancer12/TSPNN
Two-stage progressive neural network for acoustic echo cancellation
uw-x/AcousticSwarms-Speech
LMSAudio/Complex_PF
RES via complex-valued DNN
ShiftMediaProject/soxr
Unofficial Soxr with added custom native Visual Studio project build tools. Soxr: A library for performing one-dimensional sample-rate conversion.
breizhn/sms_wsj
SMS-WSJ: Spatialized Multi-Speaker Wall Street Journal database for multi-channel source separation and recognition