victorbcyang

HP Inc.Boston, MA

victorbcyang's Stars

openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
Language:Python67.7k 564 08k
xai-org/grok-1
Grok open release
Language:Python49.4k 562 2098.3k
ml-explore/mlx
MLX: An array framework for Apple silicon
Language:C++16.5k 141 510943
kaldi-asr/kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
Language:Shell14.1k 694 1.6k5.3k
ml-explore/mlx-examples
Examples in the MLX framework
Language:Python5.8k 68 458829
spotify/pedalboard
🎛 🔊 A Python library for audio.
Language:C++5.1k 55 184259
koute/bytehound
A memory profiler for Linux.
Language:C4.4k 59 83188
snakers4/silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
Language:Python4k 50 227395
wiseman/py-webrtcvad
Python interface to the WebRTC Voice Activity Detector
Language:C2k 49 82405
modelscope/3D-Speaker
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
Language:Python1.1k 17 9294
bbqsrc/cargo-ndk
Compile Rust projects against the Android NDK without hassle
Language:Rust677 16 8462
llohse/libnpy
C++ library for reading and writing of numpy's .npy files
Language:C++360 7 1870
csukuangfj/kaldifeat
Kaldi-compatible online & offline feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd - Provide C++ & Python API
Language:C++186 7 3735
Le-Xiaohuai-speech/DPCRN_DNS3
Implementation of paper "DPCRN: Dual-Path Convolution Recurrent Network for Single Channel Speech Enhancement"
Language:Python181 2 4140
GAMMA-UMD/pygsound
Impulse response generation based on state-of-the-art geometric sound propagation engine.
Language:C++140 1 921
bycloudai/SwapCudaVersionWindows
How to swap/switch CUDA versions on Windows
139 1 111
cpuimage/resampler
A Simple and Efficient Audio Resampler Implementation in C
Language:C137 6 460
chirlu/soxr
The SoX resampler library
Language:C120 10 842
unilight/seq2seq-vc
A sequence-to-sequence voice conversion toolkit.
Language:Python84 5 1510
sungwon23/BSRNN
Language:Python79 2 713
csukuangfj/kaldi-native-fbank
Kaldi-compatible online fbank extractor without external dependencies
Language:C++74 4 1119
microsoft/SIG-Challenge
Language:Python72 15 35
Okrio/CRUSE
a lightweight network for monaural speech enhancement
Language:Python47 4 210
tencent-ailab/UltraDualPathCompression
A Pytorch-based implementation of the compression and decompression module in "Ultra Dual-Path Compression For Joint Echo Cancellation And Noise Suppression".
Language:Jupyter Notebook36 3 12
echocatzh/GFTNN
Gated Convolutional F-T-LSTM Neural Network
Language:HTML32 1 512
enhancer12/TSPNN
Two-stage progressive neural network for acoustic echo cancellation
Language:Python32 4 211
uw-x/AcousticSwarms-Speech
Language:Python27 5 07
LMSAudio/Complex_PF
RES via complex-valued DNN
Language:Python22 5 214
ShiftMediaProject/soxr
Unofficial Soxr with added custom native Visual Studio project build tools. Soxr: A library for performing one-dimensional sample-rate conversion.
Language:C13 7 016
breizhn/sms_wsj
SMS-WSJ: Spatialized Multi-Speaker Wall Street Journal database for multi-channel source separation and recognition
Language:Python2 0 00

victorbcyang

victorbcyang's Stars

openai/whisper

xai-org/grok-1

ml-explore/mlx

kaldi-asr/kaldi

ml-explore/mlx-examples

spotify/pedalboard

koute/bytehound

snakers4/silero-vad

wiseman/py-webrtcvad

modelscope/3D-Speaker

bbqsrc/cargo-ndk

llohse/libnpy

csukuangfj/kaldifeat

Le-Xiaohuai-speech/DPCRN_DNS3

GAMMA-UMD/pygsound

bycloudai/SwapCudaVersionWindows

cpuimage/resampler

chirlu/soxr

unilight/seq2seq-vc

sungwon23/BSRNN

csukuangfj/kaldi-native-fbank

microsoft/SIG-Challenge

Okrio/CRUSE

tencent-ailab/UltraDualPathCompression

echocatzh/GFTNN

enhancer12/TSPNN

uw-x/AcousticSwarms-Speech

LMSAudio/Complex_PF

ShiftMediaProject/soxr

breizhn/sms_wsj