wangshuo182

wangshuo182's Stars

pyenv/pyenv
Simple Python version management
Language:Roff38.8k 382 1.8k3k
yuliskov/SmartTube
SmartTube - an advanced player for set-top boxes and tvs running Android OS
Language:Java18.9k 179 2.6k1k
audacity/audacity
Audio Editor
Language:C12.3k 273 4.2k2.3k
Rudrabha/Wav2Lip
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs
Language:Python10.4k 167 6572.2k
AIGC-Audio/AudioGPT
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
Language:Python10k 133 50858
jaywalnut310/vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
Language:Python6.7k 55 2051.2k
spotify/pedalboard
🎛 🔊 A Python library for audio.
Language:C++5.2k 55 184260
mattdiamond/Recorderjs
A plugin for recording/exporting the output of Web Audio API nodes
Language:JavaScript4.2k 184 1571.5k
xiph/opus
Modern audio compression for the internet.
Language:C2.3k 97 230603
timsainb/noisereduce
Noise reduction in python using spectral gating (speech, bioacoustics, audio, time-domain signals)
Language:Jupyter Notebook1.4k 23 75231
resemble-ai/resemble-enhance
AI powered speech denoising and enhancement
Language:Python1.3k 18 46134
xiph/LPCNet
Efficient neural speech synthesis
Language:C1.1k 72 197295
slaypni/fastdtw
A Python implementation of FastDTW
Language:Python785 17 35122
lmnt-com/diffwave
DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.
Language:Python758 21 47112
JeremyCCHsu/Python-Wrapper-for-World-Vocoder
A Python wrapper for the high-quality vocoder "World"
Language:Cython719 26 57121
mblondel/soft-dtw
Python implementation of soft-DTW.
Language:Python535 28 2698
anicolson/DeepXi
Deep Xi: A deep learning approach to a priori SNR estimation implemented in TensorFlow 2/Keras. For speech enhancement and robust ASR.
Language:MATLAB497 25 49127
ivanvovk/WaveGrad
Implementation of WaveGrad high-fidelity vocoder from Google Brain in PyTorch.
Language:Jupyter Notebook402 17 2655
modelscope/FunCodec
FunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis, music generation et.al.
Language:Python349 15 5230
wesbz/SoundStream
This repository is an implementation of this article: https://arxiv.org/pdf/2107.03312.pdf
Language:Python344 10 1652
yxlu-0102/MP-SENet
Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement
Language:Python291 5 4944
sp-uhh/storm
StoRM: A Diffusion-based Stochastic Regeneration Model for Speech Enhancement and Dereverberation
Language:Python172 11 2223
KentoNishi/torch-pitch-shift
Pitch-shift audio clips quickly with PyTorch (CUDA supported)! Additional utilities for searching efficient transformations are included.
Language:Python131 2 612
bfs18/rfwave
Language:Python95 4 48
TeamPyOgg/PyOgg
Simple OGG Vorbis, Opus and FLAC bindings for Python
Language:Python64 5 6527
yuguochencuc/BAE-Net
BAE-NET: A LOW COMPLEXITY AND HIGH FIDELITY BANDWIDTH-ADAPTIVE NEURAL NETWORK FOR SPEECH SUPER-RESOLUTION
Language:Python55 8 102
microsoft/NOTSOFAR1-Challenge
NOTSOFAR-1 Challenge: Distant Diarization and ASR
Language:Python42 14 98
caoruitju/RUI_SE
VOICOR: A Residual Iterative Voice Correction Framework for Monaural Speech Enhancement
Language:Python38 1 38
elevenlabs/opuspy
Opus codec support for Python.
Language:C++25 5 15
CaA23187/TriU-Net-module
PyTorch Implement of TriU-Net
Language:Python3 0 02