wangshuo182's Stars
pyenv/pyenv
Simple Python version management
yuliskov/SmartTube
SmartTube - an advanced player for set-top boxes and TVs running Android OS
audacity/audacity
Audio Editor
Rudrabha/Wav2Lip
This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For an HD commercial model, please try out Sync Labs
AIGC-Audio/AudioGPT
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
jaywalnut310/vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
spotify/pedalboard
🎛 🔊 A Python library for audio.
mattdiamond/Recorderjs
A plugin for recording/exporting the output of Web Audio API nodes
xiph/opus
Modern audio compression for the internet.
timsainb/noisereduce
Noise reduction in Python using spectral gating (speech, bioacoustics, audio, time-domain signals)
resemble-ai/resemble-enhance
AI powered speech denoising and enhancement
xiph/LPCNet
Efficient neural speech synthesis
slaypni/fastdtw
A Python implementation of FastDTW
lmnt-com/diffwave
DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.
JeremyCCHsu/Python-Wrapper-for-World-Vocoder
A Python wrapper for the high-quality vocoder "World"
mblondel/soft-dtw
Python implementation of soft-DTW.
anicolson/DeepXi
Deep Xi: A deep learning approach to a priori SNR estimation implemented in TensorFlow 2/Keras. For speech enhancement and robust ASR.
ivanvovk/WaveGrad
Implementation of WaveGrad high-fidelity vocoder from Google Brain in PyTorch.
modelscope/FunCodec
FunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis, music generation, etc.
wesbz/SoundStream
This repository is an implementation of this article: https://arxiv.org/pdf/2107.03312.pdf
yxlu-0102/MP-SENet
Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement
sp-uhh/storm
StoRM: A Diffusion-based Stochastic Regeneration Model for Speech Enhancement and Dereverberation
KentoNishi/torch-pitch-shift
Pitch-shift audio clips quickly with PyTorch (CUDA supported)! Additional utilities for searching efficient transformations are included.
bfs18/rfwave
TeamPyOgg/PyOgg
Simple OGG Vorbis, Opus and FLAC bindings for Python
yuguochencuc/BAE-Net
BAE-Net: A Low Complexity and High Fidelity Bandwidth-Adaptive Neural Network for Speech Super-Resolution
microsoft/NOTSOFAR1-Challenge
NOTSOFAR-1 Challenge: Distant Diarization and ASR
caoruitju/RUI_SE
VOICOR: A Residual Iterative Voice Correction Framework for Monaural Speech Enhancement
elevenlabs/opuspy
Opus codec support for Python.
CaA23187/TriU-Net-module
PyTorch implementation of TriU-Net