luxiaolululu's Stars
locuslab/TCN
Sequence modeling benchmarks and temporal convolutional networks
RVC-Boss/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
DataTalksClub/data-engineering-zoomcamp
Free Data Engineering course!
facebookresearch/AudioDec
An Open-source Streaming High-fidelity Neural Audio Codec
yangdongchao/UniAudio
The Open Source Code of UniAudio
Audio-WestlakeU/NBSS
The official repo of NBC & SpatialNet for multichannel speech separation, denoising, and dereverberation
Okrio/tinyrecurrentunet
Real-Time De-noising and De-reverbing with Tiny Recurrent UNet
funcwj/setk
Tools for Speech Enhancement integrated with Kaldi
Audio-WestlakeU/FullSubNet
PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."
dundee/gdu
Fast disk usage analyzer with console interface written in Go
jianchang512/clone-voice
A sound cloning tool with a web interface, using your voice or any sound to record audio / 一个带web界面的声音克隆工具,使用你的音色或任意声音来录制音频
typst/typst
A new markup-based typesetting system that is powerful and easy to learn.
pleomax0730/Dereverb_MetricGAN-U
jdonley/Speech-Dereverberation-and-RIR-Estimation
NVIDIA/CleanUNet
Official PyTorch Implementation of CleanUNet (ICASSP 2022)
speechbrain/speechbrain
A PyTorch-based Speech Toolkit
apple/ml-nvas3d
DavidDiazGuerra/gpuRIR
Python library for Room Impulse Response (RIR) simulation with GPU acceleration
Audio-WestlakeU/RVAE-EM
Official PyTorch implementation of "RVAE-EM: Generative speech dereverberation based on recurrent variational auto-encoder and convolutive transfer function" [ICASSP2024]
kaixindelele/ChatPaper
Use ChatGPT to summarize the arXiv papers. 全流程加速科研,利用chatgpt进行论文全文总结+专业翻译+润色+审稿+审稿回复
Rudrabha/Wav2Lip
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs
Rudrabha/Lip2Wav
This is the repository containing codes for our CVPR, 2020 paper titled "Learning Individual Speaking Styles for Accurate Lip to Speech Synthesis"
kingyiusuen/image-to-latex
Convert images of LaTex math equations into LaTex code.
kan-bayashi/ParallelWaveGAN
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch
haoheliu/voicefixer
General Speech Restoration
adobe-research/MetaAF
Control adaptive filters with neural networks.
nay0648/unified2021
A UNIFIED SPEECH ENHANCEMENT FRONT-END FOR ONLINE DEREVERBERATION, ACOUSTIC ECHO CANCELLATION, AND SOURCE SEPARATION
fgnt/nara_wpe
Different implementations of "Weighted Prediction Error" for speech dereverberation
sp-uhh/storm
StoRM: A Diffusion-based Stochastic Regeneration Model for Speech Enhancement and Dereverberation
sp-uhh/sgmse
Score-based Generative Models (Diffusion Models) for Speech Enhancement and Dereverberation