Pinned Repositories
3D-Sound-Localization
Quaternion Neural Networks for 3D Sound Source Localization in Reverberant Environments.
abcjs
javascript for rendering abc music notation
audio_video_streaming
音视频流媒体权威资料整理,500+份文章,论文,视频,实践项目,协议,业界大神名单。
ConferencingSpeech2022
Non-intrusive Objective Speech Quality Assessment (NISQA) Challenge in Online Conferencing Applications
deep-learning-models
Keras code and weights files for popular deep learning models.
DNN-for-speech-enhancement
DNN-for-speech-enhancement
Echo-Debar-AEC
Submission to the 1st Acoustic Echo Cancellation Challenge, Microsoft and ICASSP 2021.
graph-neural-networks
Library to implement graph neural networks in PyTorch
ISCLP-KF
Integrated sidelobe cancellation and linear prediction Kalman filter for joint multi-microphone speech dereverberation, interfering speech cancellation, and noise reduction.
MetaAF
Control adaptive filters with neural networks.
runngezhang's Repositories
runngezhang/aero
This repo contains the official PyTorch implementation of "Audio Super Resolution in the Spectral Domain" (ICASSP 2023)
runngezhang/AP-BWE
Towards Efficient and High-Quality Bandwidth Extension with Parallel Amplitude-Phase Prediction
runngezhang/APNet2
Source code of APNet2, a vocoder
runngezhang/audio-transformers-course
The Hugging Face Course on Transformers for Audio
runngezhang/audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
runngezhang/audiowmark
Audio Watermarking
runngezhang/Awesome-state-space-models
Collection of papers on state-space models
runngezhang/CRUSE
a lightweight network for monaural speech enhancement
runngezhang/datasets_musicDetect
TFDS is a collection of datasets ready to use with TensorFlow, Jax, ...
runngezhang/deep-non-linear-filter
runngezhang/DeepComplexCRN
runngezhang/docs-l10n
Translations of TensorFlow documentation
runngezhang/espnet
End-to-End Speech Processing Toolkit
runngezhang/faiss
A library for efficient similarity search and clustering of dense vectors.
runngezhang/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
runngezhang/flash-attention
Fast and memory-efficient exact attention
runngezhang/jax
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
runngezhang/LVM
runngezhang/MP-SENet
MP-SENet: A Speech Enhancement Model with Parallel Denoising of Magnitude and Phase Spectra
runngezhang/SALMONN
SALMONN: Speech Audio Language Music Open Neural Network
runngezhang/scikit-image
Image Processing SciKit (Toolbox for SciPy)
runngezhang/seanet
runngezhang/so-vits-svc
SoftVC VITS Singing Voice Conversion
runngezhang/speechbrain
A PyTorch-based Speech Toolkit
runngezhang/storm
StoRM: A Diffusion-based Stochastic Regeneration Model for Speech Enhancement and Dereverberation
runngezhang/TCN
Sequence modeling benchmarks and temporal convolutional networks
runngezhang/tianya-docs
精心收集的天涯神贴,不带水印,方便阅读
runngezhang/Umi-OCR
OCR software, free and offline. 开源、免费的离线OCR软件。支持截屏/粘贴/批量导入图片,段落排版/排除水印,扫描/生成二维码。内置多国语言库。
runngezhang/voice-changer
リアルタイムボイスチェンジャー Realtime Voice Changer
runngezhang/voicefixer
General Speech Restoration