runngezhang

Pinned Repositories

3D-Sound-Localization
Quaternion Neural Networks for 3D Sound Source Localization in Reverberant Environments.
Language: Python
1 0 00
abcjs
JavaScript for rendering ABC music notation
Language: HTML
1 0 00
audio_video_streaming
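A curated collection of authoritative audio/video streaming resources: 500+ articles, papers, videos, hands-on projects, protocols, and a list of industry experts.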
1 0 00
ConferencingSpeech2022
Non-intrusive Objective Speech Quality Assessment (NISQA) Challenge in Online Conferencing Applications
Language: Python
3 0 04
deep-learning-models
Keras code and weights files for popular deep learning models.
Language: Python
1 2 00
DNN-for-speech-enhancement
DNN-for-speech-enhancement
Language: C++
1 1 00
Echo-Debar-AEC
Submission to the 1st Acoustic Echo Cancellation Challenge, Microsoft and ICASSP 2021.
Language: MATLAB
1 0 00
graph-neural-networks
Library to implement graph neural networks in PyTorch
Language: Python
1 0 00
ISCLP-KF
Integrated sidelobe cancellation and linear prediction Kalman filter for joint multi-microphone speech dereverberation, interfering speech cancellation, and noise reduction.
Language: MATLAB
1 0 01
MetaAF
Control adaptive filters with neural networks.
Language: Python
1 0 00

runngezhang's Repositories

runngezhang/aero
This repo contains the official PyTorch implementation of "Audio Super Resolution in the Spectral Domain" (ICASSP 2023)
Language: Python
0 0
runngezhang/AP-BWE
Towards Efficient and High-Quality Bandwidth Extension with Parallel Amplitude-Phase Prediction
0 0
runngezhang/APNet2
Source code of APNet2, a vocoder
Language: Python
0 0
runngezhang/audio-transformers-course
The Hugging Face Course on Transformers for Audio
Language: MDX
0 0
runngezhang/audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
Language: Python
0 0
runngezhang/audiowmark
Audio Watermarking
Language: C++
0 0
runngezhang/Awesome-state-space-models
Collection of papers on state-space models
0 0
runngezhang/CRUSE
A lightweight network for monaural speech enhancement
Language: Python
0 0
runngezhang/datasets_musicDetect
TFDS is a collection of datasets ready to use with TensorFlow, Jax, ...
Language: Python
0 0
runngezhang/deep-non-linear-filter
Language: Python
0 0
runngezhang/DeepComplexCRN
Language: HTML
0 0
runngezhang/docs-l10n
Translations of TensorFlow documentation
Language: Jupyter Notebook
0 0
runngezhang/espnet
End-to-End Speech Processing Toolkit
Language: Python
0 0
runngezhang/faiss
A library for efficient similarity search and clustering of dense vectors.
Language: C++
0 0
runngezhang/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Language: Python
0 0
runngezhang/flash-attention
Fast and memory-efficient exact attention
Language: Python
0 0
runngezhang/jax
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
Language: Python
0 0
runngezhang/LVM
0 0
runngezhang/MP-SENet
MP-SENet: A Speech Enhancement Model with Parallel Denoising of Magnitude and Phase Spectra
runngezhang/SALMONN
SALMONN: Speech Audio Language Music Open Neural Network
Language: Python
0 0
runngezhang/scikit-image
Image Processing SciKit (Toolbox for SciPy)
Language: Python
0 0
runngezhang/seanet
Language: HTML
0 0
runngezhang/so-vits-svc
SoftVC VITS Singing Voice Conversion
Language: Python
0 0
runngezhang/speechbrain
A PyTorch-based Speech Toolkit
Language: Python
runngezhang/storm
StoRM: A Diffusion-based Stochastic Regeneration Model for Speech Enhancement and Dereverberation
Language: Python
0 0
runngezhang/TCN
Sequence modeling benchmarks and temporal convolutional networks
Language: Python
1 0
runngezhang/tianya-docs
A carefully curated collection of classic Tianya forum posts, without watermarks, for easy reading.
0 0
runngezhang/Umi-OCR
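Free, open-source, offline OCR software. Supports screenshot, paste, and batch image import; paragraph layout and watermark exclusion; QR code scanning and generation. Ships with built-in multi-language libraries.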
runngezhang/voice-changer
Realtime Voice Changer
Language: Python
0 0
runngezhang/voicefixer
General Speech Restoration
Language: Python
0 0