Pinned Repositories
4d_reconstruction
4DFM
4D Facial Expression Diffusion Model
ACF_GPU
GPU version of ACF pedestrian detection
Acoustic-feedback-detection
Implementation of an algorithm to detect acoustic feedback from a audio file
AEC
AEC-Challenge
AEC Challenge
books
技术书籍
DeepLearning-500-questions
深度学习500问,以问答形式对常用的概率知识、线性代数、机器学习、深度学习、计算机视觉等热点问题进行阐述,以帮助自己及有需要的读者。 全书分为18个章节,近30万字。由于水平有限,书中不妥之处恳请广大读者批评指正。 未完待续............ 如有意合作,联系scutjy2015@163.com 版权所有,违权必究 Tan 2018.06
opus
Modern audio compression for the internet.
Test
这是个测试项目
zhongshijun's Repositories
zhongshijun/APoT_Quantization
PyTorch implementation for the APoT quantization (ICLR 2020)
zhongshijun/Bayesian-Pitch-Tracking-Using-Harmonic-model
Robust bayesian pitch tracking based on the harmonic model
zhongshijun/beat-tracking-tcn
An implementation of Davies & Böck's beat-tracking temporal convolutional network
zhongshijun/book-text-to-speech
A book about Text-to-Speech (TTS) in Chinese.
zhongshijun/CDiffuSE
Conditional Diffusion Probabilistic Model for Speech Enhancement
zhongshijun/DBT-Net
The audio demos with respect to the paper "DBT-Net: Dual-branch federative magnitude and phase estimation with attention-in-attention transformer for monaural speech enhancement" are provided (submitted to TASLP). The code will also be released soon.
zhongshijun/deep-non-linear-filter
zhongshijun/demucs
Code for the paper Hybrid Spectrogram and Waveform Source Separation
zhongshijun/denoiser
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a causal speech enhancement model working on the raw waveform that runs in real-time on a laptop CPU. The proposed model is based on an encoder-decoder archi
zhongshijun/Design-Patterns-GoF
Working code for the Design Patterns book from the Gang of Four.
zhongshijun/Double-talk-Detection-aided-Residual-Echo-Suppression-via-Spectrogram-Masking-and-Refinement
Double-Talk Detection-Aided Residual Echo Suppression via Spectrogram Masking and Refinement
zhongshijun/dparn
zhongshijun/gemmlowp
Low-precision matrix multiplication
zhongshijun/HiFiplusplus-pytorch
HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement
zhongshijun/McNet
The official repo: "McNet: Fuse Multiple Cues for Multichannel Speech Enhancement" submitted to ICASSP 2023
zhongshijun/MetaAF
Control adaptive filters with neural networks.
zhongshijun/MTFAA-Net
Multi-Scale Temporal Frequency Convolutional Network With Axial Attention for Speech Enhancement
zhongshijun/NAS_VAD
zhongshijun/NeuralSpeech
zhongshijun/NKF-AEC
Acoustic Echo Cancellation with Nerual Kalman Filtering
zhongshijun/RAVDESS2Mix
zhongshijun/resonance-audio
Resonance Audio Source Code
zhongshijun/RIR-Generator
Generating room impulse responses C++
zhongshijun/Sight-Singing-Vocal-Data
zhongshijun/sonic
Simple library to speed up or slow down speech
zhongshijun/speakerbeam
zhongshijun/SR_for_InRoom_Comm
Speech Reinforcement for In-Room Communications
zhongshijun/tacotron2
Tacotron 2 - PyTorch implementation with faster-than-realtime inference
zhongshijun/TONet
The official implementation of "TONet: Tone-Octave Network for Singing Melody Extraction from Polyphonic Music"
zhongshijun/webrtc-resampler
resampler module from WebRTC