Pinned Repositories
AEC-Challenge
AEC Challenge
asteroid_c
bark
🔊 Text-Prompted Generative Audio Model
code
complexPyTorch
A high-level toolbox for using complex valued neural networks in PyTorch
Cross-Lingual-Voice-Cloning
Tacotron 2 - PyTorch implementation with faster-than-realtime inference modified to enable cross lingual voice cloning.
Datadriven-GPVAD
The codebase for Data-driven general-purpose voice activity detection.
DeepComplexUNetPyTorch
Implementation of Deep Complex UNet Using PyTorch
DeepFilterNet
Noise suppression using deep filtering
distant_speech_recognition
Companion code for the DSR book: subband beamforming, post-filtering. Spatial signal processing toolkit, a.k.a. beamforming toolkit 2.0 (BTK2.0)
ouleiwa's Repositories
ouleiwa/asteroid_c
ouleiwa/bark
🔊 Text-Prompted Generative Audio Model
ouleiwa/code
ouleiwa/complexPyTorch
A high-level toolbox for using complex valued neural networks in PyTorch
ouleiwa/Cross-Lingual-Voice-Cloning
Tacotron 2 - PyTorch implementation with faster-than-realtime inference modified to enable cross lingual voice cloning.
ouleiwa/Datadriven-GPVAD
The codebase for Data-driven general-purpose voice activity detection.
ouleiwa/DeepFilterNet
Noise suppression using deep filtering
ouleiwa/eeee
111
ouleiwa/eng-practices
Google Engineering Practices documentation - https://jimmysong.io/eng-practices
ouleiwa/F8Net
[ICLR 2022 Oral] F8Net: Fixed-Point 8-bit Only Multiplication for Network Quantization
ouleiwa/hello-world
A simple code example.
ouleiwa/highway
Performance-portable, length-agnostic SIMD with runtime dispatch
ouleiwa/joint_AEC_BSE
ouleiwa/KWS_pytorch
Keyword spotting and speech wake-up in PyTorch: DNN, CNN, TDNN, DFSMN, LSTM
ouleiwa/machinelearning
My blogs and code for machine learning. http://cnblogs.com/pinard
ouleiwa/MTFAA-Net
Multi-Scale Temporal Frequency Convolutional Network With Axial Attention for Speech Enhancement
ouleiwa/music-separator
atan2 for ONNX
ouleiwa/nara_wpe
Different implementations of "Weighted Prediction Error" for speech dereverberation
ouleiwa/NKF-AEC
Acoustic Echo Cancellation with Neural Kalman Filtering
ouleiwa/PercepNet
Unofficial implementation of PercepNet: A Perceptually-Motivated Approach for Low-Complexity, Real-Time Enhancement of Fullband Speech
ouleiwa/SDDNet
Coarse implementation of the paper "A Simultaneous Denoising and Dereverberation Framework with Target Decoupling". On the DNS-2020 dataset, the DNSMOS is 3.42 for the first stage and 3.47 for the second stage.
ouleiwa/silero-models
Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple
ouleiwa/silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector, Language Classifier and Spoken Number Detector
ouleiwa/SKIP-DPCRN
ouleiwa/storm
StoRM: A Diffusion-based Stochastic Regeneration Model for Speech Enhancement and Dereverberation
ouleiwa/test
test msysgit
ouleiwa/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
ouleiwa/tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
ouleiwa/Uformer
Uformer: A Unet based dilated complex & real dual-path conformer network for simultaneous speech enhancement and dereverberation
ouleiwa/w2v2_audioFrameClassification
wav2vec2 audio classification for prosodic boundary detection and other tasks