Pinned Repositories
athena-signal
auditok
An audio/acoustic activity detection and audio segmentation tool
auraloss
Collection of audio-focused loss functions in PyTorch
awesome-generative-ai-guide
A one-stop repository for generative AI research updates, interview resources, notebooks, and much more!
awesome-speech-enhancement
Speech enhancement, speech separation, and sound source localization
ChatTTS
A generative speech model for daily dialogue.
CMGAN
Conformer-based Metric GAN for speech enhancement
orbslam2-with-LK-optical-flow
This project is modified from orbslam2. All dependencies are the same as those of orbslam2.
Python-for-Signal-Processing
Notebooks for "Python for Signal Processing" book
rnnoise
Recurrent neural network for audio noise reduction
Ziyi6's Repositories
Ziyi6/auditok
An audio/acoustic activity detection and audio segmentation tool
Ziyi6/auraloss
Collection of audio-focused loss functions in PyTorch
Ziyi6/awesome-generative-ai-guide
A one-stop repository for generative AI research updates, interview resources, notebooks, and much more!
Ziyi6/awesome-speech-enhancement
Speech enhancement, speech separation, and sound source localization
Ziyi6/ChatTTS
A generative speech model for daily dialogue.
Ziyi6/CMGAN
Conformer-based Metric GAN for speech enhancement
Ziyi6/icefall
Ziyi6/k2
FSA/FST algorithms, differentiable, with PyTorch compatibility.
Ziyi6/lhotse
Tools for handling speech data in machine learning projects.
Ziyi6/LMS-Filter
Ziyi6/DBT-Net
Audio demos for the paper "DBT-Net: Dual-branch federative magnitude and phase estimation with attention-in-attention transformer for monaural speech enhancement" (submitted to TASLP). The code will also be released soon.
Ziyi6/DCCRN-with-various-loss-functions
DCCRN with various loss functions
Ziyi6/denoiser
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020). We provide a PyT
Ziyi6/DNS-Challenge
This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.
Ziyi6/DPCRN_DNS3
Implementation of the paper "DPCRN: Dual-Path Convolution Recurrent Network for Single Channel Speech Enhancement"
Ziyi6/english-wordlists
Common English vocabulary lists
Ziyi6/FullSubNet
PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."
Ziyi6/generative-ai-for-beginners
18 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
Ziyi6/hifi-gan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
Ziyi6/kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
Ziyi6/melgan-neurips
GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis
Ziyi6/ParallelWaveGAN
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with PyTorch
Ziyi6/SF-Net
The implementation of "Optimizing Shoulder to Shoulder: A Coordinated Sub-Band Fusion Model for Real-Time Full-Band Speech Enhancement"
Ziyi6/Shenlan
Ziyi6/sherpa-ncnn
Real-time speech recognition using next-gen Kaldi with ncnn without Internet connection. Supports iOS, Android, Raspberry Pi, etc.
Ziyi6/silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
Ziyi6/Sixty-years-of-frequency-domain-monaural-speech-enhancement
Ziyi6/storm
StoRM: A Diffusion-based Stochastic Regeneration Model for Speech Enhancement and Dereverberation
Ziyi6/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
Ziyi6/you-get
⏬ Dumb downloader that scrapes the web