Pinned Repositories
acoustic-interference-cancellation
acoustic interference (echo) cancellation project in summer internship
AI-Expert-Roadmap
Roadmap to becoming an Artificial Intelligence Expert in 2021
annotated_deep_learning_paper_implementations
🧑🏫 50! Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
ant-design
An enterprise-class UI design language and React UI library
ASC_baseline
athena-signal
audio
Data manipulation and transformation for audio signal processing, powered by PyTorch
VAD
Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.
webrtc-beamforming
整理出来的webrtc波束模块
webrtc_agc2
demo for webrtc agc2
ctwgL's Repositories
ctwgL/webrtc-beamforming
整理出来的webrtc波束模块
ctwgL/webrtc_agc2
demo for webrtc agc2
ctwgL/AI-Expert-Roadmap
Roadmap to becoming an Artificial Intelligence Expert in 2021
ctwgL/annotated_deep_learning_paper_implementations
🧑🏫 50! Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
ctwgL/ant-design
An enterprise-class UI design language and React UI library
ctwgL/athena-signal
ctwgL/awesome-audio-visual
A curated list of different papers and datasets in various areas of audio-visual processing
ctwgL/awesome-speech-recognition-speech-synthesis-papers
Speech synthesis, voice conversion, self-supervised learning, music generation,Automatic Speech Recognition, Speaker Verification, Speech Synthesis, Language Modeling
ctwgL/d2l-mindspore
《动手学深度学习》的MindSpore实现。供MindSpore学习者配合李沐老师课程使用。
ctwgL/EmoSphere-TTS
The official implementation of EmoSphere-TTS
ctwgL/espnet
End-to-End Speech Processing Toolkit
ctwgL/evalml
EvalML is an AutoML library written in python.
ctwgL/F5-TTS
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
ctwgL/FAcodec
Training code for FAcodec presented in NaturalSpeech3
ctwgL/FNet-pytorch
Unofficial implementation of Google's FNet: Mixing Tokens with Fourier Transforms
ctwgL/free-programming-books-zh_CN
:books: 免费的计算机编程类中文书籍,欢迎投稿
ctwgL/FullSubNet
PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."
ctwgL/generative-ai-for-beginners
12 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
ctwgL/NeMo
NeMo: a toolkit for conversational AI
ctwgL/papyrus
ctwgL/PHASEN
ctwgL/pytorch-template
PyTorch deep learning projects made easy.
ctwgL/setk
Tools for Speech Enhancement integrated with Kaldi
ctwgL/SpeechAlgorithms
Speech Algorithms Collections
ctwgL/speechbrain
A PyTorch-based Speech Toolkit
ctwgL/tinyrecurrentunet
Real-Time De-noising and De-reverbing with Tiny Recurrent UNet
ctwgL/unified2021
A UNIFIED SPEECH ENHANCEMENT FRONT-END FOR ONLINE DEREVERBERATION, ACOUSTIC ECHO CANCELLATION, AND SOURCE SEPARATION
ctwgL/VoiceFlow-TTS
[ICASSP 2024] This is the official code for "VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching"
ctwgL/WavTokenizer
SOTA discrete acoustic codec models with 40 tokens per second for audio language modeling
ctwgL/webrtc_AEC3