KIP1024's Stars
THUDM/CogVideo
Text-to-video and image-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
bliunlpr/Robust_e2e_gan
PyTorch implementation of "Jointly Adversarial Enhancement Training for Robust End-to-End Speech Recognition"
d2l-ai/d2l-zh
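"Dive into Deep Learning": written for Chinese readers, runnable, and open to discussion. The Chinese and English editions are used for teaching at more than 500 universities in over 70 countries.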
TowerYsable/ASR_awesome
Speech recognition: papers and cutting-edge research
TowerYsable/learning_review
Personal study notes
obgnail/typora_plugin
Typora plugin. Feature enhancement tool.
TheAlgorithms/Python
All Algorithms implemented in Python
halsay/MFCC_tutorial
MFCC implementation with detailed comments.
wenet-e2e/wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
wenet-e2e/wenet_in_action_homework
Homework for the WeNet hands-on course
modelscope/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing, etc.