KIP1024

KIP1024's Stars

THUDM/CogVideo
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
Language:Python8.6k821
bliunlpr/Robust_e2e_gan
PyTorch implementation of "Jointly Adversarial Enhancement Training for Robust End-to-End Speech Recognition"
Language:Python195
d2l-ai/d2l-zh
《动手学深度学习》：面向中文读者、能运行、可讨论。中英文版被70多个国家的500多所大学用于教学。
Language:Python63.4k11k
TowerYsable/ASR_awesome
语音识别论文前沿
436
TowerYsable/learning_review
个人学习笔记
Language:HTML72
obgnail/typora_plugin
Typora plugin. Feature enhancement tool | Typora 插件，功能增强工具
Language:JavaScript1.8k85
TheAlgorithms/Python
All Algorithms implemented in Python
Language:Python194k45.6k
halsay/MFCC_tutorial
MFCC implementation with detailed comments.
Language:Jupyter Notebook163
wenet-e2e/wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
Language:Python4.2k1.1k
wenet-e2e/wenet_in_action_homework
WeNet 实战课程作业
Language:Python164
modelscope/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Language:Python6.9k726