SJTU Cross Media Language Intelligence Lab
Advancing research on intelligent speech and language processing for human-machine interaction, and developing effective algorithms for real-world applications.
Pinned Repositories
AniTalker
[ACM MM 2024] This is the official code for "AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion Encoding"
Mobile-Env
A Universal Platform for Training and Evaluation of Mobile Interaction
SLAM-LLM
Speech, Language, Audio, Music Processing with Large Language Model
StoryTTS
[ICASSP 2024] StoryTTS: A Highly Expressive Text-to-Speech Dataset with Rich Textual Expressiveness Annotations
text2sql-lgesql
[ACL 2021] Source code and pre-trained models for the ACL 2021 long paper "LGESQL: Line Graph Enhanced Text-to-SQL Model with Mixed Local and Non-Local Relations".
UniCATS-CTX-txt2vec
[AAAI 2024] CTX-txt2vec, the acoustic model in UniCATS
UniCATS-CTX-vec2wav
[AAAI 2024] Code for CTX-vec2wav in UniCATS
VoiceFlow-TTS
[ICASSP 2024] This is the official code for "VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching"
WebSRC
[EMNLP 2021] WebSRC: A dataset for web-based structural machine reading comprehension.
Xmart
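Repository for the Xmart Youth Forum, hosting video replays and lecture notes from past student forums and frontier lectures. For the latest Xmart announcements, follow the WeChat official account 【XLANCE Lab】.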
SJTU Cross Media Language Intelligence Lab's Repositories
X-LANCE/ai-deadlines
⏰ AI conference deadline countdowns
X-LANCE/python-guide
Python best practices guidebook, written for Humans.
X-LANCE/codeprinter
Print out code easily
X-LANCE/speechlab-sjtu.github.io
Home page for speechlab.