Uason-Chen
PhD candidate at the Institute of Automation, Chinese Academy of Sciences.
CASIABeijing, China
Uason-Chen's Stars
BradyFU/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
RUCAIBox/LLMSurvey
The official GitHub page for the survey paper "A Survey of Large Language Models".
TencentARC/SEED-Voken
SEED-Voken: A Series of Powerful Visual Tokenizers
niais/Awesome-Skeleton-based-Action-Recognition
Skeleton-based Action Recognition
QwenLM/Qwen2.5-Math
A series of math-specific large language models of our Qwen2 series.
dle666/R-CoT
Reverse Chain-of-Thought Problem Generation for Geometric Reasoning in Large Multimodal Models
yangli18/VLTVG
Improving Visual Grounding with Visual-Linguistic Verification and Iterative Reasoning, CVPR 2022
yangbang18/Non-Autoregressive-Video-Captioning
The PyTorch code of the AAAI2021 paper "Non-Autoregressive Coarse-to-Fine Video Captioning".
lucasjinreal/MLLM_Factory
A Dead Simple and Modularized Multi-Modal Training and Finetune Framework. Compatible to any LLaVA/Flamingo/QwenVL/MiniGemini etc series models.
Uason-Chen/SGP-JCA
The codebase for SGP-JCA