Uason-Chen
PhD candidate at the Institute of Automation, Chinese Academy of Sciences.
CASIABeijing, China
Pinned Repositories
T-MASS-text-video-retrieval
Official implementation of "Text Is MASS: Modeling as Stochastic Embedding for Text-Video Retrieval (CVPR 2024 Highlight)"
Table-LLaVA
Dataset and Code for our ACL 2024 paper: "Multimodal Table Understanding". We propose the first large-scale Multimodal IFT and Pre-Train Dataset for table understanding and develop a generalist tabular MLLM named Table-LLaVA.
apex
Awesome-Skeleton-based-Action-Recognition
Skeleton-based Action Recognition
CTR-GCN
[ICCV2021] Official code for "Channel-wise Topology Refinement Graph Convolution for Skeleton-Based Action Recognition"
Non-Autoregressive-Video-Captioning
The PyTorch code of the AAAI2021 paper "Non-Autoregressive Coarse-to-Fine Video Captioning".
SGP-JCA
The codebase for SGP-JCA
SlowFast
PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.
SlowFast_git
edit slowfast
VLTVG
Improving Visual Grounding with Visual-Linguistic Verification and Iterative Reasoning, CVPR 2022
Uason-Chen's Repositories
Uason-Chen/CTR-GCN
[ICCV2021] Official code for "Channel-wise Topology Refinement Graph Convolution for Skeleton-Based Action Recognition"
Uason-Chen/SGP-JCA
The codebase for SGP-JCA
Uason-Chen/Awesome-Skeleton-based-Action-Recognition
Skeleton-based Action Recognition
Uason-Chen/apex
Uason-Chen/Non-Autoregressive-Video-Captioning
The PyTorch code of the AAAI2021 paper "Non-Autoregressive Coarse-to-Fine Video Captioning".
Uason-Chen/SlowFast
PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.
Uason-Chen/SlowFast_git
edit slowfast
Uason-Chen/VLTVG
Improving Visual Grounding with Visual-Linguistic Verification and Iterative Reasoning, CVPR 2022