rikeilong
Graduate student. Research areas: Action Recognition, Multimodal Large Language Models.
Great Bay University, Dongguan, China
Pinned Repositories
flash-attention
Fast and memory-efficient exact attention
Bay-CAT
[ECCV’24] Official Implementation for CAT: Enhancing Multimodal Large Language Model to Answer Questions in Dynamic Audio-Visual Scenarios
MCD-forAVQA
Official Implementation for Answering Diverse Questions via Text Attached with Key Audio-Visual Clues
Ppromo-IAR
Official Implementation for Pose-promote: Progressive Visual Perception for Indoor Action Recognition
rikeilong's Repositories
rikeilong/Bay-CAT
[ECCV’24] Official Implementation for CAT: Enhancing Multimodal Large Language Model to Answer Questions in Dynamic Audio-Visual Scenarios
rikeilong/MCD-forAVQA
Official Implementation for Answering Diverse Questions via Text Attached with Key Audio-Visual Clues
rikeilong/Ppromo-IAR
Official Implementation for Pose-promote: Progressive Visual Perception for Indoor Action Recognition
rikeilong/rikeilong