leexinhao

I am a MS student at Nanjing University. My research interests mainly lie in efficient video/image understanding and generation methods.

SenseTimeNanjing

Pinned Repositories

leexinhao.github.io
Language:Less0 1 00
lmms-eval
Accelerating the development of large multimodal models (LMMs) with lmms-eval
Language:Python0 0 00
mmaction2-next
OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
Language:Python0 0 00
VideoEval
A vision-centric evaluation method for video foundation models that is comprehensive, challenging, indicative, and low-cost.
Language:Python30
ZeroI2V
Official implementation of "ZeroI2V: Zero-Cost Adaptation of Pre-trained Transformers from Image to Video" (ECCV2024)
Language:Python19 5 70
LLaVA-NeXT
Language:Python3.2k 37 331278
VideoEval
VideoEval: Comprehensive Benchmark Suite for Low-Cost Evaluation of Video Foundation Model
Language:Python6 1 00
ZeroI2V
[ECCV 2024] ZeroI2V: Zero-Cost Adaptation of Pre-trained Transformers from Image to Video
Language:Python16 2 00
VILA
VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)
Language:Python2.2k 37 144184
InternVideo
[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
Language:Python1.5k 27 20091

leexinhao/ZeroI2V
Official implementation of "ZeroI2V: Zero-Cost Adaptation of Pre-trained Transformers from Image to Video" (ECCV2024)
Language:Python19 5 70
leexinhao/VideoEval
A vision-centric evaluation method for video foundation models that is comprehensive, challenging, indicative, and low-cost.
Language:Python30
leexinhao/leexinhao.github.io
Language:Less0 1 00
leexinhao/lmms-eval
Accelerating the development of large multimodal models (LMMs) with lmms-eval
Language:Python0 0 00
leexinhao/mmaction2-next
OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
Language:Python0 0 00