zjr2000

Master's student at SUSTech, Shenzhen, China. My research focuses on computer vision, specifically the intersection of vision and language learning.

Southern University of Science and Technology, Shenzhen

Pinned Repositories

Caption-Anything
Caption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with diverse controls for user preferences. https://huggingface.co/spaces/TencentARC/Caption-Anything https://huggingface.co/spaces/VIPLab/Caption-Anything
Language: Python
1.7k 15 24105
Awesome-Multimodal-Chatbot
Awesome Multimodal Assistant is a curated list of multimodal chatbots/conversational assistants that utilize various modes of interaction, such as text, speech, images, and videos, to provide a seamless and versatile user experience.
74 4 27
Context-GEBC
Second-place solution to the Generic Event Boundary Captioning task in the LOVEU Challenge (CVPR 2022 workshop)
Language: Python
4 2 01
easyexcel
A fast, simple Java tool for processing Excel files that avoids OOM (out-of-memory) errors
Language: Java
0 0 00
GVL
Official implementation of the paper "Learning Grounded Vision-Language Representation for Versatile Understanding in Untrimmed Videos"
Language: Python
27 2 106
LLMVA-GEBC
Winning solution to the Generic Event Boundary Captioning task in the LOVEU Challenge (CVPR 2023 workshop)
Language: Python
29 1 42
projects
Language: JavaScript
0 1 00
REVERIE
[ECCV2024] Reflective Instruction Tuning: Mitigating Hallucinations in Large Vision-Language Models
Language: Python
14 2 00
Untrimmed-Video-Feature-Extractor
A simple and effective feature extractor for untrimmed videos
Language: Jupyter Notebook
13 2 01
zjr2000
0 1 00

zjr2000's Repositories

zjr2000/Awesome-Multimodal-Chatbot
Awesome Multimodal Assistant is a curated list of multimodal chatbots/conversational assistants that utilize various modes of interaction, such as text, speech, images, and videos, to provide a seamless and versatile user experience.
74 4 27
zjr2000/LLMVA-GEBC
Winning solution to the Generic Event Boundary Captioning task in the LOVEU Challenge (CVPR 2023 workshop)
Language: Python
29 1 42
zjr2000/GVL
Official implementation of the paper "Learning Grounded Vision-Language Representation for Versatile Understanding in Untrimmed Videos"
Language: Python
27 2 106
zjr2000/REVERIE
[ECCV2024] Reflective Instruction Tuning: Mitigating Hallucinations in Large Vision-Language Models
Language: Python
14 2 00
zjr2000/Untrimmed-Video-Feature-Extractor
A simple and effective feature extractor for untrimmed videos
Language: Jupyter Notebook
13 2 01
zjr2000/Context-GEBC
Second-place solution to the Generic Event Boundary Captioning task in the LOVEU Challenge (CVPR 2022 workshop)
Language: Python
4 2 01
zjr2000/easyexcel
A fast, simple Java tool for processing Excel files that avoids OOM (out-of-memory) errors
Language: Java
0 0 00
zjr2000/projects
Language: JavaScript
0 1 00
zjr2000/zjr2000
0 1 00
zjr2000/zjr2000.github.io
1 0
