pipixin321's Stars
OpenGVLab/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model approaching GPT-4o's performance
OpenGVLab/Ask-Anything
[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.
LLaVA-VL/LLaVA-NeXT
InternLM/InternLM-XComposer
InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output
QwenLM/Qwen2-VL
Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
X-PLUG/mPLUG-Owl
mPLUG-Owl: The Powerful Multi-modal Large Language Model Family
facebookresearch/chameleon
Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.
VITA-MLLM/VITA
✨✨VITA: Towards Open-Source Interactive Omni Multimodal LLM
DAMO-NLP-SG/VideoLLaMA2
VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs
GAIR-NLP/anole
Anole: An Open, Autoregressive, and Native Multimodal Model for Interleaved Image-Text Generation
magic-research/PLLaVA
Official repository for the paper PLLaVA
NVlabs/EAGLE
EAGLE: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders
showlab/Awesome-MLLM-Hallucination
📖 A curated list of resources dedicated to hallucination of multimodal large language models (MLLM).
RunpeiDong/DreamLLM
[ICLR 2024 Spotlight] DreamLLM: Synergistic Multimodal Comprehension and Creation
EvolvingLMMs-Lab/LongVA
Long Context Transfer from Language to Vision
OpenGVLab/MM-Interleaved
MM-Interleaved: Interleaved Image-Text Generative Modeling via Multi-modal Feature Synchronizer
showlab/videollm-online
VideoLLM-online: Online Video Large Language Model for Streaming Video (CVPR 2024)
TIGER-AI-Lab/Mantis
Official code for Paper "Mantis: Multi-Image Instruction Tuning"
JUNJIE99/MLVU
🔥🔥MLVU: Multi-task Long Video Understanding Benchmark
AI-Study-Han/Zero-Chatgpt
Reproducing the ChatGPT technical pipeline from scratch.
RifleZhang/LLaVA-Hound-DPO
42Shawn/LLaVA-PruMerge
LLaVA-PruMerge: Adaptive Token Reduction for Efficient Large Multimodal Models
scenarios/WeMM
ChartMimic/ChartMimic
ChartMimic: Evaluating LMM’s Cross-Modal Reasoning Capability via Chart-to-Code Generation
pipixin321/HolmesVAD
Official implementation of "Holmes-VAD: Towards Unbiased and Explainable Video Anomaly Detection via Multi-modal LLM"
whwu95/FreeVA
FreeVA: Offline MLLM as Training-Free Video Assistant
Tencent/AnomalyDetection_Real-IAD
AI-Study-Han/Zero-Qwen-VL
Training a LLaVA model with better Chinese support, with open-sourced training code and data.
rohit901/VANE-Bench
Contains code and documentation for our VANE-Bench paper.
syp2ysy/Arcana