Pinned Repositories
LLaMA2-Accessory
An Open-source Toolkit for LLM Development
Lumina-mGPT
Official Implementation of "Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining"
HALC
[ICML 2024] Official implementation for "HALC: Object Hallucination Reduction via Adaptive Focal-Contrast Decoding"
LanguageBind
【ICLR 2024🔥】 Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment
OPERA
[CVPR 2024 Highlight] OPERA: Alleviating Hallucination in Multi-Modal Large Language Models via Over-Trust Penalty and Retrospection-Allocation
Causal-CoG
[CVPR'24 Highlight] Implementation of "Causal-CoG: A Causal-Effect Look at Context Generation for Boosting Multi-modal Language Models"
mergekit
Tools for merging pretrained large language models.
rl-tutorials
basic algorithms of reinforcement learning
VLMEvalKit
Open-source evaluation toolkit of large vision-language models (LVLMs), support GPT-4v, Gemini, QwenVLPlus, 30+ HF models, 15+ benchmarks
zhaoshitian.github.io
Shitian Zhao's Homepage
zhaoshitian's Repositories
zhaoshitian/Causal-CoG
[CVPR'24 Highlight] Implementation of "Causal-CoG: A Causal-Effect Look at Context Generation for Boosting Multi-modal Language Models"
zhaoshitian/rl-tutorials
basic algorithms of reinforcement learning
zhaoshitian/mergekit
Tools for merging pretrained large language models.
zhaoshitian/VLMEvalKit
Open-source evaluation toolkit of large vision-language models (LVLMs), support GPT-4v, Gemini, QwenVLPlus, 30+ HF models, 15+ benchmarks
zhaoshitian/zhaoshitian.github.io
Shitian Zhao's Homepage