Pinned Repositories
LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
BLIP
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Adversarial_Ali
CogVLM
A state-of-the-art-level open visual language model | Multimodal pre-trained model
uestcMeng's Repositories
uestcMeng doesn’t have any repositories yet.