Pinned Repositories
DocLayout-YOLO
DocLayout-YOLO: An effecient and robust Document Layout Analysis method
EffTrans_Fsdet
This is a repository for ACMMM22 paper "Exploring Effective Knowledge Transfer for Few-shot Object Detection"
GOT_vllm-formula
HA-DPO-video
ICCV25_1968
JulioZhao97
LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
MiniGPT-4
MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models
MLLM-DataEngine
MLLM-DataEngine: a novel closed-loop system that bridges data generation, model training, and evaluation.
VIGC
Visual Instruction Generation and Correction
JulioZhao97's Repositories
JulioZhao97/EffTrans_Fsdet
This is a repository for ACMMM22 paper "Exploring Effective Knowledge Transfer for Few-shot Object Detection"
JulioZhao97/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
JulioZhao97/MiniGPT-4
MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models
JulioZhao97/VIGC
Visual Instruction Generation and Correction
JulioZhao97/DocLayout-YOLO
DocLayout-YOLO: An effecient and robust Document Layout Analysis method
JulioZhao97/HA-DPO-video
JulioZhao97/JulioZhao97
JulioZhao97/MLLM-DataEngine
MLLM-DataEngine: a novel closed-loop system that bridges data generation, model training, and evaluation.
JulioZhao97/mmflow
OpenMMLab optical flow toolbox and benchmark
JulioZhao97/ultralytics
NEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLite
JulioZhao97/visual-chatgpt
VisualChatGPT