Pinned Repositories
EffTrans_Fsdet
This is a repository for ACMMM22 paper "Exploring Effective Knowledge Transfer for Few-shot Object Detection"
HA-DPO-video
LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
MiniGPT-4
MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models
MLLM-DataEngine
MLLM-DataEngine: a novel closed-loop system that bridges data generation, model training, and evaluation.
mmflow
OpenMMLab optical flow toolbox and benchmark
pretrain_layout
ultralytics
NEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLite
VIGC
Visual Instruction Generation and Correction
visual-chatgpt
VisualChatGPT
JulioZhao97's Repositories
JulioZhao97/EffTrans_Fsdet
This is a repository for ACMMM22 paper "Exploring Effective Knowledge Transfer for Few-shot Object Detection"
JulioZhao97/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
JulioZhao97/MiniGPT-4
MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models
JulioZhao97/VIGC
Visual Instruction Generation and Correction
JulioZhao97/HA-DPO-video
JulioZhao97/MLLM-DataEngine
MLLM-DataEngine: a novel closed-loop system that bridges data generation, model training, and evaluation.
JulioZhao97/mmflow
OpenMMLab optical flow toolbox and benchmark
JulioZhao97/pretrain_layout
JulioZhao97/ultralytics
NEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLite
JulioZhao97/visual-chatgpt
VisualChatGPT