Pinned Repositories
InternLM-XComposer
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions
assets
CascadeMatch
ContextDET
Contextual Object Detection with Multimodal Large Language Models
FASA
Code repo for "FASA: Feature Augmentation and Sampling Adaptation for Long-Tailed Instance Segmentation" (ICCV 2021)
InternLM-XComposer
InternLM-XComposer2 is a groundbreaking vision-language large model (VLLM) excelling in free-form text-image composition and comprehension.
on-device-dg
On-Device Domain Generalization
OV-DETR
[Under preparation] Code repo for "Open-Vocabulary DETR with Conditional Matching" (ECCV 2022)
UPT
yuhangzang.github.io
yuhangzang's Repositories
yuhangzang/OV-DETR
[Under preparation] Code repo for "Open-Vocabulary DETR with Conditional Matching" (ECCV 2022)
yuhangzang/ContextDET
Contextual Object Detection with Multimodal Large Language Models
yuhangzang/UPT
yuhangzang/FASA
Code repo for "FASA: Feature Augmentation and Sampling Adaptation for Long-Tailed Instance Segmentation" (ICCV 2021)
yuhangzang/CascadeMatch
yuhangzang/yuhangzang.github.io
yuhangzang/assets
yuhangzang/InternLM-XComposer
InternLM-XComposer2 is a groundbreaking vision-language large model (VLLM) excelling in free-form text-image composition and comprehension.
yuhangzang/on-device-dg
On-Device Domain Generalization
yuhangzang/yuhangzang