yuhangzang

Shanghai AI Laboratory, Shanghai

Pinned Repositories

InternLM-XComposer
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions
Language: Python 2.7k 43 400159
assets
Language: Python 0 1 00
CascadeMatch
1 2 20
ContextDET
Contextual Object Detection with Multimodal Large Language Models
Language: Python 210 14 95
FASA
Code repo for "FASA: Feature Augmentation and Sampling Adaptation for Long-Tailed Instance Segmentation" (ICCV 2021)
Language: Python 29 1 30
InternLM-XComposer
InternLM-XComposer2 is a groundbreaking vision-language large model (VLLM) excelling in free-form text-image composition and comprehension.
Language: Python 0 0 00
on-device-dg
On-Device Domain Generalization
Language: Python 0 0 00
OV-DETR
[Under preparation] Code repo for "Open-Vocabulary DETR with Conditional Matching" (ECCV 2022)
Language: Python 213 5 2622
UPT
56 8 101
yuhangzang.github.io
Language: HTML 1 1 00

yuhangzang's Repositories

yuhangzang/OV-DETR
[Under preparation] Code repo for "Open-Vocabulary DETR with Conditional Matching" (ECCV 2022)
Language: Python 213 5 2622
yuhangzang/ContextDET
Contextual Object Detection with Multimodal Large Language Models
Language: Python 210 14 95
yuhangzang/UPT
56 8 101
yuhangzang/FASA
Code repo for "FASA: Feature Augmentation and Sampling Adaptation for Long-Tailed Instance Segmentation" (ICCV 2021)
Language: Python 29 1 30
yuhangzang/CascadeMatch
1 2 20
yuhangzang/yuhangzang.github.io
Language: HTML 1 1 00
yuhangzang/assets
Language: Python 0 1 00
yuhangzang/InternLM-XComposer
InternLM-XComposer2 is a groundbreaking vision-language large model (VLLM) excelling in free-form text-image composition and comprehension.
Language: Python 0 0 00
yuhangzang/on-device-dg
On-Device Domain Generalization
Language: Python 0 0 00
yuhangzang/yuhangzang
0 1 00