wusize

PhD student @ NTU

NTU, Singapore

Pinned Repositories

CAT-Seg
Official Implementation of "CAT-Seg🐱: Cost Aggregation for Open-Vocabulary Semantic Segmentation"
Language:Python00
CLIM
[AAAI2024] Code Release of CLIM: Contrastive Language-Image Mosaic for Region Representation
Language:Python24 1 23
CLIP
Language:Jupyter Notebook0 1 00
CLIPSelf
[ICLR2024 Spotlight] Code Release of CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction
Language:Python145 6 247
colorization
This is the code of the colorization project of the National Innovation Program.
Language:Python1 1 00
DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Language:Python0 0 00
F-LMM
Code Release of F-LMM: Grounding Frozen Large Multimodal Models
Language:Python60
multiview_pose
[ICCV2021] Code Release of Graph-Based 3D Multi-Person Pose Estimation Using Multi-View Images
Language:Python42 4 47
ovdet
[CVPR2023] Code Release of Aligning Bag of Regions for Open-Vocabulary Object Detection
Language:Python167 7 434
wusize.github.io
GitHub Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
Language:JavaScript2 0 01

wusize's Repositories

wusize/ovdet
[CVPR2023] Code Release of Aligning Bag of Regions for Open-Vocabulary Object Detection
Language:Python167 7 434
wusize/CLIPSelf
[ICLR2024 Spotlight] Code Release of CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction
Language:Python145 6 247
wusize/multiview_pose
[ICCV2021] Code Release of Graph-Based 3D Multi-Person Pose Estimation Using Multi-View Images
Language:Python42 4 47
wusize/CLIM
[AAAI2024] Code Release of CLIM: Contrastive Language-Image Mosaic for Region Representation
Language:Python24 1 23
wusize/F-LMM
Code Release of F-LMM: Grounding Frozen Large Multimodal Models
Language:Python60
wusize/wusize.github.io
GitHub Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
Language:JavaScript2 0 01
wusize/colorization
This is the code of the colorization project of the National Innovation Program.
Language:Python1 1 00
wusize/CAT-Seg
Official Implementation of "CAT-Seg🐱: Cost Aggregation for Open-Vocabulary Semantic Segmentation"
Language:Python00
wusize/CLIP
Language:Jupyter Notebook0 1 00
wusize/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Language:Python0 0 00
wusize/LLaVA-Grounding
Language:Python0 0 00
wusize/lmms-eval
Accelerating the development of large multimodal models (LMMs) with lmms-eval
Language:Python0 0 00
wusize/open_clip-1
An open source implementation of CLIP.
Language:Python0 0 00
wusize/OVD_Contest
Language:Python0 1 02
wusize/RegionCLIP
[CVPR 2022] Official code for "RegionCLIP: Region-based Language-Image Pretraining"
Language:Python0 0 00
wusize/SAN
Open-vocabulary Semantic Segmentation
Language:Python0 0 00
wusize/UNINEXT
[CVPR'23] Universal Instance Perception as Object Discovery and Retrieval
Language:Python0 0
wusize/Visual-CoT
Visual CoT: Unleashing Chain-of-Thought Reasoning in the Multi-Modal Language Model
Language:Python0 0
