seanzhuh

what/why then how

CSE @ HKUSTHong Kong, China

seanzhuh's Stars

VITA-MLLM/VITA
✨✨VITA: Towards Open-Source Interactive Omni Multimodal LLM
Language:Python80741
yixuan730/DetToolChain
Dettoolchain: A new prompting paradigm to unleash detection ability of MLLM
17
scratchapixel/scratchapixel-code
Language:GLSL27862
facebookresearch/sam2
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Language:Jupyter Notebook11.3k968
slothfulxtx/Texture-GS
[ECCV 2024] The official repo for "Texture-GS: Disentangling the Geometry and Texture for 3D Gaussian Splatting Editing"
Language:Python1165
ruiqixu37/Nuvo
Personal Implementation of the paper: Nuvo: Neural UV Mapping for Unruly 3D Representations
Language:Python171
HKUST-LongGroup/Awesome-Open-Vocabulary-Detection-and-Segmentation
Awesome OVD-OVS - A Survey on Open-Vocabulary Detection and Segmentation: Past, Present, and Future
976
BradyFU/Video-MME
✨✨Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis
37411
FoundationVision/VAR
[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!
Language:Python4k302
Rubics-Xuan/MRES
This repo holds the official code and data for "Unveiling Parts Beyond Objects: Towards Finer-Granularity Referring Expression Segmentation", accepted by CVPR 2024.
63
baaivision/tokenize-anything
[ECCV 2024] Tokenize Anything via Prompting
Language:Jupyter Notebook50319
V3Det/V3Det
Language:Python972
shenyunhang/APE
[CVPR 2024] Aligning and Prompting Everything All at Once for Universal Visual Perception
Language:Python47829
bytedance/OmniScient-Model
This repo contains the code for our paper Towards Open-Ended Visual Recognition with Large Language Model
Language:Jupyter Notebook895
MinaGhadimiAtigh/hyperbolic_representation_learning
The repository for Hyperbolic Representation Learning for Computer Vision, ECCV 2022
Language:Jupyter Notebook615
valeoai/Awesome-Unsupervised-Object-Localization
Curated list of awesome works on unsupervised object localization in 2D images.
652
microsoft/SoM
Set-of-Mark Prompting for GPT-4V and LMMs
Language:Python1.1k88
dome272/Diffusion-Models-pytorch
Pytorch implementation of Diffusion Models (https://arxiv.org/pdf/2006.11239.pdf)
Language:Python1.1k260
apple/ml-ferret
Language:Python8.3k485
baaivision/Uni3D
[ICLR'24 Spotlight] Uni3D: 3D Visual Representation from BAAI
Language:Python46629
Paranioar/UniPT
[CVPR2024] The code of "UniPT: Universal Parallel Tuning for Transfer Learning with Efficient Parameter and Memory"
Language:Python631
prannaykaul/mm-ovod
Official repo for our ICML 23 paper: "Multi-Modal Classifiers for Open-Vocabulary Object Detection"
Language:Python777
mlzxy/devit
Language:Python33045
facebookresearch/dinov2
PyTorch code and models for the DINOv2 self-supervised learning method.
Language:Jupyter Notebook8.9k783
OpenGVLab/VisionLLM
VisionLLM Series
Language:Python86421
xmed-lab/CLIP_Surgery
CLIP Surgery for Better Explainability with Enhancement in Open-Vocabulary Tasks
Language:Jupyter Notebook35123
witnessai/Awesome-Open-Vocabulary-Object-Detection
A curated list of papers, datasets and resources pertaining to open vocabulary object detection.
27517
mlfoundations/open_clip
An open source implementation of CLIP.
Language:Python9.9k959
open-mmlab/playground
A central hub for gathering and showcasing amazing projects that extend OpenMMLab with SAM and other exciting features.
Language:Python1.1k122
CamuseCao/XMU-thesis
A LaTeX template
Language:TeX8824

seanzhuh

seanzhuh's Stars

VITA-MLLM/VITA

yixuan730/DetToolChain

scratchapixel/scratchapixel-code

facebookresearch/sam2

slothfulxtx/Texture-GS

ruiqixu37/Nuvo

HKUST-LongGroup/Awesome-Open-Vocabulary-Detection-and-Segmentation

BradyFU/Video-MME

FoundationVision/VAR

Rubics-Xuan/MRES

baaivision/tokenize-anything

V3Det/V3Det

shenyunhang/APE

bytedance/OmniScient-Model

MinaGhadimiAtigh/hyperbolic_representation_learning

valeoai/Awesome-Unsupervised-Object-Localization

microsoft/SoM

dome272/Diffusion-Models-pytorch

apple/ml-ferret

baaivision/Uni3D

Paranioar/UniPT

prannaykaul/mm-ovod

mlzxy/devit

facebookresearch/dinov2

OpenGVLab/VisionLLM

xmed-lab/CLIP_Surgery

witnessai/Awesome-Open-Vocabulary-Object-Detection

mlfoundations/open_clip

open-mmlab/playground

CamuseCao/XMU-thesis