MillX2021's Stars
CASIA-IVA-Lab/AnomalyGPT
[AAAI 2024 Oral] AnomalyGPT: Detecting Industrial Anomalies Using Large Vision-Language Models
siyuanliii/masa
Official Implementation of CVPR24 paper: Matching Anything by Segmenting Anything
cure-lab/MagicDrive
[ICLR24] Official implementation of the paper “MagicDrive: Street View Generation with Diverse 3D Geometry Control”
OpenDriveLab/Vista
A Generalizable World Model for Autonomous Driving
longzw1997/Open-GroundingDino
This is the third party implementation of the paper Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection.
GitGyun/visual_token_matching
[ICLR'23 Oral] Universal Few-shot Learning of Dense Prediction Tasks with Visual Token Matching
nnanhuang/S3Gaussian
Official Implementation of Self-Supervised Street Gaussians for Autonomous Driving
JiayuanWang-JW/YOLOv8-multi-task
huang-yh/GaussianFormer
Scene as Gaussians for Vision-Based 3D Semantic Occupancy Prediction
dcmlr/groundgrid
Source code for the article "GroundGrid: LiDAR Point Cloud Ground Segmentation and Terrain Estimation"
vignywang/SAMFeat
The official implementation of “Segment Anything Model is a Good Teacher for Local Feature Learning”.
swc-17/SparseDrive
wzzheng/OccSora
OccSora: 4D Occupancy Generation Models as World Simulators for Autonomous Driving
alibaba/conv-llava
PJLab-ADG/LeapAD
Continuously Learning, Adapting, and Improving: A Dual-Process Approach to Autonomous Driving
yihedeng9/STIC
Enhancing Large Vision Language Models with Self-Training on Image Comprehension.
KuanchihHuang/Reason3D
Reason3D: Searching and Reasoning 3D Segmentation via Large Language Model
aim-uofa/DiverGen
DiverGen (CVPR 2024) & BSGAL (ICML 2024)
aminebdj/OpenYOLO3D
Our OpenYOLO3D model achieves state-of-the-art performance in Open Vocabulary 3D Instance Segmentation on ScanNet200 and Replica datasets with up ∼16x speedup compared to the best existing method in literature.
MaxZanella/CLIP-LoRA
[CVPRW 2024] Low-Rank Adaptation for few-shot Vision-Language Models (CLIP-LoRA): a strong baseline without hyperparameter tuning.
importZL/BLO-SAM
HanchenTai/OV-SAM3D
Open-Vocabulary SAM3D: Understand Any 3D Scene
marcointrovigne/WeatherDetection
nini0919/SemiRES
The official implementation of SemiRES in PyTorch.
yuhui-zh15/VLMClassifier
Official implementation of "Why are Visually-Grounded Language Models Bad at Image Classification?"
SLDGroup/PP-SAM
Duojun-Huang/AlignSAM-CVPR2024
Pytorch official implementation for our CVPR-2024 paper "AlignSAM: Aligning Segment Anything Model to Open Context via Reinforcement Learning".
qinzheng2000/GeneralTrack
SegoleneMartin/transductive-CLIP
xiaomoguhz/OV-DQUO