MillX2021

MillX2021's Stars

CASIA-IVA-Lab/AnomalyGPT
[AAAI 2024 Oral] AnomalyGPT: Detecting Industrial Anomalies Using Large Vision-Language Models
Language:Python648 6 9376
siyuanliii/masa
Official Implementation of CVPR24 paper: Matching Anything by Segmenting Anything
51115
cure-lab/MagicDrive
[ICLR24] Official implementation of the paper “MagicDrive: Street View Generation with Diverse 3D Geometry Control”
Language:Python401 17 3220
OpenDriveLab/Vista
A Generalizable World Model for Autonomous Driving
Language:Python320 19 512
longzw1997/Open-GroundingDino
This is the third party implementation of the paper Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection.
Language:Python274 2 7142
GitGyun/visual_token_matching
[ICLR'23 Oral] Universal Few-shot Learning of Dense Prediction Tasks with Visual Token Matching
Language:Python248 7 1713
nnanhuang/S3Gaussian
Official Implementation of Self-Supervised Street Gaussians for Autonomous Driving
Language:Python244 9 712
JiayuanWang-JW/YOLOv8-multi-task
Language:Python182 2 5628
huang-yh/GaussianFormer
Scene as Gaussians for Vision-Based 3D Semantic Occupancy Prediction
1195
dcmlr/groundgrid
Source code for the article "GroundGrid: LiDAR Point Cloud Ground Segmentation and Terrain Estimation"
Language:C++98 5 06
vignywang/SAMFeat
The official implementation of “Segment Anything Model is a Good Teacher for Local Feature Learning”.
Language:Python98 6 43
swc-17/SparseDrive
862
wzzheng/OccSora
OccSora: 4D Occupancy Generation Models as World Simulators for Autonomous Driving
Language:Python736
alibaba/conv-llava
Language:Python67 3 33
PJLab-ADG/LeapAD
Continuously Learning, Adapting, and Improving: A Dual-Process Approach to Autonomous Driving
40 4 03
yihedeng9/STIC
Enhancing Large Vision Language Models with Self-Training on Image Comprehension.
Language:Python31 3 02
KuanchihHuang/Reason3D
Reason3D: Searching and Reasoning 3D Segmentation via Large Language Model
301
aim-uofa/DiverGen
DiverGen (CVPR 2024) & BSGAL (ICML 2024)
29 4 10
aminebdj/OpenYOLO3D
Our OpenYOLO3D model achieves state-of-the-art performance in Open Vocabulary 3D Instance Segmentation on ScanNet200 and Replica datasets with up ∼16x speedup compared to the best existing method in literature.
Language:Python240
MaxZanella/CLIP-LoRA
[CVPRW 2024] Low-Rank Adaptation for few-shot Vision-Language Models (CLIP-LoRA): a strong baseline without hyperparameter tuning.
Language:Python22
importZL/BLO-SAM
Language:Python19 2 11
HanchenTai/OV-SAM3D
Open-Vocabulary SAM3D: Understand Any 3D Scene
15
marcointrovigne/WeatherDetection
Language:Python120
nini0919/SemiRES
The official implementation of SemiRES in PyTorch.
Language:Python10
yuhui-zh15/VLMClassifier
Official implementation of "Why are Visually-Grounded Language Models Bad at Image Classification?"
Language:Python81
SLDGroup/PP-SAM
71
Duojun-Huang/AlignSAM-CVPR2024
Pytorch official implementation for our CVPR-2024 paper "AlignSAM: Aligning Segment Anything Model to Open Context via Reinforcement Learning".
6
qinzheng2000/GeneralTrack
Language:Python51
SegoleneMartin/transductive-CLIP
Language:Python51
xiaomoguhz/OV-DQUO
Language:Python51