HelloMagicWorld's Stars
mlfoundations/open_clip
An open source implementation of CLIP.
facebookresearch/ImageBind
ImageBind: One Embedding Space to Bind Them All
LiheYoung/Depth-Anything
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
ytongbai/LVM
S-LoRA/S-LoRA
S-LoRA: Serving Thousands of Concurrent LoRA Adapters
Pointcept/Pointcept
Pointcept: a codebase for point cloud perception research. Latest works: PTv3 (CVPR'24 Oral), PPT (CVPR'24), OA-CNNs (CVPR'24), MSC (CVPR'23)
princeton-nlp/MeZO
[NeurIPS 2023] MeZO: Fine-Tuning Language Models with Just Forward Passes. https://arxiv.org/abs/2305.17333
Jack-bo1220/Awesome-Remote-Sensing-Foundation-Models
liliu-avril/Awesome-Segment-Anything
This repository is for the first comprehensive survey on Meta AI's Segment Anything Model (SAM).
mbzuai-oryx/groundingLMM
[CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.
PKU-YuanGroup/LanguageBind
[ICLR 2024 🔥] Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment
lxtGH/Awesome-Segmentation-With-Transformer
[T-PAMI-2024] Transformer-Based Visual Segmentation: A Survey
richzhang/webpage-template
Simple project webpage template. Originally used in Colorful Image Colorization. ECCV, 2016.
ZiyuGuo99/Point-Bind_Point-LLM
Align 3D Point Cloud with Multi-modalities for Large Language Models
spotify-research/llark
Code for the paper "LLark: A Multimodal Instruction-Following Language Model for Music" by Josh Gardner, Simon Durand, Daniel Stoller, and Rachel Bittner.
xiaoaoran/3d_url_survey
(TPAMI2023) Unsupervised Point Cloud Representation Learning with Deep Neural Networks: A Survey
NiFangBaAGe/Explicit-Visual-Prompt
[CVPR 2023] Explicit Visual Prompting for Low-Level Structure Segmentations
xiaoaoran/SynLiDAR
SynLiDAR: Synthetic LiDAR sequential point cloud dataset with point-wise annotations (AAAI2022)
NJU-LHRS/LHRS-Bot
VGI-Enhanced multimodal large language model for remote sensing images.
xiaoaoran/SemanticSTF
(CVPR 2023) The official project of "3D Semantic Segmentation in the Wild: Learning Generalized Models for Adverse-Condition Point Clouds"
weihao1115/cat-sam
[ECCV 2024 Oral] The official implementation of "CAT-SAM: Conditional Tuning for Few-Shot Adaptation of Segment Anything Model".
xiaoaoran/polarmix
Official PyTorch implementation of the NeurIPS2022 paper "PolarMix: A General Data Augmentation Technique for LiDAR Point Clouds"
xiaoaoran/3D_label_efficient_learning
Official repository for TPAMI2024 "A Survey of Label-Efficient Deep Learning for 3D Point Clouds"
xiaoaoran/FPS-Net
Code for "FPS-Net: A convolutional fusion network for large-scale LiDAR point cloud segmentation".
xing0047/rewrite
[NeurIPS 2023] Rewrite Caption Semantics: Bridging Semantic Gaps for Language-Supervised Semantic Segmentation
zhangshengjun2019/GeoAuxNet
[CVPR 2024] GeoAuxNet: Towards Universal 3D Representation Learning for Multi-sensor Point Clouds