HelloMagicWorld

HelloMagicWorld's Stars

mlfoundations/open_clip
An open source implementation of CLIP.
Language: Python 10.3k 78 487981
facebookresearch/ImageBind
ImageBind: One Embedding Space to Bind Them All
Language: Python 8.3k 98 90767
LiheYoung/Depth-Anything
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
Language: Python 7k 49 216537
ytongbai/LVM
Language: Python 1.8k 120 2254
S-LoRA/S-LoRA
S-LoRA: Serving Thousands of Concurrent LoRA Adapters
Language: Python 1.7k 24 3995
Pointcept/Pointcept
Pointcept: a codebase for point cloud perception research. Latest works: PTv3 (CVPR'24 Oral), PPT (CVPR'24), OA-CNNs (CVPR'24), MSC (CVPR'23)
Language: Python 1.6k 19 316177
princeton-nlp/MeZO
[NeurIPS 2023] MeZO: Fine-Tuning Language Models with Just Forward Passes. https://arxiv.org/abs/2305.17333
Language: Python 1k 19 3863
Jack-bo1220/Awesome-Remote-Sensing-Foundation-Models
938 42 2387
liliu-avril/Awesome-Segment-Anything
This repository is for the first comprehensive survey on Meta AI's Segment Anything Model (SAM).
830 20 753
mbzuai-oryx/groundingLMM
[CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.
Language: Python 779 31 7437
PKU-YuanGroup/LanguageBind
【ICLR 2024🔥】 Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment
Language: Python 719 15 6252
lxtGH/Awesome-Segmentation-With-Transformer
[T-PAMI-2024] Transformer-Based Visual Segmentation: A Survey
694 10 547
richzhang/webpage-template
Simple project webpage template. Originally used in Colorful Image Colorization. ECCV, 2016.
Language: HTML 462 3 0156
ZiyuGuo99/Point-Bind_Point-LLM
Align 3D Point Cloud with Multi-modalities for Large Language Models
Language: Python 416 15 1231
spotify-research/llark
Code for the paper "LLark: A Multimodal Instruction-Following Language Model for Music" by Josh Gardner, Simon Durand, Daniel Stoller, and Rachel Bittner.
Language: Jupyter Notebook 304 8 724
xiaoaoran/3d_url_survey
(TPAMI2023) Unsupervised Point Cloud Representation Learning with Deep Neural Networks: A Survey
202 14 120
NiFangBaAGe/Explicit-Visual-Prompt
[CVPR 2023] Explicit Visual Prompting for Low-Level Structure Segmentations
Language: Python 189 5 1914
xiaoaoran/SynLiDAR
SynLiDAR: Synthetic LiDAR sequential point cloud dataset with point-wise annotations (AAAI2022)
Language: Python 129 13 1511
NJU-LHRS/LHRS-Bot
VGI-Enhanced multimodal large language model for remote sensing images.
Language: Python 104 4 288
xiaoaoran/SemanticSTF
(CVPR 2023) The official project of "3D Semantic Segmentation in the Wild: Learning Generalized Models for Adverse-Condition Point Clouds"
Language: Python 104 4 913
weihao1115/cat-sam
[ECCV 2024 Oral] The official implementation of "CAT-SAM: Conditional Tuning for Few-Shot Adaptation of Segment Anything Model".
Language: Python 99 5 67
xiaoaoran/polarmix
Official PyTorch implementation of the NeurIPS2022 paper "PolarMix: A General Data Augmentation Technique for LiDAR Point Clouds"
Language: Python 65 6 711
xiaoaoran/3D_label_efficient_learning
Official repository for TPAMI2024 "A Survey of Label-Efficient Deep Learning for 3D Point Clouds"
50 1 01
xiaoaoran/FPS-Net
Code for "FPS-Net: A convolutional fusion network for large-scale LiDAR point cloud segmentation".
Language: Python 21 4 86
xing0047/rewrite
[NeurIPS 2023] Rewrite Caption Semantics: Bridging Semantic Gaps for Language-Supervised Semantic Segmentation
Language: Python 20 2 00
zhangshengjun2019/GeoAuxNet
[CVPR 2024] GeoAuxNet: Towards Universal 3D Representation Learning for Multi-sensor Point Clouds
Language: Python 8 1 10