morbi25's Stars
black-forest-labs/flux
Official inference repo for FLUX.1 models
voxel51/fiftyone
Refine high-quality datasets and visual AI models
UX-Decoder/Segment-Everything-Everywhere-All-At-Once
[NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"
apple/ml-depth-pro
Depth Pro: Sharp Monocular Metric Depth in Less Than a Second.
facebookresearch/co-tracker
CoTracker is a model for tracking any point (pixel) on a video.
QwenLM/Qwen2-VL
Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
UX-Decoder/Semantic-SAM
[ECCV 2024] Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity"
fudan-zvg/Semantic-Segment-Anything
Automated dense category annotation engine that serves as the initial semantic labeling for the Segment Anything dataset (SA-1B).
Hedlen/awesome-segment-anything
Tracking and collecting papers/projects/others related to Segment Anything.
YvanYin/Metric3D
The repo for "Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image" and "Metric3Dv2: A Versatile Monocular Geometric Foundation Model..."
baaivision/Emu3
Next-Token Prediction is All You Need
NVlabs/InstantSplat
InstantSplat: Sparse-view SfM-free Gaussian Splatting in Seconds
Tencent/DepthCrafter
DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos
HengyiWang/spann3r
3D Reconstruction with Spatial Memory
Junyi42/monst3r
Official Implementation of paper "MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion"
baaivision/Uni3D
[ICLR'24 Spotlight] Uni3D: 3D Visual Representation from BAAI
cvg/GeoCalib
GeoCalib: Learning Single-image Calibration with Geometric Optimization (ECCV 2024)
CompVis/depth-fm
DepthFM: Fast Monocular Depth Estimation with Flow Matching
cvlab-kaist/CAT-Seg
Official Implementation of "CAT-Segš±: Cost Aggregation for Open-Vocabulary Semantic Segmentation"
ywyue/FiT3D
[ECCV 2024] Improving 2D Feature Representations by 3D-Aware Fine-Tuning
wysoczanska/clip_dinoiser
Official implementation of 'CLIP-DINOiser: Teaching CLIP a few DINO tricks' paper.
hwanhuh/Radiance-Fields-from-VGGSfM-Mast3r
Gaussian Splatting from VGGSfM and Mast3r, and their comparison
markomih/SplatFields
[ECCV 2024] SplatFields: Neural Gaussian Splats for Sparse 3D and 4D Reconstruction
JiangWenPL/FisherRF
[ECCV'24] FisherRF: Active View Selection and Uncertainty Quantification for Radiance Fields using Fisher Information
huanngzh/EpiDiff
[CVPR 2024] EpiDiff: Enhancing Multi-View Synthesis via Localized Epipolar-Constrained Diffusion
nv-dvl/segment-anything-lidar
[ECCV 2024] Better Call SAL: Towards Learning to Segment Anything in Lidar
LeapLabTHU/Segment3D
engineerJPark/LiDARWeather
[ECCV 2024 Oral] Official code of "Rethinking Data Augmentation for Robust LiDAR Semantic Segmentation in Adverse Weather".
hammoudhasan/DiversitySSL
kaschube-lab/LinkingInStyle