kevinchiu19's Stars
IDEA-Research/Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything
facebookresearch/sam2
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
FoundationVision/VAR
[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!
gaomingqi/Track-Anything
Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.
luosiallen/latent-consistency-model
Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference
DepthAnything/Depth-Anything-V2
[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation
facebookresearch/co-tracker
CoTracker is a model for tracking any point (pixel) on a video.
Doubiiu/DynamiCrafter
[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
prs-eth/Marigold
[CVPR 2024 - Oral, Best Paper Award Candidate] Marigold: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation
3DTopia/LGM
[ECCV 2024 Oral] LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation.
IDEA-Research/Grounded-SAM-2
Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2
Drexubery/ViewCrafter
Official implementation of "ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis"
Tencent/DepthCrafter
DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos
Junyi42/monst3r
Official Implementation of paper "MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion"
ziyc/drivestudio
A 3DGS framework for omni urban scene reconstruction and simulation.
NVlabs/EmerNeRF
PyTorch Implementation of EmerNeRF: Emergent Spatial-Temporal Scene Decomposition via Self-Supervision
nv-tlabs/XCube
[CVPR 2024 Highlight] XCube: Large-Scale 3D Generative Modeling using Sparse Voxel Hierarchies
GaussianCube/GaussianCube
[NeurIPS 2024] GaussianCube: A Structured and Explicit Radiance Representation for 3D Generative Modeling
Jyxarthur/flowsam
[ACCV 2024 (Oral)] Official Implementation of "Moving Object Segmentation: All You Need Is SAM (and Flow)" by Junyu Xie, Charig Yang, Weidi Xie, Andrew Zisserman
davyneven/SpatialEmbeddings
Instance Segmentation by Jointly Optimizing Spatial Embeddings and Clustering Bandwidth
stelzner/srt
Independent PyTorch implementation of Scene Representation Transformer
QianyiWu/objsdf
:t-rex: [ECCV'22] PyTorch implementation of 'Object-Compositional Neural Implicit Surfaces'
wzzheng/OccSora
OccSora: 4D Occupancy Generation Models as World Simulators for Autonomous Driving
snap-research/discoscene
CVPR 2023 Highlight: DiscoScene: Spatially Disentangled Generative Radiance Fields for Controllable 3D-aware Scene Synthesis
coltonstearns/dynamic-gaussian-marbles
jimazeyu/GraspSplats
GraspSplats: Efficient Manipulation with 3D Feature Splatting
lifuguan/GGRt_official
[ECCV 2024] GGRt: Towards Pose-free Generalizable 3D Gaussian Splatting in Real-time
YuLiu-LY/BO-QSA
This repository is the official implementation of "Improving Object-centric Learning with Query Optimization"
singhgautam/steve
Official code for Slot-Transformer for Videos (STEVE)
Shamdan17/CarFormer
The official repository for the ECCV2024 paper "CarFormer: Self-Driving with Learned Object-Centric Representations"