Shallow2000's Stars
Chuny1/3DGPT
One-2-3-45/One-2-3-45
[NeurIPS 2023] Official code of "One-2-3-45: Any Single Image to 3D Mesh in 45 Seconds without Per-Shape Optimization"
guoyww/AnimateDiff
Official implementation of AnimateDiff.
pengbo807/ConditionVideo
Training-Free Condition-Guided Text-to-Video Generation
Stability-AI/generative-models
Generative Models by Stability AI
AILab-CVC/VideoCrafter
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models
xxlong0/Wonder3D
Single Image to 3D using Cross-Domain Diffusion for 3D Generation
facebookresearch/hiera
Hiera: A fast, powerful, and simple hierarchical vision transformer.
open-mmlab/mmdetection3d
OpenMMLab's next-generation platform for general 3D object detection.
ChenHsing/Awesome-Video-Diffusion-Models
[CSUR] A Survey on Video Diffusion Models
chaytonmin/UniScene
Official implementation of our RAL'24 paper: Multi-Camera Unified Pre-training for Autonomous Driving
hkchengrex/Cutie
[CVPR 2024 Highlight] Putting the Object Back Into Video Object Segmentation
JonathonLuiten/Dynamic3DGaussians
rese1f/StableVideo
[ICCV 2023] StableVideo: Text-driven Consistency-aware Diffusion Video Editing
googleinterns/IBRNet
mit-han-lab/fastcomposer
[IJCV] FastComposer: Tuning-Free Multi-Subject Image Generation with Localized Attention
mit-han-lab/offsite-tuning
Offsite-Tuning: Transfer Learning without Full Model
mit-han-lab/streaming-llm
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
fundamentalvision/BEVFormer
[ECCV 2022] This is the official implementation of BEVFormer, a camera-only framework for autonomous driving perception, e.g., 3D object detection and semantic map segmentation.
FrozenBurning/SceneDreamer
[TPAMI 2023] SceneDreamer: Unbounded 3D Scene Generation from 2D Image Collections
dreamgaussian/dreamgaussian
[ICLR 2024 Oral] Generative Gaussian Splatting for Efficient 3D Content Creation
hzxie/CityDreamer
The official implementation of "CityDreamer: Compositional Generative Model of Unbounded 3D Cities". (Xie et al., CVPR 2024)
ifzhang/FairMOT
[IJCV-2021] FairMOT: On the Fairness of Detection and Re-Identification in Multi-Object Tracking
JeffWang987/DriveDreamer
[ECCV 2024] DriveDreamer: Towards Real-world-driven World Models for Autonomous Driving
shalfun/DrivingDiffusion
Layout-guided multi-view driving scene video generation with a latent diffusion model
nutonomy/nuscenes-devkit
The devkit of the nuScenes dataset.
rh20t/rh20t_api
TonyLianLong/LLM-groundedVideoDiffusion
[ICLR 2024] LLM-grounded Video Diffusion Models (LVD): official implementation for the LVD paper
ChenyangQiQi/FateZero
[ICCV 2023 Oral] "FateZero: Fusing Attentions for Zero-shot Text-based Video Editing"
xxxnell/spatial-smoothing
(ICML 2022) Official PyTorch implementation of “Blurs Behave Like Ensembles: Spatial Smoothings to Improve Accuracy, Uncertainty, and Robustness”.