Shallow2000

Shallow2000's Stars

Chuny1/3DGPT
76536
One-2-3-45/One-2-3-45
[NeurIPS 2023] Official code of "One-2-3-45: Any Single Image to 3D Mesh in 45 Seconds without Per-Shape Optimization"
Language:Python1.6k84
guoyww/AnimateDiff
Official implementation of AnimateDiff.
Language:Python10.4k856
pengbo807/ConditionVideo
Training-Free Condition-Guided Text-to-Video Generation
Language:Python56
Stability-AI/generative-models
Generative Models by Stability AI
Language:Python24.4k2.7k
AILab-CVC/VideoCrafter
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models
Language:Python4.5k335
xxlong0/Wonder3D
Single Image to 3D using Cross-Domain Diffusion for 3D Generation
Language:Python4.7k375
facebookresearch/hiera
Hiera: A fast, powerful, and simple hierarchical vision transformer.
Language:Python88142
open-mmlab/mmdetection3d
OpenMMLab's next-generation platform for general 3D object detection.
Language:Python5.2k1.5k
ChenHsing/Awesome-Video-Diffusion-Models
[CSUR] A Survey on Video Diffusion Models
1.8k90
chaytonmin/UniScene
Official implementation of our RAL'24 paper: Multi-Camera Unified Pre-training for Autonomous Driving
Language:Python20414
hkchengrex/Cutie
[CVPR 2024 Highlight] Putting the Object Back Into Video Object Segmentation
Language:Python69870
JonathonLuiten/Dynamic3DGaussians
Language:Python1.9k119
rese1f/StableVideo
[ICCV 2023] StableVideo: Text-driven Consistency-aware Diffusion Video Editing
Language:Python1.4k88
googleinterns/IBRNet
Language:Python49153
mit-han-lab/fastcomposer
[IJCV] FastComposer: Tuning-Free Multi-Subject Image Generation with Localized Attention
Language:Python65336
mit-han-lab/offsite-tuning
Offsite-Tuning: Transfer Learning without Full Model
Language:Python36638
mit-han-lab/streaming-llm
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
Language:Python6.6k362
fundamentalvision/BEVFormer
[ECCV 2022] This is the official implementation of BEVFormer, a camera-only framework for autonomous driving perception, e.g., 3D object detection and semantic map segmentation.
Language:Python3.3k538
FrozenBurning/SceneDreamer
[TPAMI 2023] SceneDreamer: Unbounded 3D Scene Generation from 2D Image Collections
Language:Python60841
dreamgaussian/dreamgaussian
[ICLR 2024 Oral] Generative Gaussian Splatting for Efficient 3D Content Creation
Language:Python3.9k351
hzxie/CityDreamer
The official implementation of "CityDreamer: Compositional Generative Model of Unbounded 3D Cities". (Xie et al., CVPR 2024)
Language:Python60042
ifzhang/FairMOT
[IJCV-2021] FairMOT: On the Fairness of Detection and Re-Identification in Multi-Object Tracking
Language:Python4k935
JeffWang987/DriveDreamer
[ECCV 2024] DriveDreamer: Towards Real-world-driven World Models for Autonomous Driving
2928
shalfun/DrivingDiffusion
Layout-Guided multi-view driving scene video generation with latent diffusion model
Language:Python54614
nutonomy/nuscenes-devkit
The devkit of the nuScenes dataset.
Language:Python2.3k626
rh20t/rh20t_api
Language:Python595
TonyLianLong/LLM-groundedVideoDiffusion
[ICLR 2024] LLM-grounded Video Diffusion Models (LVD): official implementation for the LVD paper
Language:Python1237
ChenyangQiQi/FateZero
[ICCV 2023 Oral] "FateZero: Fusing Attentions for Zero-shot Text-based Video Editing"
Language:Jupyter Notebook1.1k106
xxxnell/spatial-smoothing
(ICML 2022) Official PyTorch implementation of “Blurs Behave Like Ensembles: Spatial Smoothings to Improve Accuracy, Uncertainty, and Robustness”.
Language:Python777