tonghe90's Stars
Stability-AI/generative-models
Generative Models by Stability AI
facebookresearch/segment-anything-2
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
THUDM/CogVideo
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
ahmetbersoz/chatgpt-prompts-for-academic-writing
This list of writing prompts covers a range of topics and tasks, including brainstorming research ideas, improving language and style, conducting literature reviews, and developing research plans.
buaacyw/MeshAnything
From anything to mesh like human artists. Official impl. of "MeshAnything: Artist-Created Mesh Generation with Autoregressive Transformers"
JonathonLuiten/Dynamic3DGaussians
baaivision/Emu3
Next-Token Prediction is All You Need
apple/ml-4m
4M: Massively Multimodal Masked Modeling
YvanYin/Metric3D
The repo for "Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image" and "Metric3Dv2: A Versatile Monocular Geometric Foundation Model..."
TencentARC/MotionCtrl
Official Code for MotionCtrl [SIGGRAPH 2024]
XPandora/PhysGaussian
[CVPR 2024 Highlight] PhysGaussian: Physics-Integrated 3D Gaussians for Generative Dynamics
showlab/Show-o
Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.
facebookresearch/vggsfm
VGGSfM: Visual Geometry Grounded Deep Structure From Motion
CLAY-3D/OpenCLAY
CLAY: A Controllable Large-scale Generative Model for Creating High-quality 3D Assets
YuelangX/Gaussian-Head-Avatar
[CVPR 2024] Official repository for "Gaussian Head Avatar: Ultra High-fidelity Head Avatar via Dynamic Gaussians"
lpiccinelli-eth/UniDepth
Universal Monocular Metric Depth Estimation
buoyancy99/diffusion-forcing
code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"
taichi-dev/games201
Advanced Physics Engines 2020: A Hands-on Tutorial
ActiveVisionLab/porf
(ICLR 2024) PoRF: Pose Residual Field for Accurate Neural Surface Reconstruction
IntelLabs/MMPano
Official implementation of L-MAGIC
Open3DVLab/NeuRodin
[NeurIPS'24] NeuRodin: A Two-stage Framework for High-Fidelity Neural Surface Reconstruction
zdchan/GraspXL
This is a repository for GraspXL, which can generate objective-drive grasping motions for 500k+ objects with different dexterous hands.
Open3DVLab/GigaGS
GigaGS: Scaling up Planar-Based 3D Gaussians for Large Scene Surface Reconstruction
chaytonmin/Awesome-Papers-World-Models-Autonomous-Driving
Awesome Papers about World Models in Autonomous Driving
homangab/Track-2-Act
code for the paper Predicting Point Tracks from Internet Videos enables Diverse Zero-Shot Manipulation
HaoyiZhu/PointCloudMatters
[NeurIPS 2024 D&B] Point Cloud Matters: Rethinking the Impact of Different Observation Spaces on Robot Learning
zju3dv/DATAP-SfM
DATAP-SfM: Dynamic-Aware Tracking Any Point for Robust Dense Structure from Motion in the Wild
HaoyiZhu/RealRobot
Open-source implementations on real robots
zju3dv/DiffPano
[NeurIPS2024] DiffPano: Scalable and Consistent Text to Panorama Generation with Spherical Epipolar-Aware Diffusion
zju3dv/FedSurfGS
FedSurfGS: Scalable 3D Surface Gaussian Splatting with Federated Learning for Large Scene Reconstruction