tonghe90's Stars
baaivision/Emu3
Next-Token Prediction is All You Need
showlab/Show-o
Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.
THUDM/CogVideo
Text-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
Stability-AI/generative-models
Generative Models by Stability AI
zdchan/GraspXL
This is a repository for GraspXL, which can generate objective-driven grasping motions for 500k+ objects with different dexterous hands.
zju3dv/DiffPano
[NeurIPS2024] DiffPano: Scalable and Consistent Text to Panorama Generation with Spherical Epipolar-Aware Diffusion
zju3dv/FedSurfGS
FedSurfGS: Scalable 3D Surface Gaussian Splatting with Federated Learning for Large Scene Reconstruction
zju3dv/DATAP-SfM
DATAP-SfM: Dynamic-Aware Tracking Any Point for Robust Dense Structure from Motion in the Wild
HaoyiZhu/RealRobot
Open-source implementations on real robots
lpiccinelli-eth/UniDepth
Universal Monocular Metric Depth Estimation
YvanYin/Metric3D
The repo for "Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image" and "Metric3Dv2: A Versatile Monocular Geometric Foundation Model..."
facebookresearch/vggsfm
VGGSfM: Visual Geometry Grounded Deep Structure From Motion
facebookresearch/segment-anything-2
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
buoyancy99/diffusion-forcing
Code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"
apple/ml-4m
4M: Massively Multimodal Masked Modeling
buaacyw/MeshAnything
From anything to mesh like human artists. Official impl. of "MeshAnything: Artist-Created Mesh Generation with Autoregressive Transformers"
Open3DVLab/GigaGS
GigaGS: Scaling up Planar-Based 3D Gaussians for Large Scene Surface Reconstruction
Open3DVLab/NeuRodin
NeuRodin: A Two-stage Framework for High-Fidelity Neural Surface Reconstruction
HaoyiZhu/PointCloudMatters
[NeurIPS 2024 D&B] Point Cloud Matters: Rethinking the Impact of Different Observation Spaces on Robot Learning
IntelLabs/MMPano
Official implementation of L-MAGIC
chaytonmin/Awesome-Papers-World-Models-Autonomous-Driving
Awesome Papers about World Models in Autonomous Driving
homangab/Track-2-Act
Code for the paper "Predicting Point Tracks from Internet Videos enables Diverse Zero-Shot Manipulation"
taichi-dev/games201
Advanced Physics Engines 2020: A Hands-on Tutorial
CLAY-3D/OpenCLAY
CLAY: A Controllable Large-scale Generative Model for Creating High-quality 3D Assets
YuelangX/Gaussian-Head-Avatar
[CVPR 2024] Official repository for "Gaussian Head Avatar: Ultra High-fidelity Head Avatar via Dynamic Gaussians"
ahmetbersoz/chatgpt-prompts-for-academic-writing
This list of writing prompts covers a range of topics and tasks, including brainstorming research ideas, improving language and style, conducting literature reviews, and developing research plans.
XPandora/PhysGaussian
[CVPR 2024 Highlight] PhysGaussian: Physics-Integrated 3D Gaussians for Generative Dynamics
TencentARC/MotionCtrl
Official Code for MotionCtrl [SIGGRAPH 2024]
ActiveVisionLab/porf
(ICLR 2024) PoRF: Pose Residual Field for Accurate Neural Surface Reconstruction
JonathonLuiten/Dynamic3DGaussians