zhaorw02's Stars
csbench/csbench
OpenDriveLab/AgiBot-World
World's First Large-scale High-quality Robotic Manipulation Benchmark
Seed3D/Dora
Official repository for "Dora: Sampling and Benchmarking for 3D Shape Variational Auto-Encoders"
akanyaani/miniLLAMA
A simplified LLAMA implementation for training and inference tasks.
Genesis-Embodied-AI/Genesis
A generative world for general-purpose robotics & embodied AI learning.
junyanz/pytorch-CycleGAN-and-pix2pix
Image-to-Image Translation in PyTorch
deepseek-ai/DeepSeek-VL2
DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding
KwaiVGI/SynCamMaster
[ARXIV'24] SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints
huanngzh/MV-Adapter
[768 Resolution] [Any "SDXL" Model] [Various Conditions] [Arbitrary Views] Official impl. of "MV-Adapter: Multi-view Consistent Image Generation Made Easy"
microsoft/TRELLIS
Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation".
joonspk-research/generative_agents
Generative Agents: Interactive Simulacra of Human Behavior
lehduong/OneDiffusion
Official implementation of OneDiffusion paper
1zb/GeomDist
NVlabs/EdgeRunner
EdgeRunner: Auto-regressive Auto-encoder for Efficient Mesh Generation
samxuxiang/BrepGen
[SIGGRAPH 2024] Official PyTorch Implementation of "BrepGen: A B-rep Generative Diffusion Model with Structured Latent Geometry".
nv-tlabs/LLaMA-Mesh
Unifying 3D Mesh Generation with Language Models
louaaron/Score-Entropy-Discrete-Diffusion
[ICML 2024 Best Paper] Discrete Diffusion Modeling by Estimating the Ratios of the Data Distribution (https://arxiv.org/abs/2310.16834)
wenqsun/DimensionX
DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion
whaohan/bpt
Official code for paper: Scaling Mesh Generation via Compressive Tokenization
Tencent/Hunyuan3D-1
Tencent Hunyuan3D-1.0: A Unified Framework for Text-to-3D and Image-to-3D Generation
audi/MeshGPT
MeshGPT: Generating Triangle Meshes with Decoder-Only Transformers
Open3DVLab/NeuRodin
[NeurIPS'24] NeuRodin: A Two-stage Framework for High-Fidelity Neural Surface Reconstruction
WHU-USI3DV/VistaDream
[arXiv'24] VistaDream: Sampling multiview consistent images for single-view scene reconstruction
MandiZhao/real2code
microsoft/MoGe
MoGe: Unlocking Accurate Monocular Geometry Estimation for Open-Domain Images with Optimal Training Supervision
THUDM/GLM-4
GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型
NVlabs/InstantSplat
InstantSplat: Sparse-view SfM-free Gaussian Splatting in Seconds
aigc-apps/CogVideoX-Fun
📹 A more flexible CogVideoX that can generate videos at any resolution and creates videos from images.
btsmart/splatt3r
Official repository for Splatt3R: Zero-shot Gaussian Splatting from Uncalibrated Image Pairs
YvanYin/Metric3D
The repo for "Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image" and "Metric3Dv2: A Versatile Monocular Geometric Foundation Model..."