tonghe90

Shanghai AI lab

tonghe90's Stars

Stability-AI/generative-models
Generative Models by Stability AI
Language:Python24.8k 258 3112.7k
facebookresearch/segment-anything-2
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Language:Jupyter Notebook11.1k 64 259944
THUDM/CogVideo
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
Language:Python9.5k 124 434898
ahmetbersoz/chatgpt-prompts-for-academic-writing
This list of writing prompts covers a range of topics and tasks, including brainstorming research ideas, improving language and style, conducting literature reviews, and developing research plans.
3.3k 41 4298
buaacyw/MeshAnything
From anything to mesh like human artists. Official impl. of "MeshAnything: Artist-Created Mesh Generation with Autoregressive Transformers"
Language:Python2.1k 32 2791
JonathonLuiten/Dynamic3DGaussians
Language:Python2k 94 42126
baaivision/Emu3
Next-Token Prediction is All You Need
Language:Python1.9k 33 4775
apple/ml-4m
4M: Massively Multimodal Masked Modeling
Language:Python1.6k 34 2597
YvanYin/Metric3D
The repo for "Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image" and "Metric3Dv2: A Versatile Monocular Geometric Foundation Model..."
Language:Python1.5k 27 178111
TencentARC/MotionCtrl
Official Code for MotionCtrl [SIGGRAPH 2024]
Language:Python1.3k 50 3571
XPandora/PhysGaussian
[CVPR 2024 Highlight] PhysGaussian: Physics-Integrated 3D Gaussians for Generative Dynamics
Language:Python1.1k 47 3943
showlab/Show-o
Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.
Language:Python1k 14 4645
facebookresearch/vggsfm
VGGSfM: Visual Geometry Grounded Deep Structure From Motion
Language:Python938 33 7172
CLAY-3D/OpenCLAY
CLAY: A Controllable Large-scale Generative Model for Creating High-quality 3D Assets
843 111 010
YuelangX/Gaussian-Head-Avatar
[CVPR 2024] Official repository for "Gaussian Head Avatar: Ultra High-fidelity Head Avatar via Dynamic Gaussians"
Language:Python786 60 4349
lpiccinelli-eth/UniDepth
Universal Monocular Metric Depth Estimation
Language:Python647 15 7052
buoyancy99/diffusion-forcing
code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"
Language:Python637 6 2330
taichi-dev/games201
Advanced Physics Engines 2020: A Hands-on Tutorial
Language:Python542 20 140
ActiveVisionLab/porf
(ICLR 2024) PoRF: Pose Residual Field for Accurate Neural Surface Reconstruction
Language:Python128 8 87
IntelLabs/MMPano
Official implementation of L-MAGIC
Language:Python124 3 55
Open3DVLab/NeuRodin
[NeurIPS'24] NeuRodin: A Two-stage Framework for High-Fidelity Neural Surface Reconstruction
Language:Python112 10 66
zdchan/GraspXL
This is a repository for GraspXL, which can generate objective-drive grasping motions for 500k+ objects with different dexterous hands.
Language:C++101 6 77
Open3DVLab/GigaGS
GigaGS: Scaling up Planar-Based 3D Gaussians for Large Scene Surface Reconstruction
98 23 23
chaytonmin/Awesome-Papers-World-Models-Autonomous-Driving
Awesome Papers about World Models in Autonomous Driving
67 5 13
homangab/Track-2-Act
code for the paper Predicting Point Tracks from Internet Videos enables Diverse Zero-Shot Manipulation
Language:Python64 1 54
HaoyiZhu/PointCloudMatters
[NeurIPS 2024 D&B] Point Cloud Matters: Rethinking the Impact of Different Observation Spaces on Robot Learning
Language:Python50 2 31
zju3dv/DATAP-SfM
DATAP-SfM: Dynamic-Aware Tracking Any Point for Robust Dense Structure from Motion in the Wild
35 18 11
HaoyiZhu/RealRobot
Open-source implementations on real robots
Language:Python27 3 00
zju3dv/DiffPano
[NeurIPS2024] DiffPano: Scalable and Consistent Text to Panorama Generation with Spherical Epipolar-Aware Diffusion
22 13 00
zju3dv/FedSurfGS
FedSurfGS: Scalable 3D Surface Gaussian Splatting with Federated Learning for Large Scene Reconstruction
14 9 10

tonghe90

tonghe90's Stars

Stability-AI/generative-models

facebookresearch/segment-anything-2

THUDM/CogVideo

ahmetbersoz/chatgpt-prompts-for-academic-writing

buaacyw/MeshAnything

JonathonLuiten/Dynamic3DGaussians

baaivision/Emu3

apple/ml-4m

YvanYin/Metric3D

TencentARC/MotionCtrl

XPandora/PhysGaussian

showlab/Show-o

facebookresearch/vggsfm

CLAY-3D/OpenCLAY

YuelangX/Gaussian-Head-Avatar

lpiccinelli-eth/UniDepth

buoyancy99/diffusion-forcing

taichi-dev/games201

ActiveVisionLab/porf

IntelLabs/MMPano

Open3DVLab/NeuRodin

zdchan/GraspXL

Open3DVLab/GigaGS

chaytonmin/Awesome-Papers-World-Models-Autonomous-Driving

homangab/Track-2-Act

HaoyiZhu/PointCloudMatters

zju3dv/DATAP-SfM

HaoyiZhu/RealRobot

zju3dv/DiffPano

zju3dv/FedSurfGS