zqh0253

Ph.D. at MMLab, CUHK; B.Eng. at ZJU.

zqh0253's Stars

black-forest-labs/flux
Official inference repo for FLUX.1 models
Language:Python15.9k 143 1561.2k
lllyasviel/IC-Light
More relighting!
Language:Python5.5k 54 89361
Doubiiu/ToonCrafter
[SIGGRAPH Asia 2024, Journal Track] ToonCrafter: Generative Cartoon Interpolation
Language:Python5.4k 59 55445
luigifreda/pyslam
pySLAM contains a Visual Odometry (VO) pipeline in Python for monocular, stereo and RGBD cameras. It supports many modern local features based on Deep Learning.
Language:Python1.9k 44 104336
baaivision/Emu3
Next-Token Prediction is All You Need
Language:Python1.8k 30 4271
colmap/glomap
GLOMAP - Global Structured-from-Motion Revisited
Language:C++1.5k 25 9294
YvanYin/Metric3D
The repo for "Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image" and "Metric3Dv2: A Versatile Monocular Geometric Foundation Model..."
Language:Python1.4k 27 170107
naver/mast3r
Grounding Image Matching in 3D with MASt3R
Language:Python1.3k 31 7399
FoundationVision/LlamaGen
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
Language:Python1.3k 22 6056
menyifang/MIMO
Official implementation of "MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling"
1.3k 110 2552
showlab/Show-o
Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.
Language:Python1k 14 4244
facebookresearch/vggsfm
VGGSfM: Visual Geometry Grounded Deep Structure From Motion
Language:Python920 32 6568
nerfstudio-project/viser
Web-based 3D visualization + Python
Language:Python835 29 10749
henry123-boy/SpaTracker
[CVPR 2024 Highlight] Official PyTorch implementation of SpatialTracker: Tracking Any 2D Pixels in 3D Space
Language:Python731 59 4225
facebookresearch/PoseDiffusion
[ICCV 2023] PoseDiffusion: Solving Pose Estimation via Diffusion-aided Bundle Adjustment
Language:Python716 22 3541
nianticlabs/acezero
[ECCV 2024 - Oral] ACE0 is a learning-based structure-from-motion approach that estimates camera parameters of sets of images by learning a multi-view consistent, implicit scene representation.
Language:Python654 58 3041
OpenRobotLab/GRUtopia
GRUtopia: Dream General Robots in a City at Scale
Language:Python508 12 2124
apple/ml-mdm
Train high-quality text-to-image diffusion models in a data & compute efficient manner
Language:Python438 13 1628
hehao13/CameraCtrl
Language:Python435 12 1619
OpenGVLab/OmniCorpus
OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text
Language:Python271 13 86
facebookresearch/lightplane
Lightplane implements a highly memory-efficient differentiable radiance field renderer, and a module for unprojecting features from images to 3D grids.
Language:Python262 27 48
cwchenwang/awesome-4d-generation
List of papers on 4D Generation.
181 8 21
zqh0253/3DitScene
3DitScene: Editing Any Scene via Language-guided Disentangled Gaussian Splatting
Language:Python179 5 138
galeselee/Awesome_LLM_System-PaperList
Since the emergence of chatGPT in 2022, the acceleration of Large Language Model has become increasingly important. Here is a list of papers on accelerating LLMs, currently focusing mainly on inference acceleration, and related works will be gradually added in the future. Welcome contributions!
175 4 07
hwanhuh/Radiance-Fields-from-VGGSfM-Mast3r
Gaussian Splatting from VGGSfM and Mast3r, and their comparison
Language:Python175 5 44
tyhuang0428/DreamPhysics
DreamPhysics: Learning Physical Properties of Dynamic 3D Gaussians from Video Diffusion Priors
Language:Python167 4 24
customdiffusion360/custom-diffusion360
CustomDiffusion360: Customizing Text-to-Image Diffusion with Camera Viewpoint Control
Language:Python151 5 16
ubc-vision/vivid123
[CVPR 2024 Highlight] ViVid-1-to-3: Novel View Synthesis with Video Diffusion Models
Language:Python147 12 48
MegaScenes/dataset
65 3 20
jihaonew/UTA
Enhancing Vision-Language Model with Unmasked Token Alignment (TMLR)
Language:Python9 2 20

zqh0253

zqh0253's Stars

black-forest-labs/flux

lllyasviel/IC-Light

Doubiiu/ToonCrafter

luigifreda/pyslam

baaivision/Emu3

colmap/glomap

YvanYin/Metric3D

naver/mast3r

FoundationVision/LlamaGen

menyifang/MIMO

showlab/Show-o

facebookresearch/vggsfm

nerfstudio-project/viser

henry123-boy/SpaTracker

facebookresearch/PoseDiffusion

nianticlabs/acezero

OpenRobotLab/GRUtopia

apple/ml-mdm

hehao13/CameraCtrl

OpenGVLab/OmniCorpus

facebookresearch/lightplane

cwchenwang/awesome-4d-generation

zqh0253/3DitScene

galeselee/Awesome_LLM_System-PaperList

hwanhuh/Radiance-Fields-from-VGGSfM-Mast3r

tyhuang0428/DreamPhysics

customdiffusion360/custom-diffusion360

ubc-vision/vivid123

MegaScenes/dataset

jihaonew/UTA