YuxuanSnow's Stars
cosmicman-cvpr2024/CosmicMan
CosmicMan: A Text-to-Image Foundation Model for Humans (CVPR 2024)
lpiccinelli-eth/UniDepth
Universal Monocular Metric Depth Estimation
OpenRLHF/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)
xiexh20/HDM
Official implementation for Hierarachical Diffusion Model in CVPR24 Template free reconstruction of human object interaction
jellyheadandrew/CHORUS
riccardomarin/NICP
DaLi-Jack/SSR-code
Official implementation of 3DV24 paper "Single-view 3D Scene Reconstruction with High-fidelity Shape and Texture"
city-super/GSDF
GSDF: 3DGS Meets SDF for Improved Rendering and Reconstruction
AviSoori1x/makeMoE
From scratch implementation of a sparse mixture of experts language model inspired by Andrej Karpathy's makemore :)
CyberAgentAILab/SuperNormal
[CVPR 2024] Official implementation of "SuperNormal: Neural Surface Reconstruction via Multi-View Normal Integration"
adobe-research/affordance-insertion
ViLab-UCSD/OpenRooms
This is the dataset and code release of the OpenRooms Dataset. For more information, please refer to our webpage below. Thanks a lot for your interest in our research!
fuxiao0719/GeoWizard
[ECCV'24] GeoWizard: Unleashing the Diffusion Priors for 3D Geometry Estimation from a Single Image
Doubiiu/DynamiCrafter
[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
soniajoseph/ViT-Prisma
ViT Prisma is a mechanistic interpretability library for Vision Transformers (ViTs).
NanmiCoder/MediaCrawler
小红书笔记 | 评论爬虫、抖音视频 | 评论爬虫、快手视频 | 评论爬虫、B 站视频 | 评论爬虫、微博帖子 | 评论爬虫、百度贴吧帖子 | 百度贴吧评论回复爬虫 | 知乎问答文章|评论爬虫
IDEA-Research/Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
yanqinJiang/Consistent4D
[ICLR 2024] Official Implementation of Consistent4D: Consistent 360° Dynamic Object Generation from Monocular Video
heheyas/V3D
V3D: Video Diffusion Models are Effective 3D Generators
thu-ml/CRM
[ECCV 2024] Single Image to 3D Textured Mesh in 10 seconds with Convolutional Reconstruction Model.
VAST-AI-Research/TripoSR
xhuangcv/humannorm
CVPR 2024: The official implementation of HumanNorm
XPandora/PhysGaussian
[CVPR 2024 Highlight] PhysGaussian: Physics-Integrated 3D Gaussians for Generative Dynamics
HengyiWang/MorpheuS
[CVPR'24] MorpheuS: Neural Dynamic 360° Surface Reconstruction from Monocular RGB-D Video
jasonyzhang/RayDiffusion
Code for "Cameras as Rays"
naver/dust3r
DUSt3R: Geometric 3D Vision Made Easy
oss-roettger/T5-Textual-Inversion
Textual Inversion for DeepFloyd IF
horseee/DeepCache
[CVPR 2024] DeepCache: Accelerating Diffusion Models for Free
Magicboomliu/Accelerator-Simple-Template
This is a simple template using HuggingFace Accelerator for DDP-training/Saving/Loading/Pushing.
kxhit/EscherNet
[CVPR2024 Oral] EscherNet: A Generative Model for Scalable View Synthesis