HeliosZhao's Stars
threestudio-project/threestudio
A unified framework for 3D content generation.
qiuyu96/CoDeF
[CVPR 2024 Highlight] Official PyTorch implementation of CoDeF: Content Deformation Fields for Temporally Consistent Video Processing
dreamgaussian/dreamgaussian
[ICLR 2024 Oral] Generative Gaussian Splatting for Efficient 3D Content Creation
NExT-GPT/NExT-GPT
Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model
PixArt-alpha/PixArt-alpha
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
ashawkey/torch-ngp
A pytorch CUDA extension implementation of instant-ngp (sdf and nerf), with a GUI.
magic-research/magic-edit
MagicEdit: High-Fidelity Temporally Coherent Video Editing
ChenyangSi/FreeU
FreeU: Free Lunch in Diffusion U-Net (CVPR2024 Oral)
omerbt/TokenFlow
Official Pytorch Implementation for "TokenFlow: Consistent Diffusion Features for Consistent Video Editing" presenting "TokenFlow" (ICLR 2024)
guochengqian/Magic123
[ICLR24] Official PyTorch Implementation of Magic123: One Image to High-Quality 3D Object Generation Using Both 2D and 3D Diffusion Priors
TencentARC/MotionCtrl
Official Code for MotionCtrl [SIGGRAPH 2024]
gnobitab/InstaFlow
:zap: InstaFlow! One-Step Stable Diffusion with Rectified Flow (ICLR 2024)
showlab/Show-1
Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation
3DTopia/OpenLRM
An open-source impl. of Large Reconstruction Models
Vchitect/LaVie
LaVie: High-Quality Video Generation with Cascaded Latent Diffusion Models
dcharatan/pixelsplat
[CVPR 2024 Oral, Best Paper Runner-Up] Code for "pixelSplat: 3D Gaussian Splats from Image Pairs for Scalable Generalizable 3D Reconstruction" by David Charatan, Sizhe Lester Li, Andrea Tagliasacchi, and Vincent Sitzmann
bytedance/MVDream
Multi-view Diffusion for 3D Generation
MV-Dream/MVDream
code page placeholder
TianxingWu/FreeInit
[ECCV 2024] FreeInit: Bridging Initialization Gap in Video Diffusion Models
yyeboah/Awesome-Text-to-3D
A growing curation of Text-to-3D, Diffusion-to-3D works.
ZiyuGuo99/Point-Bind_Point-LLM
Align 3D Point Cloud with Multi-modalities for Large Language Models
modelscope/richdreamer
Live Demo:https://modelscope.cn/studios/Damo_XR_Lab/3D_AIGC
sherwinbahmani/4dfy
4D-fy: Text-to-4D Generation Using Hybrid Score Distillation Sampling
NExT-ChatV/NExT-Chat
The code of the paper "NExT-Chat: An LMM for Chat, Detection and Segmentation".
YilingQiao/DMRF
Dynamic Mesh-Aware Radiance Fields (ICCV2023): Raytracing rendering and interactive simulating mesh with NeRF
HeliosZhao/Animate124
Animate124: Animating One Image to 4D Dynamic Scene
jmhb0/view_neti
[ECCV 2024] Viewpoint Textual Inversion: Discovering Scene Representations and 3D View Control in 2D Diffusion Models
MischaQI/Sniffer
SNIFFER: Multimodal Large Language Model for Explainable Out-of-Context Misinformation Detection
zhaohengyuan1/SCT
[IJCV2023] Offical implementation of "SCT: A Simple Baseline for Parameter-Efficient Fine-Tuning via Salient Channels"
AIBluefisher/PaperFigure
Take screenshot of your paper