XingruiWang's Stars
Stability-AI/generative-models
Generative Models by Stability AI
Mikubill/sd-webui-controlnet
WebUI extension for ControlNet
google-deepmind/deepmind-research
This repository contains implementations and illustrative code to accompany DeepMind publications
facebookresearch/ImageBind
ImageBind: One Embedding Space to Bind Them All
ashawkey/stable-dreamfusion
Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion.
Doubiiu/DynamiCrafter
[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
ShoufaChen/DiffusionDet
[ICCV2023 Best Paper Finalist] PyTorch implementation of DiffusionDet (https://arxiv.org/abs/2211.09788)
rese1f/StableVideo
[ICCV 2023] StableVideo: Text-driven Consistency-aware Diffusion Video Editing
Computer-Vision-in-the-Wild/CVinW_Readings
A collection of papers on the topic of "Computer Vision in the Wild (CVinW)"
XPandora/PhysGaussian
[CVPR 2024 Highlight] PhysGaussian: Physics-Integrated 3D Gaussians for Generative Dynamics
minghanqin/LangSplat
Official implementation of the paper "LangSplat: 3D Language Gaussian Splatting" [CVPR2024 Highlight]
cure-lab/MagicDrive
[ICLR24] Official implementation of the paper “MagicDrive: Street View Generation with Diverse 3D Geometry Control”
penghao-wu/vstar
PyTorch Implementation of "V*: Guided Visual Search as a Core Mechanism in Multimodal LLMs"
threedworld-mit/tdw
ThreeDWorld simulation environment
LuciNyan/pixel-profile
Generate a pixel art style profile card from your GitHub data! ✨
UM-ARM-Lab/pytorch_kinematics
Robot kinematics implemented in PyTorch
locuslab/lcp-physics
A differentiable LCP physics engine in PyTorch.
Hramchenko/diffusion_distiller
🚀 PyTorch Implementation of "Progressive Distillation for Fast Sampling of Diffusion Models (v-diffusion)"
xudejing/video-question-answering
Video Question Answering via Gradually Refined Attention over Appearance and Motion
Aleafy/Make_it_Real
Make-it-Real: Unleashing Large Multimodal Model for Painting 3D Objects with Realistic Materials
snap-research/discoscene
CVPR 2023 Highlight: DiscoScene
ifsheldon/stannum
Fusing Taichi into PyTorch
ethanweber/nerfiller
NeRFiller project https://ethanweber.me/nerfiller/
facebookresearch/EgoVLPv2
Code release for "EgoVLPv2: Egocentric Video-Language Pre-training with Fusion in the Backbone" [ICCV, 2023]
AnjieCheng/TUVF
[ICLR'24] This repository is the implementation of "TUVF: Learning Generalizable Texture UV Radiance Fields"
dingmyu/VRDP
[NeurIPS 2021] Dynamic Visual Reasoning by Learning Differentiable Physics Models from Video and Language
lzhangbj/ASVA
[ECCV 2024 Oral] Audio-Synchronized Visual Animation
wufeim/NeMo
Neural mesh models for 3D reasoning.
uvavision/SimVQA
[CVPR 2022] SimVQA: Exploring Simulated Environments for Visual Question Answering
shutterstock-is-cringe/webvid
Large-scale text-video dataset. 10 million captioned short videos.