syguan96
AIGC Researcher@vivo. I'm also interested in physical world modeling and understanding.
vivo, Shanghai
Pinned Repositories
BOA
[CVPR 2021] Bilevel Online Adaptation for Human Mesh Reconstruction
DynaBOA
[T-PAMI 2022] Out-of-Domain Human Mesh Reconstruction via Dynamic Bilevel Online Adaptation
facetools
Easy-to-use face related tools, including face detection, landmark localization, alignment & recognition, based on PyTorch.
Image2StyleGAN2
I reframe it to Image2StyleGAN2
Image2StyleGAN3
Invert an image into the latent space of StyleGAN3
NeuMA
[NeurIPS 2024] NeuMA: Neural Material Adaptor for Visual Grounding of Intrinsic Dynamics
NeuroFluid
[ICML 2022] NeuroFluid: Fluid Dynamics Grounding with Particle-Driven Neural Radiance Fields
Novel-StyleGAN-Inversion-Papers
Interesting StyleGAN-related papers. Focusing on StyleGAN inversion.
Pix2Video.pytorch
Implementation of the paper "Pix2Video: Video Editing using Image Diffusion"
SIP_dataset
A new large-scale dataset for Sparse Inertial Poser
syguan96's Repositories
syguan96/DiffSwap
[CVPR 2023] DiffSwap is a diffusion-based face-swapping framework.
syguan96/SLR-SFS
Code release for the paper "Simulating Fluids in Real-World Still Images"
syguan96/MotionDirector
MotionDirector: Motion Customization of Text-to-Video Diffusion Models.
syguan96/pexels.com-bulk-downloads-videos
Bulk-download videos from pexels.com with this simple Python script.
syguan96/ddpo-pytorch
DDPO for finetuning diffusion models, implemented in PyTorch with LoRA support
syguan96/latent-consistency-model
Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference
syguan96/MotionBERT
[ICCV 2023] PyTorch Implementation of "MotionBERT: A Unified Perspective on Learning Human Motion Representations"
syguan96/gradsim
Differentiable simulation for system identification and visuomotor control
syguan96/IP-Adapter
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with an image prompt.
syguan96/Awesome-Multimodal-Large-Language-Models
✨✨ Latest Papers and Datasets on Multimodal Large Language Models, and Their Evaluation.
syguan96/latent-slot-diffusion
Official release of NeurIPS 2023 Spotlight paper LSD: Object-Centric Slot Diffusion
syguan96/NeuroFluid
[ICML 2022] NeuroFluid: Fluid Dynamics Grounding with Particle-Driven Neural Radiance Fields
syguan96/hackGPT
I leverage OpenAI and ChatGPT to do hackerish things
syguan96/3d-photo-inpainting
[CVPR 2020] 3D Photography using Context-aware Layered Depth Inpainting
syguan96/ReMoDiffuse
ReMoDiffuse: Retrieval-Augmented Motion Diffusion Model
syguan96/syguan96
syguan96/CoDeF
Official PyTorch implementation of CoDeF: Content Deformation Fields for Temporally Consistent Video Processing
syguan96/facechain
FaceChain is a deep-learning toolchain for generating your Digital-Twin.
syguan96/LISA
Project Page for "LISA: Reasoning Segmentation via Large Language Model"
syguan96/minSDXL
Huggingface-compatible SDXL Unet implementation that is readily hackable
syguan96/fastcomposer
FastComposer: Tuning-Free Multi-Subject Image Generation with Localized Attention
syguan96/threestudio
A unified framework for 3D content generation.
syguan96/hellollm
Pre-train a new LLM
syguan96/HyperKohaku
A diffusers based implementation of HyperDreamBooth
syguan96/RealChar
🎙️🤖 Create, Customize and Talk to your AI Character/Companion in Realtime (All in One Codebase!). Have a natural, seamless conversation with AI everywhere (mobile, web and terminal) using LLM OpenAI GPT3.5/4, Anthropic Claude2, Chroma Vector DB, Whisper Speech2Text, ElevenLabs Text2Speech 🎙️🤖
syguan96/AnimateDiff
Official implementation of AnimateDiff.
syguan96/StyleDrop-PyTorch
Unofficial implementation of [StyleDrop](https://arxiv.org/abs/2306.00983)
syguan96/motion-latent-diffusion
(CVPR 2023) Executing your Commands via Motion Diffusion in Latent Space, a fast and high-quality motion diffusion model
syguan96/sd-webui-roop
roop extension for StableDiffusion web-ui
syguan96/taichi_3d_gaussian_splatting