yanhn's Stars
f/awesome-chatgpt-prompts
This repo includes ChatGPT prompt curation to use ChatGPT better.
CompVis/stable-diffusion
A latent text-to-image diffusion model
huggingface/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
OpenTalker/SadTalker
[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
jianchang512/pyvideotrans
Translate the video from one language to another and add dubbing. 将视频从一种语言翻译为另一种语言,并支持api调用
lucidrains/denoising-diffusion-pytorch
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
openai/point-e
Point cloud diffusion for 3D model synthesis
timothybrooks/instruct-pix2pix
kohya-ss/sd-scripts
UX-Decoder/Segment-Everything-Everywhere-All-At-Once
[NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"
runwayml/stable-diffusion
Latent Text-to-Image Diffusion
KevinWang676/Bark-Voice-Cloning
Bark Voice Cloning and Voice Cloning for Chinese Speech
IDEA-Research/DWPose
"Effective Whole-body Pose Estimation with Two-stages Distillation" (ICCV 2023, CV4Metaverse Workshop)
deepseek-ai/DreamCraft3D
[ICLR 2024] Official implementation of DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior
autonomousvision/sdfstudio
A Unified Framework for Surface Reconstruction
adobe-research/custom-diffusion
Custom Diffusion: Multi-Concept Customization of Text-to-Image Diffusion (CVPR 2023)
SUDO-AI-3D/zero123plus
Code repository for Zero123++: a Single Image to Consistent Multi-view Diffusion Base Model.
sicxu/Deep3DFaceRecon_pytorch
Accurate 3D Face Reconstruction with Weakly-Supervised Learning: From Single Image to Image Set (CVPRW 2019). A PyTorch implementation.
ZrrSkywalker/Personalize-SAM
Personalize Segment Anything Model (SAM) with 1 shot in 10 seconds
TencentGameMate/chinese_speech_pretrain
chinese speech pretrained models
open-mmlab/PIA
[CVPR 2024] PIA, your Personalized Image Animator. Animate your images by text prompt, combing with Dreambooth, achieving stunning videos. PIA,你的个性化图像动画生成器,利用文本提示将图像变为奇妙的动画
lzzcd001/MeshDiffusion
Official implementation of "MeshDiffusion: Score-based Generative 3D Mesh Modeling" (ICLR 2023 Spotlight)
ICT-VGL/ICT-FaceKit
ICT's Vision and Graphics Lab's morphable face model and toolkit
PlayVoice/lora-svc
singing voice change based on whisper, and lora for singing voice clone
liuyuan-pal/NeRO
[SIGGRAPH2023] NeRO: Neural Geometry and BRDF Reconstruction of Reflective Objects from Multiview Images
ziqihuangg/ReVersion
ReVersion: Diffusion-Based Relation Inversion from Images
suxuann/ddib
Dual Diffusion Implicit Bridges for Image-to-Image Translation. ICLR 2023.
MomoAILab/ultrapose
Official repository for the ICCV 2021 paper: UltraPose: Synthesizing Dense Pose with 1 Billion Points by Human-body Decoupling 3D Model.
limanling/KnowledgeVL-Reading
yanhn/AugmentationTutorial
some basic data augmentation method