kangzhao2's Stars
transformer-vq/transformer_vq
facebookresearch/fairscale
PyTorch extensions for high performance and large scale training.
PixArt-alpha/PixArt-alpha
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
InstantID/InstantID
InstantID : Zero-shot Identity-Preserving Generation in Seconds 🔥
rosinality/stylegan2-pytorch
Implementation of Analyzing and Improving the Image Quality of StyleGAN (StyleGAN 2) in PyTorch
richzhang/PerceptualSimilarity
LPIPS metric. pip install lpips
Seanseattle/MobileFaceSwap
MobileFaceSwap: A Lightweight Framework for Video Face Swapping (AAAI 2022)
IDEA-Research/DWPose
"Effective Whole-body Pose Estimation with Two-stages Distillation" (ICCV 2023, CV4Metaverse Workshop)
eric-ai-lab/photoswap
Official implementation of the NeurIPS 2023 paper "Photoswap: Personalized Subject Swapping in Images"
neuralchen/SimSwap
An arbitrary face-swapping framework on images and videos with one single trained model!
Seanseattle/StyleSwap
StyleSwap: Style-Based Generator Empowers Robust Face Swapping (ECCV 2022)
wl-zhao/DiffSwap
[CVPR 2023] DiffSwap is a diffusion-based face-swapping framework.
ali-vilab/VGen
Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models
OpenTalker/SadTalker
[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
williamyang1991/Rerender_A_Video
[SIGGRAPH Asia 2023] Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation
timesler/facenet-pytorch
Pretrained Pytorch face detection (MTCNN) and facial recognition (InceptionResnet) models
IGLICT/SketchFaceNeRF
ashawkey/stable-dreamfusion
Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion.
ashawkey/RAD-NeRF
Real-time Neural Radiance Talking Portrait Synthesis via Audio-spatial Decomposition
thu-ml/prolificdreamer
ProlificDreamer: High-Fidelity and Diverse Text-to-3D Generation with Variational Score Distillation (NeurIPS 2023 Spotlight)
felixkreuk/UnsupSeg
Self-Supervised Contrastive Learning for Unsupervised Phoneme Segmentation (INTERSPEECH 2020)
felixkreuk/SegFeat
Phoneme Boundary Detection using Learnable Segmental Features (ICASSP 2020)
Picsart-AI-Research/Text2Video-Zero
[ICCV 2023 Oral] Text-to-Image Diffusion Models are Zero-Shot Video Generators
Jiawei-Yang/FreeNeRF
[CVPR23] FreeNeRF: Improving Few-shot Neural Rendering with Free Frequency Regularization
chenfei-wu/TaskMatrix
NVlabs/eg3d
apchenstu/TensoRF
[ECCV 2022] Tensorial Radiance Fields, a novel approach to model and reconstruct radiance fields
google/nerfies
This is the code for Deformable Neural Radiance Fields, a.k.a. Nerfies.
yerfor/GeneFace
GeneFace: Generalized and High-Fidelity 3D Talking Face Synthesis; ICLR 2023; Official code
joonson/syncnet_python
Out of time: automated lip sync in the wild