DmitriyVahrushev's Stars
Sanster/IOPaint
Image inpainting tool powered by SOTA AI Model. Remove any unwanted object, defect, people from your pictures or erase and replace(powered by stable diffusion) any thing on your pictures.
cumulo-autumn/StreamDiffusion
StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation
TencentARC/PhotoMaker
PhotoMaker [CVPR 2024]
karpathy/minbpe
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
roboflow/notebooks
Examples and tutorials on using SOTA computer vision models and techniques. Learn everything from old-school ResNet, through YOLO and object-detection transformers like DETR, to the latest models like Grounding DINO and SAM.
fudan-generative-vision/champ
Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance
PixArt-alpha/PixArt-alpha
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
chaofengc/IQA-PyTorch
👁️ 🖼️ 🔥PyTorch Toolbox for Image Quality Assessment, including PSNR, SSIM, LPIPS, FID, NIQE, NRQM(Ma), MUSIQ, TOPIQ, NIMA, DBCNN, BRISQUE, PI and more...
yerfor/GeneFacePlusPlus
GeneFace++: Generalized and Stable Real-Time 3D Talking Face Generation; Official Code
cszn/BSRGAN
Designing a Practical Degradation Model for Deep Blind Image Super-Resolution (ICCV, 2021) (PyTorch) - We released the training code!
Kedreamix/Awesome-Talking-Head-Synthesis
💬 An extensive collection of exceptional resources dedicated to the captivating world of talking face synthesis! ⭐ If you find this repo useful, please give it a star! 🤩
ashawkey/RAD-NeRF
Real-time Neural Radiance Talking Portrait Synthesis via Audio-spatial Decomposition
OML-Team/open-metric-learning
Metric learning and retrieval pipelines, models and zoo.
Meta-Portrait/MetaPortrait
[CVPR 2023] MetaPortrait: Identity-Preserving Talking Head Generation with Fast Personalized Adaptation
FuxiVirtualHuman/styletalk
wuhuikai/GP-GAN
Official Chainer implementation of GP-GAN: Towards Realistic High-Resolution Image Blending (ACMMM 2019, oral)
soumik-kanad/diff2lip
theEricMa/OTAvatar
This is the official repository for OTAvatar: One-shot Talking Face Avatar with Controllable Tri-plane Rendering [CVPR2023].
yuangan/EAT_code
Official code for ICCV 2023 paper: "Efficient Emotional Adaptation for Audio-Driven Talking-Head Generation".
guanjz20/StyleSync_PyTorch
PyTorch implementation of "StyleSync: High-Fidelity Generalized and Personalized Lip Sync in Style-based Generator"
nihaomiao/WACV23_TSNet
The pytorch implementation of our WACV23 paper "Cross-identity Video Motion Retargeting with Joint Transformation and Synthesis".
yylgoodlucky/HDTR-Net
A Real-Time High-Definition Teeth Restoration Network for ArbitraryTalking Face Generation Methods
kenwaytis/faster-SadTalker-API
The API server version of the SadTalker project. Runs in Docker, 10 times faster than the original!
deepbrainai-research/discohead
bychen7/Face-Restoration-TensorRT
A simple face restoration TensorRT deployment solution.
CVMI-Lab/Speech2Lip
[ICCV2023] Speech2Lip: High-fidelity Speech to Lip Generation by Learning from a Short Video
Jason-cs18/awesome-avatar
📖 A curated list of resources dedicated to avatar.
langzizhixin/wav2lip-576x576
This is a project about talking faces. We use 576X576 sized facial images for training, which can generate 2k, 4k, 6k, and 8k digital human videos.
g-milis/NEUTART
PyTorch implementation of NEUTART, a system that creates photorealistic talking avatars from an input text transcription.
foocker/SadTalkerTriton