awesome-talking-head-generation

A collection of papers and released code for talking head generation.

This repo mainly focuses on the image-driven talking head generation task. For additions or corrections concerning other talking head generation domains, please open an issue, submit a pull request, or e-mail me at fhongac@cse.ust.hk.

Related Group

MMLab@NTU

Datasets

VoxCeleb1 [Download link].
VoxCeleb2 [Download link].
FaceForensics++ [Download link].
CelebV [Download link].
TalkingHead-1KH [Download link].
LRW (Lip Reading in the Wild) [Download link].

Image-driven

Audio-driven

2016

[LRW] Lip Reading in the Wild, ACCV 2016.

2019

[DAVS] Talking Face Generation by Adversarially Disentangled Audio-Visual Representation, AAAI 2019. [Code].
[ATVGnet] Hierarchical Cross-modal Talking Face Generation with Dynamic Pixel-wise Loss, CVPR 2019. [Code]

2020

[Wav2Lip] A Lip Sync Expert Is All You Need for Speech to Lip Generation In The Wild, ACM Multimedia 2020. [Code], [Project].
[RhythmicHead] Talking-head Generation with Rhythmic Head Motion, ECCV 2020. [Code].
[MakeItTalk] MakeItTalk: Speaker-Aware Talking-Head Animation, SIGGRAPH Asia 2020. [Code], [Project].

2021

[PC-AVS] Pose-Controllable Talking Face Generation by Implicitly Modularized Audio-Visual Representation, CVPR 2021. [Code], [Project].
[IATS] Imitating Arbitrary Talking Style for Realistic Audio-Driven Talking Face Synthesis, ACM Multimedia 2021.
[EVP] Audio-Driven Emotional Video Portraits, CVPR 2021. [Code]
[FAU] Talking Head Generation with Audio and Speech Related Facial Action Units, arXiv 2021.
[Speech2Talking-Face] Speech2Talking-Face: Inferring and Driving a Face with Synchronized Audio-Visual Representation, IJCAI 2021.
[LSP] Live Speech Portraits: Real-Time Photorealistic Talking-Head Animation, ACM TOG 2021.
[Audio2Head] Audio2Head: Audio-driven One-shot Talking-head Generation with Natural Head Motion, arXiv 2021.

2022

[GC-AVT] Expressive Talking Head Generation with Granular Audio-Visual Control, CVPR 2022.

Nerf-Head

2021

[DFA-NeRF] DFA-NeRF: Personalized Talking Head Generation via Disentangled Face Attributes Neural Rendering, arXiv 2021.
[NerFACE] NerFACE: Dynamic Neural Radiance Fields for Monocular 4D Facial Avatar Reconstruction, CVPR 2021 Oral. [Code], [Project]

2022

[SSP-NeRF] Semantic-Aware Implicit Neural Audio-Driven Video Portrait Generation, arXiv 2022.
[HeadNeRF] HeadNeRF: A Real-time NeRF-based Parametric Head Model, CVPR 2022. [Code], [Project]
[IMavatar] I M Avatar: Implicit Morphable Head Avatars from Videos, CVPR 2022. [Code]
[ROME] Realistic One-shot Mesh-based Head Avatars, ECCV 2022.

Parameter-Based

2020

[DiscoFaceGAN] Disentangled and Controllable Face Image Generation via 3D Imitative-Contrastive Learning, CVPR 2020 Oral. [Code].

Survey

2020

What comprises a good talking-head video generation?: A Survey and Benchmark.
