A collection of papers and released code for talking head generation.

This repo mainly focuses on the image-driven talking head generation task. For additions or corrections concerning talking head generation in other domains, please open an issue, submit a pull request, or e-mail me. If you are researching talking head generation, you can add my Discord account, Fa-Ting Hong#6563, for better communication and cooperation.

  1. VoxCeleb1 [Download link].
  2. VoxCeleb2 [Download link].
  3. FaceForensics++ [Download link].
  4. CelebV [Download link].
  5. TalkingHead-1KH [Download link].
  6. LRW (Lip Reading in the Wild) [Download link].
  7. MEAD [Download link].
  8. CelebV-HQ [Download link].



  1. [Face2face] Face2face: Real-time face capture and reenactment of RGB videos, CVPR 2016.


  1. [ReenactGAN] ReenactGAN: Learning to Reenact Faces via Boundary Transfer, ECCV 2018. [Code].
  2. [X2Face] X2Face: A network for controlling face generation by using images, audio, and pose codes, ECCV 2018. [Code], [Project].


  1. [FOMM] First order motion model for image animation, NeurIPS 2019. [Code].
  2. [NeuralHead] Few-Shot Adversarial Learning of Realistic Neural Talking Head Models, ICCV 2019. [Code].
  3. [Monkey-Net] Animating Arbitrary Objects via Deep Motion Transfer, CVPR 2019 Oral. [Code], [Project].
  4. [fs-vid2vid] Few-shot Video-to-Video Synthesis, NeurIPS 2019. [Code], [Project].


  1. [MeshG] Mesh Guided One-shot Face Reenactment Using Graph Convolutional Networks, ACM Multimedia 2020. [Code].

  2. [MarioNETte] MarioNETte: Few-shot Face Reenactment Preserving Identity of Unseen Targets, AAAI 2020. [Project].

  3. [CrossID-GAN] Learning Identity-Invariant Motion Representations for Cross-ID Face Reenactment, CVPR 2020.


  1. [face-vid2vid] One-Shot Free-View Neural Talking-Head Synthesis for Video Conferencing, CVPR 2021 Oral. [Project].

  2. [S2D] Sparse to Dense Motion Transfer for Face Image Animation, ICCV 2021.

  3. [SAFA] SAFA: Structure Aware Face Animation, 3DV 2021. [Code]

  4. [SAA] Self-appearance-aided Differential Evolution for Motion Transfer, arXiv 2021.

  5. [PIRenderer] PIRenderer: Controllable Portrait Image Generation via Semantic Neural Rendering, ICCV 2021. [Code]

  6. [FaceGAN] FACEGAN: Facial Attribute Controllable rEenactment GAN, WACV 2021.

  7. [F^3A-GAN] F3A-GAN: Facial Flow for Face Animation With Generative Adversarial Networks, IEEE TIP 2021.

  8. [FACIAL] FACIAL: Synthesizing Dynamic Talking Face with Implicit Attribute Learning, ICCV 2021.

  9. [MRAA] Motion Representations for Articulated Animation, CVPR 2021. [Code]

  10. [HeadGAN] HeadGAN: One-shot Neural Head Synthesis and Editing, ICCV 2021. [Project]


  1. [DaGAN] Depth-Aware Generative Adversarial Network for Talking Head Video Generation, CVPR 2022. [Code], [Project]

  2. [TPSM] Thin-Plate Spline Motion Model for Image Animation, CVPR 2022. [Code]

  3. [StyleHEAT] StyleHEAT: One-Shot High-Resolution Editable Talking Face Generation via Pretrained StyleGAN, ECCV 2022. [Code], [Project]

  4. [MegaPortraits] MegaPortraits: One-shot Megapixel Neural Head Avatars, ACM MM 2022. [Project]

  5. [DAM] Structure-Aware Motion Transfer with Deformable Anchor Model, CVPR 2022. [Code]

  6. [StyleMask] StyleMask: Disentangling the Style Space of StyleGAN2 for Neural Face Reenactment, FG 2023. [Code]

  7. [CoRF] Controllable Radiance Fields for Dynamic Face Synthesis, arXiv 2022.

  8. [AniFaceGAN] Animatable 3D-Aware Face Image Generation for Video Avatars, NeurIPS 2022. [Project]

  9. [IW] Implicit Warping for Animation with Image Sets, NeurIPS 2022. [Project]

  10. [HifiHead] HifiHead: One-Shot High Fidelity Neural Head Synthesis with 3D Control, IJCAI 2022.

  11. Face Animation with Multiple Source Images, arXiv 2022.



  1. [LRW] Lip Reading in the Wild, ACCV 2016.


  1. [Synthesizing-Obama] Synthesizing Obama: Learning Lip Sync From Audio, SIGGRAPH 2017. [Project].
  2. [You-Said-That?] You Said That?: Synthesising Talking Faces From Audio, IJCV 2019. [Code].
  3. Audio-Driven Facial Animation by Joint End-to-End Learning of Pose and Emotion, SIGGRAPH 2017.
  4. A Deep Learning Approach for Generalized Speech Animation, SIGGRAPH 2017.


  1. Lip Movements Generation at a Glance, ECCV 2018. [Code].
  2. [VisemeNet] VisemeNet: Audio-Driven Animator-Centric Speech Animation, SIGGRAPH 2018.


  1. [DAVS] Talking Face Generation by Adversarially Disentangled Audio-Visual Representation, AAAI 2019. [Code].
  2. [ATVGnet] Hierarchical Cross-modal Talking Face Generation with Dynamic Pixel-wise Loss, CVPR 2019. [Code]


  1. [Wav2Lip] A Lip Sync Expert Is All You Need for Speech to Lip Generation In The Wild, ACM Multimedia 2020. [Code], [Project].
  2. [RhythmicHead] Talking-head Generation with Rhythmic Head Motion, ECCV 2020. [Code].
  3. [MakeItTalk] MakeItTalk: Speaker-Aware Talking-Head Animation, SIGGRAPH Asia 2020. [Code], [Project].
  4. Neural Voice Puppetry: Audio-driven Facial Reenactment, ECCV 2020. [Project].
  5. [MEAD] MEAD: A Large-scale Audio-visual Dataset for Emotional Talking-face Generation, ECCV 2020. [Code], [Project].
  6. Realistic Speech-Driven Facial Animation with GANs, IJCV 2020.


  1. [PC-AVS] Pose-Controllable Talking Face Generation by Implicitly Modularized Audio-Visual Representation, CVPR 2021. [Code], [Project].
  2. [EVP] Audio-Driven Emotional Video Portraits, CVPR 2021. [Code]
  3. [FAU] Talking Head Generation with Audio and Speech Related Facial Action Units, arXiv 2021.
  4. [Speech2Talking-Face] Speech2Talking-Face: Inferring and Driving a Face with Synchronized Audio-Visual Representation, IJCAI 2021.
  5. [IATS] Imitating Arbitrary Talking Style for Realistic Audio-Driven Talking Face Synthesis, ACM MM 2021.
  6. [LSP] Live Speech Portraits: Real-Time Photorealistic Talking-Head Animation, ACM TOG 2021.
  7. [Audio2head] Audio2head: Audio-driven one-shot talking-head generation with natural head motion, arXiv 2021.


  1. [GC-AVT] Expressive Talking Head Generation with Granular Audio-Visual Control, CVPR 2022.
  2. Talking Face Generation with Multilingual TTS, CVPR 2022. [Demo Track].
  3. [EAMM] EAMM: One-Shot Emotional Talking Face via Audio-Based Emotion-Aware Motion Model, SIGGRAPH 2022.
  4. [SPACEx] SPACEx 🚀: Speech-driven Portrait Animation with Controllable Expression, arXiv 2022. [Project]

NeRF & 3D


  1. [DFA-NeRF] DFA-NeRF: Personalized Talking Head Generation via Disentangled Face Attributes Neural Rendering, arXiv 2021.
  2. [NerFACE] NerFACE: Dynamic Neural Radiance Fields for Monocular 4D Facial Avatar Reconstruction, CVPR 2021 Oral. [Code], [Project]


  1. [SSP-NeRF] Semantic-Aware Implicit Neural Audio-Driven Video Portrait Generation, arXiv 2022.

  2. [HeadNeRF] HeadNeRF: A Real-time NeRF-based Parametric Head Model, CVPR 2022. [Code], [Project]

  3. [IMavatar] I M Avatar: Implicit Morphable Head Avatars from Videos, CVPR 2022. [Code]

  4. [ROME] Realistic One-shot Mesh-based Head Avatars, ECCV 2022.

  5. [FNeVR] FNeVR: Neural Volume Rendering for Face Animation, arXiv 2022. [Code]

  6. [3DFaceShop] 3DFaceShop: Explicitly Controllable 3D-Aware Portrait Generation, arXiv 2022. [Code], [Project]

  7. [Next3D] Generative Neural Texture Rasterization for 3D-Aware Head Avatars, arXiv 2022. [Project]

  8. [NeRFInvertor] NeRFInvertor: High Fidelity NeRF-GAN Inversion for Single-shot Real Image Animation, arXiv 2022.



  1. [DiscoFaceGAN] Disentangled and Controllable Face Image Generation via 3D Imitative-Contrastive Learning, CVPR 2020 Oral. [Code].



  1. What comprises a good talking-head video generation?: A Survey and Benchmark.