DarkGeekMS/portrait-synthesis-from-speech

Text2Face dataset processing and expansion

DarkGeekMS opened this issue · 0 comments

Process and expand the current Text2Face dataset through the following ideas :

  • Main idea :

    • Form pseudo-text descriptions from the CelebA attributes list : @MouhamedAhmed
    • Thus, the text descriptions with be matched with CelebA-HQ faces. Faces re-projection might be required to get unextended latent vectors (W) : @DarkGeekMS @MohamedRamzy1
  • Alternative idea :

    • Generate 200,000 faces from StyleGAN2 with their corresponding latent vector.
    • Get the top matches of the 4000 captioned faces from the generated database using FaceNet.
    • Filter the top matches manually :
    • Expand the dataset through :
      • Generate paraphrases for the face captions and use the multiple captions for the same face.
      • Get the closest k matches to a face, in order to have multiple data points from a single caption.