maum-ai/hififace

About your implementation

moonman925 opened this issue · 2 comments

Hi, thanks for sharing your code, great work!
I have several questions about the implementation.

  1. According to an issue on the official HifiFace GitHub repo, both masks are activated by sigmoid, but I noticed that you used tanh.
  2. Did you ever encounter a situation where the model learns an all-zero mask, causing it to generate the exact same image as the target image?
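As a side note on question 1, the two activations are closer than they look: a tanh output rescaled into (0, 1) is exactly a sigmoid with a doubled input, i.e. (tanh(x) + 1) / 2 = sigmoid(2x). A pure-Python sketch (function names are just for illustration):

```python
import math

def sigmoid(x):
    # maps any real input to (0, 1) -- a natural range for a soft mask
    return 1.0 / (1.0 + math.exp(-x))

def tanh_mask(x):
    # tanh maps to (-1, 1); rescaling with (tanh(x) + 1) / 2 lands in (0, 1)
    return (math.tanh(x) + 1.0) / 2.0

for x in [-3.0, 0.0, 3.0]:
    # both stay in (0, 1), and the rescaled tanh equals sigmoid(2x)
    assert 0.0 < sigmoid(x) < 1.0
    assert 0.0 < tanh_mask(x) < 1.0
    assert abs(tanh_mask(x) - sigmoid(2.0 * x)) < 1e-12
```

So if the tanh output is rescaled before blending, the difference reduces to an input scaling, which training can absorb.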

Hi! Thank you for your interest!

  1. Are you talking about this issue on the official project's GitHub? Honestly, I didn't know it existed. I decided on some details that were not mentioned in the paper, like the activation for the masks. I don't think this makes a very big difference during training.

  2. That shouldn't be happening. The segmentation loss is included in the loss function, which forces the mask to match the ground-truth mask.
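To see why the segmentation term rules out the degenerate all-zero mask, here is a toy pure-Python illustration (the loss shape and names are hypothetical, chosen only to show the effect; the repo's actual loss may differ): an L1 penalty between the predicted soft mask and the ground-truth mask is large wherever the ground truth is 1 but the prediction is 0.

```python
def l1_mask_loss(pred, target):
    # mean absolute difference between a predicted soft mask and the GT mask
    assert len(pred) == len(target)
    return sum(abs(p - t) for p, t in zip(pred, target)) / len(pred)

gt = [1.0, 1.0, 0.0, 0.0]      # toy flattened ground-truth face mask
all_zero = [0.0, 0.0, 0.0, 0.0]
decent = [0.9, 0.8, 0.1, 0.0]

# the all-zero mask is penalised on every pixel where the GT mask is 1
assert l1_mask_loss(all_zero, gt) > l1_mask_loss(decent, gt)
```

So as long as this term has nonzero weight, gradient descent pushes the mask away from all-zero toward the ground-truth face region.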

Hi, thanks for your work.

I also have some questions about slight differences in your implementation.

  1. In the paper, the authors use both the face recognition model's identity vector and the 3DMM's identity, expression, and pose vectors to construct V_id. However, I found that your implementation only uses the face recognition model's identity vector. Was this difference intended? I am also curious about what could go wrong when we drop those terms from the V_id formulation.
  2. How did you get the dilated mask for the SFF module's target? I couldn't find any specific guidance on obtaining the target mask in either the paper or the mask preprocessing steps provided in README.md.
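For reference on question 1: as the question describes, the paper builds V_id by concatenating the face recognition embedding with the 3DMM identity, expression, and pose coefficients, so dropping the 3DMM part removes the explicit shape/expression/pose signal from the conditioning vector. A minimal sketch (dimensions and names are hypothetical, not taken from the repo):

```python
def build_v_id(arcface_id, shape_coeff, expr_coeff, pose_coeff):
    # V_id as described in the HifiFace paper: a simple concatenation
    # of the identity embedding and the 3DMM coefficient vectors
    return arcface_id + shape_coeff + expr_coeff + pose_coeff

# toy vectors: 4-d identity, 3-d shape, 2-d expression, 1-d pose
v = build_v_id([0.1] * 4, [0.2] * 3, [0.3] * 2, [0.4])
assert len(v) == 10
```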
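On question 2, dilation itself is a standard morphological operation; one common way to get a dilated mask (this is an assumption about the preprocessing, not confirmed by the repo) is to apply a few iterations of binary dilation to the segmentation mask, e.g. with `cv2.dilate` or `scipy.ndimage.binary_dilation`. A naive pure-Python version with a 3x3 structuring element:

```python
def dilate(mask, iters=1):
    # naive binary dilation with a 3x3 structuring element;
    # cv2.dilate / scipy.ndimage.binary_dilation do this efficiently
    h, w = len(mask), len(mask[0])
    for _ in range(iters):
        out = [[0] * w for _ in range(h)]
        for i in range(h):
            for j in range(w):
                # a pixel turns on if any 3x3 neighbour is on
                if any(mask[i + di][j + dj]
                       for di in (-1, 0, 1) for dj in (-1, 0, 1)
                       if 0 <= i + di < h and 0 <= j + dj < w):
                    out[i][j] = 1
        mask = out
    return mask

m = [[0, 0, 0],
     [0, 1, 0],
     [0, 0, 0]]
# one iteration grows the single centre pixel into the full 3x3 block
assert dilate(m) == [[1, 1, 1], [1, 1, 1], [1, 1, 1]]
```

More iterations (or a larger structuring element) give a wider margin around the face region.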