Why not just crop the faces with their meta data from VoxCeleb since we already have the face bboxes?
Cold-Winter opened this issue · 3 comments
Cold-Winter commented
Thank you for the elegant implementation. It helps a lot!
I am wondering why you need to detect the faces from the VoxCeleb dataset since we already have the face bounding box meta data in this dataset? Are you trying to crop tighter face bboxs instead of using their boxes? What if we train the first order model with the faces cropped by their boxes?
charan223 commented
Any update on this?
brianw0924 commented
Same question