Feature alignment loss

Question

schatto02 opened this issue 5 months ago · 1 comment

Hi, thanks for such great work! I was wondering which features are used for this loss: do we use intermediate features or the final encoder features?

Also, if the student and teacher feature dimensions are different, what kind of projection is used to bring them to a compatible feature space?

Answer 1 · 2024-08-19T02:54:09.000Z

We use the final encoder features.
The student and teacher share the same architecture (e.g., both ViT-Large) for alignment, so their feature dimensions already match. If the dimensions differ, we recommend adding a linear projection layer on top of the student features.
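
For anyone landing here with mismatched dimensions, below is a minimal PyTorch sketch of the suggestion above, not the repository's actual code. The class name `ProjectedAlignmentLoss`, the tensor shapes, and the widths 384 and 1024 are hypothetical. The cosine-similarity form of the loss is also an assumption, since this thread does not state which distance is used.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class ProjectedAlignmentLoss(nn.Module):
    """Hypothetical helper: aligns student features to teacher features."""

    def __init__(self, student_dim: int, teacher_dim: int):
        super().__init__()
        # A linear projection is only needed when the widths differ.
        if student_dim == teacher_dim:
            self.proj = nn.Identity()
        else:
            self.proj = nn.Linear(student_dim, teacher_dim)

    def forward(self, student_feats: torch.Tensor,
                teacher_feats: torch.Tensor) -> torch.Tensor:
        # Both inputs are final-encoder features of shape (batch, tokens, dim).
        projected = self.proj(student_feats)
        # The teacher is treated as a fixed target: no gradient flows into it.
        cos = F.cosine_similarity(projected, teacher_feats.detach(), dim=-1)
        # Assumed loss form: 1 - mean cosine similarity over all tokens.
        return 1.0 - cos.mean()


torch.manual_seed(0)
student = torch.randn(2, 16, 384)   # assumed smaller student width
teacher = torch.randn(2, 16, 1024)  # assumed ViT-Large-sized teacher width
loss_fn = ProjectedAlignmentLoss(student_dim=384, teacher_dim=1024)
loss = loss_fn(student, teacher)
loss.backward()  # gradients reach the projection layer's weights
```

The projection sits on the student side only, so the teacher features stay untouched and the extra layer can be dropped after training. When the two widths match, the module falls back to `nn.Identity` and reduces to a plain alignment loss.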