SysCV/sam-hq

About interm_embeddings

zzzyzh opened this issue · 0 comments

In the image encoder code at https://github.com/SysCV/sam-hq/blob/322488826bda616798901c6280d13a9a90444ae7/train/segment_anything_training/modeling/image_encoder.py#L108C5-L120C36
you define a list, `interm_embeddings`, to store the outputs of the intermediate layers, but as far as I can tell only the first entry is actually used downstream.
Are the remaining entries used for anything?
Or did your experiments with them lead to any relevant findings about the architecture?
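
To make the question concrete, here is a minimal toy sketch of the pattern I am describing. It is not the repository code: `ToyBlock`, `toy_encoder`, and the rule that a `window_size` of 0 marks the blocks whose outputs are collected are all assumptions for illustration, and plain Python lists stand in for tensors.

```python
from dataclasses import dataclass


@dataclass
class ToyBlock:
    """Hypothetical stand-in for a transformer block (not the real class)."""
    index: int
    window_size: int  # assumption: 0 marks a block whose output is collected

    def __call__(self, x):
        # Stand-in for the block's computation: record which block ran.
        return x + [self.index]


def toy_encoder(x, blocks):
    """Run all blocks and collect the outputs of the selected ones."""
    interm_embeddings = []
    for blk in blocks:
        x = blk(x)
        if blk.window_size == 0:
            interm_embeddings.append(x)
    return x, interm_embeddings


# Assumed layout: 12 blocks, every third one is collected.
blocks = [ToyBlock(i, 0 if i % 3 == 2 else 14) for i in range(12)]
final, interm_embeddings = toy_encoder([], blocks)

# Only the first collected entry is consumed afterwards.
early_feature = interm_embeddings[0]
# These entries are stored but never read, which is what I am asking about.
unused = interm_embeddings[1:]
```

In this sketch, several intermediate outputs are stored, yet only `interm_embeddings[0]` is read afterwards.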