arxyzan/data2vec-pytorch

In data2vec.py

Closed this issue · 4 comments

In data2vec.py, at line 90,

y = self.ema.model(trg, ~mask, **kwargs)['encoder_states']

shouldn't it have been,

y = self.ema.model(trg, None, **kwargs)['encoder_states']

(going by the training strategy in the paper)?

Hello @HarshavardhanaTG,
As far as I remember, the EMA model must take `~mask`. You can also verify this in the original fairseq implementation (V1 only).

Hey @arxyzan, thank you so much for replying so quickly. Your repository has been of huge help!
    with torch.no_grad():
        self.ema.model.eval()

        if self.cfg.ema_transformer_only:
            y, layer_results = self.ema.model.extract_features(
                pre_encoder_features,
                padding_mask=padding_mask,
                min_layer=self.cfg.encoder_layers - self.average_top_k_layers,
            )
            y = {
                "x": y,
                "padding_mask": padding_mask,
                "layer_results": layer_results,
            }
        else:
            y = self.ema.model.extract_features(
                source=source,
                padding_mask=orig_padding_mask,
                mask=False,
            )

        target_layer_results = [l[2] for l in y["layer_results"]]

I think they did fix that issue upstream. It's entirely possible that I am mistaken; please let me know if I am wrong. I am a bit confused about this part, but the rest of your repo seemed absolutely fine. Thanks again!

@HarshavardhanaTG Sorry for the late response. The original implementation extracted the mask inside the forward method, whereas I decided to build it in the dataset and pass it as a parameter to the forward method. Either way is correct. The main thing to note here is that the original mask fed to the student model must be inverted and fed to the EMA (teacher) model.
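To illustrate the invariant described above, here is a minimal sketch, assuming a boolean mask where `True` marks positions hidden from the student (the tensor shapes and variable names are illustrative, not taken from the repo):

```python
import torch

batch, seq_len = 2, 8

# Mask built in the dataset: True = position masked out for the student.
student_mask = torch.zeros(batch, seq_len, dtype=torch.bool)
student_mask[:, 2:5] = True

# The same mask is inverted with `~` before being fed to the EMA teacher,
# as in `self.ema.model(trg, ~mask, **kwargs)`.
teacher_mask = ~student_mask

# The two views are exact complements: no position is in both,
# and together they cover the whole sequence.
assert not (student_mask & teacher_mask).any()
assert (student_mask | teacher_mask).all()
```

Since `~` on a boolean tensor is an element-wise logical NOT, inverting the mask (rather than passing `None`) is just an explicit way of handing the teacher the complementary view of the input.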

Thank you so much! That helps!