"pre-training" section in the readme
johntiger1 opened this issue · 1 comment
johntiger1 commented
Just want to confirm, when you talk about "pre-training" in the readme (https://github.com/airsplay/lxmert#pre-training) you mean training the entire LXMERT model from scratch?
If we just want to use an already-trained LXMERT model (adding a classification or LSTM layer on top), we can download the pre-trained checkpoint you provided (http://nlp.cs.unc.edu/data/model_LXRT.pth), load it, freeze the weights, and then fine-tune on our specific task, right?
Thanks
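The workflow described in the question (load a pre-trained checkpoint, freeze its weights, attach a new classification head, and fine-tune only the head) can be sketched roughly as below. This is a generic PyTorch sketch, not the lxmert repo's actual API: `Encoder`, `FineTuneModel`, and the dimensions are placeholder assumptions, and the real checkpoint-loading call would depend on the repo's model class.

```python
import torch
import torch.nn as nn

# Hypothetical stand-in for the pre-trained encoder; the real class and
# checkpoint keys come from the lxmert repo, not from this sketch.
class Encoder(nn.Module):
    def __init__(self, hidden=768):
        super().__init__()
        self.layer = nn.Linear(hidden, hidden)

    def forward(self, x):
        return torch.relu(self.layer(x))


class FineTuneModel(nn.Module):
    """Frozen pre-trained encoder plus a new, trainable classification head."""
    def __init__(self, encoder, hidden=768, num_classes=2):
        super().__init__()
        self.encoder = encoder
        # Freeze the pre-trained weights so only the head gets gradient updates.
        for p in self.encoder.parameters():
            p.requires_grad = False
        self.head = nn.Linear(hidden, num_classes)

    def forward(self, x):
        with torch.no_grad():
            feats = self.encoder(x)
        return self.head(feats)


encoder = Encoder()
# In practice you would load the downloaded checkpoint first, e.g.:
# encoder.load_state_dict(torch.load("model_LXRT.pth"), strict=False)
model = FineTuneModel(encoder)

# Only the head's parameters remain trainable; pass just those to the optimizer.
trainable = [p for p in model.parameters() if p.requires_grad]
optimizer = torch.optim.Adam(trainable, lr=1e-4)
```

Whether freezing is the right choice depends on the downstream task: the repo's own fine-tuning scripts may update the full model rather than only a head.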
johntiger1 commented
Wrong repo, sorry