Problems in the process of reproducing the code
Thank you for open-sourcing the code!
While reproducing the results, I ran into the following error:
Error(s) in loading state_dict for BertForSequenceClassification:
size mismatch for classifier.weight: copying a param with shape torch.Size([4, 768]) from checkpoint, the shape in current model is torch.Size([2, 768]).
size mismatch for classifier.bias: copying a param with shape torch.Size([4]) from checkpoint, the shape in current model is torch.Size([2]).
I haven't been able to solve it. Could you help me? Thanks!
Hi, this error occurs while the model parameters are being loaded. Could you re-check the number of labels in the model configuration (you are currently using 2)? It should match the checkpoint, whose classifier was trained with 4 labels.
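To make the mismatch concrete, here is a minimal sketch that reads the expected label count off the classifier weight shape. The shapes are copied from the error message above; `infer_num_labels` and the two dictionaries are hypothetical names used only for illustration and are not part of this repository.

```python
def infer_num_labels(param_shapes, key="classifier.weight"):
    """Return the label count implied by the classifier weight shape.

    The first dimension of a linear classifier's weight is the number
    of output classes, so (4, 768) means 4 labels.
    """
    if key not in param_shapes:
        raise KeyError(f"{key!r} not found in the given shapes")
    return param_shapes[key][0]


# Shapes taken from the error message (checkpoint vs. current model).
checkpoint_shapes = {"classifier.weight": (4, 768), "classifier.bias": (4,)}
current_model_shapes = {"classifier.weight": (2, 768), "classifier.bias": (2,)}

num_labels = infer_num_labels(checkpoint_shapes)

# Parameters whose shapes differ between the checkpoint and the model.
mismatched = sorted(
    name
    for name, shape in checkpoint_shapes.items()
    if current_model_shapes.get(name) != shape
)

print("labels expected by checkpoint:", num_labels)
print("mismatched parameters:", mismatched)
```

With a real checkpoint, the shapes could be collected from the loaded state_dict, for example `{k: tuple(v.shape) for k, v in state_dict.items()}`. Assuming the model is built with Hugging Face transformers, creating it with `num_labels=4` in its configuration should then let the checkpoint load; where that value is set in this repository is not shown in the thread.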
Thank you for the code!
I would appreciate it if you could explain which Python version the code requires, because I can't run it successfully. When I run the setup.py file in the ESIM folder, a message tells me to use Python 3.7. Could you please run the code again and tell me which base environment it needs?
As mentioned in the requirements.txt file, the Python version should be 3.6.
I used Python 3.6 when I wrote this code, so I am sure it works with that version.
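A quick way to confirm which interpreter is active before running anything is a small version check. This is only a sketch: `matches_required_version` is a hypothetical helper, and the required version (3, 6) is taken from the reply above rather than read from requirements.txt.

```python
import sys


def matches_required_version(current, required=(3, 6)):
    """Return True if the major.minor of `current` equals `required`."""
    return tuple(current[:2]) == tuple(required)


# Compare the running interpreter against Python 3.6.
if matches_required_version(sys.version_info):
    print("Python 3.6 detected.")
else:
    found = "{}.{}".format(sys.version_info[0], sys.version_info[1])
    print("Expected Python 3.6 but found " + found + ".")
```

If the check fails, creating a separate environment with Python 3.6 (for example with conda or virtualenv) and installing requirements.txt there is one option.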