RuntimeError: Error(s) in loading state_dict for ContextualCLIP: size mismatch
Hello author,
I encountered the following error when trying to reproduce the results of the paper. It seems that the checkpoint (pretrain_BART_generator_coldstart_OFA) you provided on Hugging Face doesn't match the current model:
Traceback (most recent call last):
File "/datasata0/cloud-wuzhengyuan/lxj/NDCR/OFA_encoder_Divide_and_Conquer.py", line 426, in
contextual_clip.load_state_dict(checkpoint['model_state_dict'], False)
File "/root/miniconda3/envs/blip/lib/python3.9/site-packages/torch/nn/modules/module.py", line 2152, in load_state_dict
raise RuntimeError('Error(s) in loading state_dict for {}:\n\t{}'.format(
RuntimeError: Error(s) in loading state_dict for ContextualCLIP:
size mismatch for text_encoder.model.shared.weight: copying a param with shape torch.Size([50265, 768]) from checkpoint, the shape in current model is torch.Size([50265, 1024]).
size mismatch for text_encoder.model.encoder.embed_tokens.weight: copying a param with shape torch.Size([50265, 768]) from checkpoint, the shape in current model is torch.Size([50265, 1024]).
size mismatch for text_encoder.model.encoder.embed_positions.weight: copying a param with shape torch.Size([1026, 768]) from checkpoint, the shape in current model is torch.Size([1026, 1024]).
size mismatch for text_encoder.model.encoder.layers.0.self_attn.k_proj.weight: copying a param with shape torch.Size([768, 768]) from checkpoint, the shape in current model is torch.Size([1024, 1024]).
size mismatch for text_encoder.model.encoder.layers.0.self_attn.k_proj.bias: copying a param with shape torch.Size([768]) from checkpoint, the shape in current model is torch.Size([1024]).
size mismatch for text_encoder.model.encoder.layers.0.self_attn.v_proj.weight: copying a param with shape torch.Size([768, 768]) from checkpoint, the shape in current model is torch.Size([1024, 1024]).
size mismatch for text_encoder.model.encoder.layers.0.self_attn.v_proj.bias: copying a param with shape torch.Size([768]) from checkpoint, the shape in current model is torch.Size([1024]).
size mismatch for text_encoder.model.encoder.layers.0.self_attn.q_proj.weight: copying a param with shape torch.Size([768, 768]) from checkpoint, the shape in current model is torch.Size([1024, 1024]).
size mismatch for text_encoder.model.encoder.layers.0.self_attn.q_proj.bias: copying a param with shape torch.Size([768]) from checkpoint, the shape in current model is torch.Size([1024]).
size mismatch for text_encoder.model.encoder.layers.0.self_attn.out_proj.weight: copying a param with shape torch.Size([768, 768]) from checkpoint, the shape in current model is torch.Size([1024, 1024]).
size mismatch for text_encoder.model.encoder.layers.0.self_attn.out_proj.bias: copying a param with shape torch.Size([768]) from checkpoint, the shape in current model is torch.Size([1024]).
size mismatch for text_encoder.model.encoder.layers.0.self_attn_layer_norm.weight: copying a param with shape torch.Size([768]) from checkpoint, the shape in current model is torch.Size([1024]).
size mismatch for text_encoder.model.encoder.layers.0.self_attn_layer_norm.bias: copying a param with shape torch.Size([768]) from checkpoint, the shape in current model is torch.Size([1024]).
......
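Every mismatch in the traceback is the same 768-vs-1024 width disagreement, which matches the hidden sizes of BART-base (d_model = 768) and BART-large (d_model = 1024): the checkpoint apparently holds base-size text-encoder weights while the current code instantiates a large-size one. Note also that the `False` passed positionally to `load_state_dict` is `strict=False`, which only tolerates missing or unexpected keys; shape mismatches still raise. As a minimal diagnostic sketch (not part of the NDCR repo; names here are illustrative), one can list every shape disagreement before attempting the load:

```python
# Hypothetical helper: compare parameter shapes from a checkpoint against the
# current model before calling load_state_dict. Shapes are plain tuples, e.g.
# built with {name: tuple(t.shape) for name, t in state_dict.items()}.

def shape_mismatches(ckpt_shapes, model_shapes):
    """Return {param_name: (checkpoint_shape, model_shape)} for every
    parameter present in both dicts whose shapes disagree."""
    return {
        name: (ckpt_shapes[name], model_shapes[name])
        for name in ckpt_shapes.keys() & model_shapes.keys()
        if ckpt_shapes[name] != model_shapes[name]
    }

# Shapes taken from the traceback above: the checkpoint embedding is
# 768-wide, the instantiated model expects 1024.
ckpt = {"text_encoder.model.shared.weight": (50265, 768)}
model = {"text_encoder.model.shared.weight": (50265, 1024)}
print(shape_mismatches(ckpt, model))
# → {'text_encoder.model.shared.weight': ((50265, 768), (50265, 1024))}
```

With the real files you would build both dicts from the torch state_dicts, e.g. `{k: tuple(v.shape) for k, v in checkpoint['model_state_dict'].items()}` and the same over `contextual_clip.state_dict()`; if every mismatch is 768 vs 1024, the fix is to instantiate the same BART size the checkpoint was trained with rather than to relax loading.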