RuntimeError in loading state_dict when calling CLVPMetric(device='cuda')

Question

RuntimeError in loading state_dict when calling CLVPMetric(device='cuda')

Opened this issue a year ago · 5 comments

Error details:

RuntimeError: Error(s) in loading state_dict for CLVP:
   Missing key(s) in state_dict: "text_pos_emb.weight", "text_transformer.layers.layers.0.0.scale" ....
   .....  mismatch for to_speech_latent.weight: copying a param with shape torch.Size([1024, 1024]) from checkpoint, the shape in current model is torch.Size([512, 512]).

Answer 1 · 2023-08-28T14:34:05.000Z

Ah apologies - this repo uses the original CLVP: https://huggingface.co/jbetker/tortoise-tts-v2/blob/main/.models/clvp.pth

It would be better if it were updated to use the bigger one.

Answer 2 · 2023-08-29T04:47:26.000Z

Thanks for getting back, I am still getting the same error with the older clvp.

Answer 3 · 2023-08-30T01:30:57.000Z

That doesn't seem right; the error should at least be different (CLVP1 has d_model=512, CLVP2 has d_model=1024)

…

On Mon, Aug 28, 2023 at 10:47 PM Rishav Kumar ***@***.***> wrote: Thanks for getting back, I am still getting the same error with the older clvp. — Reply to this email directly, view it on GitHub <#6 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AAGLMOSFX3PZH6WLJRDXTKDXXVX6RANCNFSM6AAAAAA4BJRRAI> . You are receiving this because you commented.Message ID: ***@***.***>

-- - James Betker

Answer 4 · 2023-08-30T03:45:48.000Z

Yeah, you are right, the current error isn't exactly same as previous, I meant that it is still about the size mismatch
RuntimeError: Error(s) in loading state_dict for CLVP:

Missing key(s) in state_dict: "text_pos_emb.weight", "text_transformer.layers.layers.0.0.scale", "text_transformer.layers.layers.0.0.fn.norm.weight",  .......
......size mismatch for text_emb.weight: copying a param with shape torch.Size([256, 512]) from checkpoint, the shape in current model is torch.Size([148, 512]).

Answer 5 · 2024-10-21T16:18:16.000Z

Has this issue been solved? I get the same mistake as the previous commenter