Multiple ckpt after training. Which one should I use for sampling?

Question

Closed this issue 2 years ago · 2 comments

Thank you for your great work.

After I executed finetune_real.sh and get_deltas.py, I got 4 different ckpt files.

Which ckpt should I use for sampling?
Could you also explain why the model weights need to be updated with get_deltas.py? I thought the model weights were already trained with Eq. 4 in your paper.

Answer 1 · 2023-04-21T18:30:41.000Z

Hi, if you followed our README instructions, you can sample using delta_epoch=000004.ckpt with the command below:

python sample.py --prompt "<new1> cat playing with a ball" --delta_ckpt logs/<folder-name>/checkpoints/delta_epoch\=000004.ckpt --ckpt <pretrained-model-path>

We save checkpoints every epoch of training.
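Since a checkpoint is written every epoch, it can help to pick the delta checkpoint programmatically. The following is a minimal sketch, not part of the repository: it assumes the delta files follow the delta_epoch=<number>.ckpt naming seen in the command above, and the helper name latest_delta_ckpt and the example listing are hypothetical.

```python
import re
from typing import Iterable, Optional

# Filename pattern taken from the command above, e.g. "delta_epoch=000004.ckpt".
_DELTA_RE = re.compile(r"^delta_epoch=(\d+)\.ckpt$")


def latest_delta_ckpt(filenames: Iterable[str]) -> Optional[str]:
    """Return the delta checkpoint with the highest epoch number, or None."""
    best_name, best_epoch = None, -1
    for name in filenames:
        match = _DELTA_RE.match(name)
        if match and int(match.group(1)) > best_epoch:
            best_name, best_epoch = name, int(match.group(1))
    return best_name


# Hypothetical directory listing; the real names depend on your run.
example = [
    "epoch=000004.ckpt",
    "delta_epoch=000002.ckpt",
    "delta_epoch=000004.ckpt",
    "last.ckpt",
]
print(latest_delta_ckpt(example))
```

The result can then be passed to --delta_ckpt. Whether the last epoch gives the best samples is an assumption; comparing samples from a few epochs is a reasonable check.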
To answer your second question, get_deltas.py does not change the trained weights. It extracts and saves only the subset of parameters that were fine-tuned instead of the full model weights, reducing storage requirements from the 4 GB of the original model to 75 MB.
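The idea of saving only the fine-tuned subset can be sketched as a key filter over a state dict. This is an illustration under stated assumptions, not the actual get_deltas.py: the function name extract_deltas, the toy key names, and the substring patterns are all hypothetical, and the real script may select different parameters.

```python
from typing import Any, Dict, Sequence


def extract_deltas(state_dict: Dict[str, Any], patterns: Sequence[str]) -> Dict[str, Any]:
    """Keep only the entries whose key contains one of the given substrings."""
    return {k: v for k, v in state_dict.items() if any(p in k for p in patterns)}


# Toy stand-in for a model state dict; the values would be tensors in practice.
full = {
    "block.attn2.to_k.weight": [0.1, 0.2],
    "block.attn2.to_v.weight": [0.3, 0.4],
    "block.attn1.to_q.weight": [0.5, 0.6],
    "block.conv.weight": [0.7, 0.8],
}

# Assumed patterns, chosen for illustration only.
deltas = extract_deltas(full, patterns=("attn2.to_k", "attn2.to_v"))
print(sorted(deltas))
```

At sampling time the small dictionary is loaded on top of the pretrained weights, which is why the command takes both --delta_ckpt and --ckpt.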

I hope this clarifies your doubts. Thanks.

Answer 2 · 2023-04-28T08:59:10.000Z

Thank you for the reply.