Multiple ckpt after training. Which one should I use for sampling?
kasim0226 commented
Thank you for your great work.
After I executed finetune_real.sh and get_deltas.py, I got 4 different ckpt files.
- Which ckpt should I use for sampling?
- Could you explain why the model weights need to be updated with get_deltas.py? I thought the model weights were already trained with Eq. 4 in your paper.
nupurkmr9 commented
Hi, if you followed our README instructions, you can sample using delta_epoch=000004.ckpt with the command below:
python sample.py --prompt "<new1> cat playing with a ball" --delta_ckpt logs/<folder-name>/checkpoints/delta_epoch\=000004.ckpt --ckpt <pretrained-model-path>
We save checkpoints every epoch of training.
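Since there is one checkpoint per epoch, the folder can hold several delta files. If you want to pick the most recent one programmatically, a minimal sketch could look like the following. It assumes the files follow the delta_epoch=NNNNNN.ckpt naming seen in the command above; the helper name latest_delta_ckpt is hypothetical and not part of the repo.

```python
import re
from pathlib import Path
from typing import Optional

# Matches names such as "delta_epoch=000004.ckpt".
# ASSUMPTION: all delta checkpoints follow this naming pattern.
_DELTA_RE = re.compile(r"^delta_epoch=(\d+)\.ckpt$")


def latest_delta_ckpt(checkpoint_dir: str) -> Optional[Path]:
    """Return the delta checkpoint with the highest epoch number, or None.

    Hypothetical helper for illustration; it only inspects file names
    and never opens the checkpoints themselves.
    """
    best_epoch = -1
    best_path = None
    for path in Path(checkpoint_dir).iterdir():
        match = _DELTA_RE.match(path.name)
        if match is None:
            continue  # skip full checkpoints and unrelated files
        epoch = int(match.group(1))
        if epoch > best_epoch:
            best_epoch, best_path = epoch, path
    return best_path
```

The returned path can then be passed to --delta_ckpt. Note that this only selects the latest epoch; it does not say anything about which epoch gives the best samples.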
To answer your second question, get_deltas.py extracts and saves only the subset of parameters that were fine-tuned, rather than the full model weights. This reduces the storage requirement to 75 MB, compared with 4 GB for the original model.
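To illustrate the idea (this is not the actual get_deltas.py code), here is a rough sketch using plain dictionaries in place of real checkpoints. The name fragments in TRAINED_KEY_HINTS and both function names are assumptions made up for the example; the real script may select parameters differently.

```python
from typing import Any, Dict, Iterable

# ASSUMPTION: illustrative name fragments for the fine-tuned parameters.
# These are placeholders, not necessarily what get_deltas.py matches.
TRAINED_KEY_HINTS = ("attn2.to_k", "attn2.to_v")


def extract_deltas(
    state_dict: Dict[str, Any],
    hints: Iterable[str] = TRAINED_KEY_HINTS,
) -> Dict[str, Any]:
    """Keep only the entries whose names contain one of the hints."""
    hints = tuple(hints)
    return {
        name: value
        for name, value in state_dict.items()
        if any(hint in name for hint in hints)
    }


def apply_deltas(
    pretrained: Dict[str, Any],
    deltas: Dict[str, Any],
) -> Dict[str, Any]:
    """Overlay the saved subset on top of the pretrained weights."""
    merged = dict(pretrained)  # copy so the input is left untouched
    merged.update(deltas)
    return merged
```

This is also why the sampling command takes both --ckpt and --delta_ckpt: the small delta file only makes sense when combined with the pretrained model it was fine-tuned from.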
I hope this clarifies your doubts. Thanks.
kasim0226 commented
Thank you for the reply.