adobe-research/custom-diffusion

Multiple ckpt after training. Which one should I use for sampling?

Closed this issue · 2 comments

Thank you for your great work.

After I executed finetune_real.sh and get_deltas.py, I got four different checkpoints.

  1. Which ckpt should I use for sampling?
  2. Could you explain why the model weights need to be updated with get_deltas.py? I thought the model weights were already trained well with Eq. 4 in your paper.

Hi, if you followed our README instructions, you can sample using delta_epoch=000004.ckpt with the command below:

python sample.py --prompt "<new1> cat playing with a ball" --delta_ckpt logs/<folder-name>/checkpoints/delta_epoch\=000004.ckpt --ckpt <pretrained-model-path>

We save checkpoints every epoch of training.
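Since one checkpoint is written per epoch, the folder ends up with several files. As a rough sketch (not part of the repository), the delta checkpoint with the highest epoch number can be picked by parsing file names of the form `delta_epoch=000004.ckpt` shown in the command above. The helper name `latest_delta_ckpt` is hypothetical, and the sketch assumes the latest epoch is the one you want, which may not hold for every run.

```python
import re

# Matches names such as "delta_epoch=000004.ckpt" (the pattern seen in the command above).
DELTA_NAME = re.compile(r"delta_epoch=(\d+)\.ckpt$")


def latest_delta_ckpt(names):
    """Return the delta checkpoint name with the highest epoch number, or None.

    Hypothetical helper: `names` is any iterable of file names or paths.
    """
    best = None
    best_epoch = -1
    for name in names:
        match = DELTA_NAME.search(str(name))
        if match and int(match.group(1)) > best_epoch:
            best_epoch = int(match.group(1))
            best = name
    return best


# Example with made-up file names; only the delta_epoch=... pattern comes from the thread.
files = [
    "epoch=000004.ckpt",
    "delta_epoch=000002.ckpt",
    "delta_epoch=000004.ckpt",
    "last.ckpt",
]
chosen = latest_delta_ckpt(files)
print(chosen)
```

The chosen name would then be passed to `sample.py` through `--delta_ckpt`, as in the command above.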
To answer your second question, get_deltas.py does not update the weights. It extracts and saves only the subset of parameters that were fine-tuned, rather than the full model weights, which reduces the storage requirement to about 75 MB instead of the 4 GB of the original model.
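To illustrate the idea of keeping only the fine-tuned subset, here is a minimal sketch that filters a state dict by parameter name. It is not the actual get_deltas.py: the function name `extract_deltas`, the name patterns (`attn2.to_k`, `attn2.to_v`), and the toy parameter names and values are all assumptions made for illustration.

```python
def extract_deltas(state_dict, patterns=("attn2.to_k", "attn2.to_v")):
    """Keep only the entries whose names contain one of `patterns`.

    Hypothetical helper: the default patterns are an assumption about which
    parameters were fine-tuned, not a list taken from the repository.
    """
    return {
        name: value
        for name, value in state_dict.items()
        if any(pattern in name for pattern in patterns)
    }


# Toy stand-in for a full checkpoint: names and values are made up.
full = {
    "model.diffusion_model.input_blocks.1.1.transformer_blocks.0.attn2.to_k.weight": [0.1, 0.2],
    "model.diffusion_model.input_blocks.1.1.transformer_blocks.0.attn2.to_v.weight": [0.3, 0.4],
    "model.diffusion_model.input_blocks.1.1.transformer_blocks.0.attn1.to_q.weight": [0.5, 0.6],
    "first_stage_model.encoder.conv_in.weight": [0.7, 0.8],
}

deltas = extract_deltas(full)
print(f"kept {len(deltas)} of {len(full)} entries")
```

With real checkpoints, the state dict would typically come from `torch.load(path, map_location="cpu")` and the filtered dict would be written back with `torch.save`. At sampling time the small file is combined with the pretrained model given by `--ckpt`.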

I hope this clarifies your doubts. Thanks.

Thank you for the reply.