Detail of TFI

Question

Detail of TFI

TungChintao opened this issue a year ago · 2 comments

TungChintao commented a year ago

Hello, I'm interested in your work. But here are two questions:

The article does not specify the number of fine-tune iterations for TFI.
I noticed that the finetune_reference only one kl variable is calculated, but this kl variable is not associated with the weight of self.reference_layer, how can I use kl_loss to update reference_layer?
Could you please provide more details? Thank you so much!

Answer 1 · 2023-09-19T16:36:45.000Z

A few fine-tuning step is enough. We fine-tuned around 50 steps.
You can set the reference_layer as the unique training module via your optimizer.

Answer 2 · 2023-10-28T03:42:52.000Z

Hi @TungChintao , did you manage to get the finetune code running? I found PATNetwork.Transformation_Feature() is not used in PATNetwork.finetune_reference()