Fine-tuning using llama-13B
Huangbukun opened this issue · 3 comments
Huangbukun commented
Hello, if I want to fine-tune using llama-13B's .pth checkpoint, what changes need to be made to the train.sh script? After fine-tuning with the llama-7B parameters, the accuracy is very low.
ikodoh commented
In our experiments, I changed --adapter_layer from 32 to 40, and you may also decrease the learning rate.
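The change above can be sketched as a launch command. This is only an illustrative example: apart from --adapter_layer, the flag names, paths, and values below are assumptions about what a typical train.sh for this repo might contain, not the repo's exact script. The key point is that llama-13B has 40 transformer layers (vs. 32 in llama-7B), so the adapter must cover 40 layers, and the learning rate is reduced.

```shell
#!/usr/bin/env bash
# Hypothetical train.sh adapted for llama-13B.
# - --adapter_layer is raised from 32 (llama-7B) to 40 (llama-13B),
#   matching the number of transformer layers in the 13B model.
# - The learning rate is lowered relative to the 7B setting, as suggested;
#   the exact value here is an assumption and may need tuning.
torchrun --nproc_per_node 8 finetuning.py \
    --llama_model_path /path/to/llama-13B \
    --adapter_layer 40 \
    --lr 6e-3 \
    --batch_size 32 \
    --epochs 5
```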
Huangbukun commented
> In our experiments, I changed --adapter_layer from 32 to 40, and you may also decrease the learning rate.

Thank you!
Huangbukun commented
> In our experiments, I changed --adapter_layer from 32 to 40, and you may also decrease the learning rate.
Hello, after I adjusted --adapter_layer to 40 and changed the learning rate to 9e-3, my accuracy only reaches 65%, below your reported result. I don't know what I did wrong.