Can I use train.py on CodeLlama-13b or CodeLlama-34b?

Question

Can I use train.py on CodeLlama-13b or CodeLlama-34b?

luo647 opened this issue a year ago · 4 comments

Answer 1 · 2023-09-21T06:11:38.000Z

I also have the same question.

Answer 2 · 2023-09-30T19:54:29.000Z

I haven't personally tried it, but from what I can see in the code, you just need to add the other models to BASE_MODELS in common.py and invoke train.py with --base set to the new key you added.
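
For illustration, here is a minimal sketch of that change. It assumes BASE_MODELS is a plain dict mapping a short key to a Hugging Face model ID and that --base is parsed with argparse; the actual layout of common.py and train.py may differ. The key names, the existing 7b entry, and the resolve_base helper are all hypothetical.

```python
import argparse

# Hypothetical stand-in for the BASE_MODELS table in common.py.
# The real structure and existing keys may differ.
BASE_MODELS = {
    "codellama-7b": "codellama/CodeLlama-7b-hf",  # assumed existing entry
}

# Register the larger checkpoints under new keys (key names are illustrative).
BASE_MODELS.update({
    "codellama-13b": "codellama/CodeLlama-13b-hf",
    "codellama-34b": "codellama/CodeLlama-34b-hf",
})


def resolve_base(argv):
    """Parse --base and return the model ID that the key maps to."""
    parser = argparse.ArgumentParser()
    # Restricting choices makes an unknown key fail early with a clear error.
    parser.add_argument("--base", choices=sorted(BASE_MODELS), required=True)
    args = parser.parse_args(argv)
    return BASE_MODELS[args.base]


if __name__ == "__main__":
    print(resolve_base(["--base", "codellama-34b"]))
```

With entries like these, the invocation would be something like `python train.py --base codellama-34b`, assuming train.py looks the key up in BASE_MODELS.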

Answer 3 · 2023-11-13T01:11:04.000Z

I have the same question. Also, what hardware (minimum GPU RAM) is required to fine-tune CodeLlama-34b?

Answer 4 · 2024-03-11T20:42:48.000Z

We now ship this with several example configuration files demonstrating how to use different models!