OptimalScale/LMFlow

Hello, can LMFlow support Qwen1.5-1.8B model fine-tuning?

Closed this issue · 3 comments


Thanks for your interest in LMFlow! We are integrating that feature right now and hope to support it within 12-48 hours. Please stay tuned for our latest update 😄


Hi, we've tested on Qwen1.5-1.8B and the script works fine.

Please make sure you include --lora_target_modules q_proj, v_proj (only for Qwen models) in the finetune shell script.
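As a sketch, the invocation might look like the following. Only the --lora_target_modules value comes from this thread; the script name, model path, dataset path, and output directory are placeholder assumptions and should be adapted to your checkout of the repo.

```shell
#!/bin/bash
# Hypothetical LoRA finetuning invocation for Qwen1.5-1.8B.
# Paths and script name are assumptions; --lora_target_modules is the
# Qwen-specific flag recommended above.
./scripts/run_finetune_with_lora.sh \
  --model_name_or_path Qwen/Qwen1.5-1.8B \
  --dataset_path data/alpaca/train_conversation \
  --output_lora_path output_models/qwen1.5-1.8b-lora \
  --lora_target_modules "q_proj, v_proj"
```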

Also, we strongly recommend that you:

  1. Use a conversation dataset to finetune the model. You can either:
    • download a conversation dataset from our data server to test the workflow:
       cd data && ./download.sh alpaca && cd -
      and set the dataset path to data/alpaca/train_conversation, or
    • prepare your own conversation dataset (see here).
  2. Specify the conversation template as qwen2 for better performance.
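If you prepare your own dataset, a conversation-format file can be generated like this. The "type"/"instances"/"messages" layout follows my reading of LMFlow's conversation data format and the example content is invented, so verify the field names against the LMFlow data documentation before training:

```python
import json

# Minimal conversation-format dataset (assumed schema: a top-level "type"
# of "conversation" and a list of "instances", each holding "messages"
# with alternating user/assistant roles).
dataset = {
    "type": "conversation",
    "instances": [
        {
            "messages": [
                {"role": "user", "content": "What is LMFlow?"},
                {
                    "role": "assistant",
                    "content": "An extensible toolkit for finetuning "
                               "large language models.",
                },
            ]
        }
    ],
}

# Write the dataset so it can be passed to the finetune script via
# a --dataset_path pointing at its directory.
with open("train_conversation.json", "w") as f:
    json.dump(dataset, f, ensure_ascii=False, indent=2)
```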


Thank you very much.