qianshuang/Comm100_FT_Wrapup

Python

训练方式——原生

CUDA_VISIBLE_DEVICES=0 python3 finetune.py # device_map="auto"指定无效

训练方式——LLaMA-Factory

git clone --depth 1 https://github.com/hiyouga/LLaMA-Factory.git && cd LLaMA-Factory
将训练数据放置于LLaMA-Factory/data目录下
在data/dataset_info.json中注册训练数据
CUDA_VISIBLE_DEVICES=0 llamafactory-cli train config/llama3_lora_dpo.yaml
llamafactory-cli export config/llama3_merge_lora.yaml