基于大模型ChatGLM,微调方式为LORA,集SFT、RM、PPO算法为一体项目
Primary LanguagePythonApache License 2.0Apache-2.0
No issues in this repository yet.