Pinned Repositories
ChatGLM-RLHF
对ChatGLM直接使用RLHF提升或降低目标输出概率|Modify ChatGLM output with only RLHF
DecitionTree
decitTree
LLaMA-Efficient-Tuning
Fine-tuning LLaMA with PEFT (PT+SFT+RLHF)
lmft
Language Model Fine-Tuning, for ChatGLM, BELLE, LLaMA fine-tuning.
Open-Llama
The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.
probable-couscous
DeepSeek-Coder-V2
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence
llmeval-3
中文大语言模型评测第三期
AlignBench
大模型多维度中文对齐评测基准 (ACL 2024)
Chenzongchao's Repositories
Chenzongchao/ChatGLM-RLHF
对ChatGLM直接使用RLHF提升或降低目标输出概率|Modify ChatGLM output with only RLHF
Chenzongchao/DecitionTree
Chenzongchao/decitTree
Chenzongchao/LLaMA-Efficient-Tuning
Fine-tuning LLaMA with PEFT (PT+SFT+RLHF)
Chenzongchao/lmft
Language Model Fine-Tuning, for ChatGLM, BELLE, LLaMA fine-tuning.
Chenzongchao/Open-Llama
The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.
Chenzongchao/probable-couscous