GPT4animal

GPT4animal's Stars

huggingface/trl
Train transformer language models with reinforcement learning.
Language: Python · 9.2k 73 1.1k 1.1k
OpenLLMAI/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)
Language: Python · 1.8k 21 179168
facebookresearch/SEAL
Search Engines with Autoregressive Language models
Language: Python · 275 7 1324
Lichang-Chen/InstructZero
Official implementation of InstructZero, the first framework to optimize bad prompts for ChatGPT (API LLMs) and obtain good prompts!
Language: Python · 171 8 814
gpt4life/alpagasus
Unofficial implementation of AlpaGasus
Language: Python · 83 3 66
gauss5930/AlpaGasus2-QLoRA
This is AlpaGasus2-QLoRA based on LLaMA2 with AlpaGasus mechanism using QLoRA!
Language: Python · 14 1 22