reward-model
There are 7 repositories under reward-model topic.
Westlake-AI/SemiReward
[ICLR 2024] SemiReward: A General Reward Model for Semi-supervised Learning
rochitasundar/Generative-AI-with-Large-Language-Models
This repository contains the lab work for Coursera course on "Generative AI with Large Language Models".
hlp-ai/miniChatGPT
Mini ChatGPT
taishan1994/Reward-Model-Finetuning
专门用于训练奖励模型的仓库。
techandy42/LLM_Reward_Model
Developing a LLM response ranking reward model using HFRL except it's GPT-3.5 instead of human.
jddunn/rlhf
POC library built on TextRL for easy training and usage of fine-tuned models using RLHF, a rewards model, and PPO