reward-modeling
There are 6 repositories under reward-modeling topic.
sileod/tasksource
Datasets collection and preprocessings framework for NLP extreme multitask learning
YangLing0818/IterComp
IterComp: Iterative Composition-Aware Feedback Learning from Model Gallery for Text-to-Image Generation
VectorInstitute/vector-inference
Efficient LLM inference on Slurm clusters using vLLM.
quanshr/DMoERM
[ACL2024 Findings]DMoERM: Recipes of Mixture-of-Experts for Effective Reward Modeling
allenai/hybrid-preferences
Learning to route instances for Human vs AI Feedback
MiuLab/DogeRM
The code used in the paper "DogeRM: Equipping Reward Models with Domain Knowledge through Model Merging"