joshinh/rich-feedback-reasoning

This repository contains the code and data for the AAAI workshop paper titled 'Improving Multi-Hop Reasoning in LLMs by Learning from Rich Human Feedback'

Stargazers

kevon217
Publicis Sapient
Naman-ntc
UC Berkeley