/rich-feedback-reasoning

This repository contains the code and data for the AAAI workshop paper titled 'Improving Multi-Hop Reasoning in LLMs by Learning from Rich Human Feedback'

Stargazers