joshinh/rich-feedback-reasoning
This repository contains the code and data for the AAAI workshop paper titled 'Improving Multi-Hop Reasoning in LLMs by Learning from Rich Human Feedback'
This repository contains the code and data for the AAAI workshop paper titled 'Improving Multi-Hop Reasoning in LLMs by Learning from Rich Human Feedback'