yixuantt/MultiHop-RAG
Repository for "MultiHop-RAG: A Dataset for Evaluating Retrieval-Augmented Generation Across Documents" (COLM 2024)
Python
Issues
- 5
Make public evaluation code
#4 opened by chubzchubz97 - 1
Response Accuracy for Table 6
#8 opened by Jenna-Che - 1
- 5
What is the ground-truth evidence used for "ground-truth evidence" results in Table 6?
#6 opened by timchen0618 - 1
End2End Metric Need
#5 opened by HitAgain - 2
How to do response evaluation
#3 opened by felicitywang1 - 1
missing evaluation metrics utils
#2 opened by hatianzhang