yixuantt/MultiHop-RAG

Repository for "MultiHop-RAG: A Dataset for Evaluating Retrieval-Augmented Generation Across Documents" (COLM 2024)

Python

Issues

Make public evaluation code
#4 opened 5 months ago by chubzchubz97
5
Response Accuracy for Table 6
#8 opened 6 months ago by Jenna-Che
1
will you make the code of constructing the dataset avaliable?
#7 opened 6 months ago by Zhouziyi828
1
What is the ground-truth evidence used for "ground-truth evidence" results in Table 6?
#6 opened 7 months ago by timchen0618
5
End2End Metric Need
#5 opened 8 months ago by HitAgain
1
How to do response evaluation
#3 opened 9 months ago by felicitywang1
2
missing evaluation metrics utils
#2 opened 9 months ago by hatianzhang
1