spacemanidol/MSMARCO
Utilities, Baselines, Statistics and Descriptions Related to the MSMARCO DATASET
Python · MIT
Issues
Regarding the Test Set for Q&A
#38 opened by tahmedge - 2
dev_as_references.json is from V1?
#37 opened by deeplearningmachine - 1
Passage IDs in Qna Dataset
#34 opened by spacemanidol - 4
Partially duplicated passages extracted
#33 opened by daltonj - 6
[encoding,Â] top1000.dev.tsv
#26 opened by Albert-Ma - 1
Different number of queries in collectionandqueries.tar.gz and top1000.dev.tar.gz
#29 opened by rodrigonogueira4 - 1
KeyError for converttowellformed.py
#28 opened - 1
Need more explanation about Reranking Dataset
#27 opened by Planck35 - 1
encoding, top1000.train, qrels.train
#25 opened by linxihui - 3
Collection paragraph metadata
#19 opened by daltonj - 1
some bugs about the code
#2 opened by javacjh - 1
there are many bugs in the evaluation script
#3 opened by xinyadu - 1
Problem with Utils/tojson.py
#6 opened by xycforgithub - 1
No module named mrcqa.modules
#8 opened by wshinigamic - 1
[docs] Data is not JSONL
#9 opened by juharris - 5
How were passage reranking triples generated?
#14 opened by chsasank - 3
Cannot find the top1000.eval for testing
#17 opened by Qiaoyf96 - 2
BM25 relevance values for top 1000 eval/dev?
#21 opened by amirj - 2
Training data with QID and PID
#22 opened by QingyaoAi - 1