HybridRetriever raise KeyError: -1 if the len of doc less than 1_000
tshu-w opened this issue · 1 comments
tshu-w commented
The cutoff of msearch
for HybridRetriever
is hardcode to 1_000, which makes map_internal_ids_to_original_ids
raise KeyError
when doc len less than 1_000
retriv/retriv/hybrid_retriever.py
Lines 254 to 255 in c9baa01
Thus, map_internal_ids_to_original_ids
should be:
def map_internal_ids_to_original_ids(self, doc_ids: Iterable) -> List[str]:
return [self.id_mapping[doc_id] for doc_id in doc_ids if doc_id != -1]
AmenRa commented
Thanks for reporting the bug!
I'll fix it soon.