This repository contains the dataset for the paper, where we present a bilingual query-based biomedical information retrieval task across two vastly different genres – Chinese newswire and English research literature. For this task, we developed a specialized IR dataset. It was constructed in two stages: first, we created a gold-standard dataset, which was then expanded into a silver-standard corpus.
- For gold-standard dataset, see Gold-standard dataset
- For silver-standard dataset, see Silver-standard dataset