/awesome-dense-text-retrieval

πŸ“– A curated list of Dense Text Retrieval resources from all around the web.

MIT LicenseMIT

Awesome License

Dense Text Retrieval

Dense Text Retrieval (DTR) aims to overcome the limitations of sparse retrieval by matching in a continuous representation space learned via neural networks. [Learn more]

πŸ—‚ Datasets

Dataset Domain Language #Queries #Documents
πŸ“„ MS-MARCO General πŸ‡¬πŸ‡§ 1,010,900+ 8,841,800+
πŸ“„ Natural Questions General πŸ‡¬πŸ‡§ 91,500+
πŸ“„ COLIEE-2020 Law πŸ‡¬πŸ‡§, πŸ‡―πŸ‡΅ 808 768
πŸ“„ BSARD Law πŸ‡«πŸ‡· 1,108 22,633

πŸ”₯ Models

Model Language Size Year
πŸ“„ DPR πŸ‡¬πŸ‡§ 2020
πŸ“„ ORQA πŸ‡¬πŸ‡§ 2019
πŸ“„ DrQA πŸ‡¬πŸ‡§ 2017

πŸ“š Courses

πŸŽ™ Talks

  • 2021-09 Recent Developments in Neural Search, N. Reimers. [slides][video]
  • 2021-09 Training State-of-the-Art Neural Search Models, N. Reimers. [slides][video]

πŸ—“ Conferences & Workshops

  • The Text Retrieval Conference (TREC) Deep Learning Track. [link]
  • The Special Interest Group on Information Retrieval (SIGIR). [link]

☁️ Tools

  • SentenceTransformers: a Python framework for state-of-the-art sentence, text and image embeddings. [link]
  • Jina AI: a cloud-native neural search framework for any kind of data. [link]
  • Haystack: an end-to-end neural search framework for textual data. [link]