dense-retrieval

There are 50 repositories under the dense-retrieval topic.

  • PaddlePaddle/RocketQA

    🚀 RocketQA, dense retrieval for information retrieval and question answering, including both Chinese and English state-of-the-art models.

    Language: Python
  • ict-bigdatalab/awesome-pretrained-models-for-information-retrieval

    A curated list of awesome papers related to pre-trained models for information retrieval (a.k.a., pretraining for IR).

  • nomic-ai/contrastors

    Train Models Contrastively in PyTorch

    Language: Python
  • texttron/tevatron

    Tevatron - A flexible toolkit for neural retrieval research and development.

    Language: Python
  • SeanLee97/AnglE

    Train and Infer Powerful Sentence Embeddings with AnglE | 🔥 SOTA on STS and MTEB Leaderboard

    Language: Python
  • texttron/hyde

    HyDE: Precise Zero-Shot Dense Retrieval without Relevance Labels

    Language: Jupyter Notebook
  • caiyinqiong/Semantic-Retrieval-Models

    A curated list of awesome papers for Semantic Retrieval (TOIS Accepted: Semantic Models for the First-stage Retrieval: A Comprehensive Review).

  • luyug/Condenser

    EMNLP 2021 - Pre-training architectures for dense retrieval

    Language: Python
  • AmenRa/retriv

    A Python Search Engine for Humans 🥸

    Language: Python
  • Alibaba-NLP/Multi-CPR

    [SIGIR 2022] Multi-CPR: A Multi Domain Chinese Dataset for Passage Retrieval

    Language: Python
  • luyug/GC-DPR

    Train Dense Passage Retriever (DPR) with a single GPU

    Language: Python
  • jingtaozhan/RepCONC

    WSDM'22 Best Paper: Learning Discrete Representations via Constrained Clustering for Effective and Efficient Dense Retrieval

    Language: Python
  • microsoft/SimXNS

    SimXNS is a research project for information retrieval. This repo contains official implementations by MSRA NLC team.

    Language: Python
  • ml4bio/Dense-Homolog-Retrieval

    Nature Biotechnology: Ultra-fast, sensitive detection of protein remote homologs using deep dense retrieval

    Language: Python
  • DevSinghSachan/art

    Code and models for the paper "Questions Are All You Need to Train a Dense Passage Retriever (TACL 2023)"

    Language: Python
  • jingtaozhan/disentangled-retriever

    An easy-to-use Python toolkit for flexibly adapting various neural ranking models to any target domain.

    Language: Python
  • sebastian-hofstaetter/tas-balanced-dense-retrieval

    SIGIR 2021: Efficiently Teaching an Effective Dense Retriever with Balanced Topic Aware Sampling

    Language: Jupyter Notebook
  • McGill-NLP/topiocqa

    Code and data for reproducing baselines for TopiOCQA, an open-domain conversational question-answering dataset

    Language: Python
  • jingtaozhan/JPQ

    CIKM'21: JPQ substantially improves the efficiency of Dense Retrieval with 30x compression ratio, 10x CPU speedup and 2x GPU speedup.

    Language: Python
  • OpenMatch/COCO-DR

    [EMNLP 2022] This is the code repo for our EMNLP'22 paper "COCO-DR: Combating Distribution Shifts in Zero-Shot Dense Retrieval with Contrastive and Distributionally Robust Learning".

    Language: Python
  • TryMoreGroup/TryMore-PaperReading

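    The TryMore study group (揣摩研习社) follows cutting-edge techniques in natural language processing and information retrieval, explains popular research papers, shares practical research tools, and explores the academic and applied value hidden beneath the tip of the AI iceberg.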

  • voidism/EAR

    Code for the ACL 2023 long paper - Expand, Rerank, and Retrieve: Query Reranking for Open-Domain Question Answering

    Language: Python
  • PrithivirajDamodaran/SPLADERunner

    Lightweight wrapper for the independent implementation of SPLADE++ models for search and retrieval pipelines. Models and library created by Prithivi Da. For PRs and collaboration, check out the README.

    Language: Python
  • Albert-Ma/COSTA

    SIGIR'2022, Pre-train a Discriminative Text Encoder for Dense Retrieval via Contrastive Span Prediction

    Language: Python
  • FreedomIntelligence/DPTDR

    Code for COLING22 paper, DPTDR: Deep Prompt Tuning for Dense Passage Retrieval

    Language: Python
  • drogozhang/LED

    Source code of paper 'LED: Lexicon-Enlightened Dense Retriever for Large-Scale Retrieval' (WWW 2023)

    Language: Python
  • MiuLab/PairDistill

    Source code of our paper "PairDistill: Pairwise Relevance Distillation for Dense Retrieval", EMNLP 2024 Main.

    Language: Jupyter Notebook
  • maastrichtlawtech/gdsr

    🕸️ A graph-augmented dense statute retriever. (EACL 2023)

    Language: Python
  • yueyu1030/ReGen

    [ACL'23 Findings] This is the code repo for our ACL'23 Findings paper "ReGen: Zero-Shot Text Classification via Training Data Generation with Progressive Dense Retrieval".

    Language: Python
  • OpenMatch/ANCE-Tele

    Code and data of the EMNLP 2022 Main Conference paper "Reduce Catastrophic Forgetting of Dense Retrieval Training with Teleportation Negatives".

    Language: Python
  • jordane95/dual-cross-encoder

    Dual Cross Encoder for Dense Retrieval

    Language: Python
  • ArslanKAS/Large-Language-Models-with-Semantic-Search

    Explore the path from keyword search to dense retrieval and reranking, which inject the intelligence of LLMs into your search system, making it faster and more effective.

    Language: Jupyter Notebook
  • cuongqn/recipe-qa

    LLM-powered QA for food and recipes

    Language: Python
  • dcarpintero/wikisearch

    Multilingual Semantic Search with Reranking on a prepared large vectorized dataset comprising 10 million Wikipedia documents. It supports dense retrieval, keyword search, and hybrid search.

    Language: Python
  • FreedomIntelligence/REMOP

    Code for the paper: Modular Retrieval for Generalization and Interpretation.

    Language: Python
  • amzn/extremely-efficient-query-encoder

    Efficient query encoding for dense retrieval

    Language: Python