dense-retrieval
There are 50 repositories under dense-retrieval topic.
PaddlePaddle/RocketQA
🚀 RocketQA, dense retrieval for information retrieval and question answering, including both Chinese and English state-of-the-art models.
ict-bigdatalab/awesome-pretrained-models-for-information-retrieval
A curated list of awesome papers related to pre-trained models for information retrieval (a.k.a., pretraining for IR).
nomic-ai/contrastors
Train Models Contrastively in Pytorch
texttron/tevatron
Tevatron - A flexible toolkit for neural retrieval research and development.
SeanLee97/AnglE
Train and Infer Powerful Sentence Embeddings with AnglE | 🔥 SOTA on STS and MTEB Leaderboard
texttron/hyde
HyDE: Precise Zero-Shot Dense Retrieval without Relevance Labels
caiyinqiong/Semantic-Retrieval-Models
A curated list of awesome papers for Semantic Retrieval (TOIS Accepted: Semantic Models for the First-stage Retrieval: A Comprehensive Review).
luyug/Condenser
EMNLP 2021 - Pre-training architectures for dense retrieval
AmenRa/retriv
A Python Search Engine for Humans 🥸
Alibaba-NLP/Multi-CPR
[SIGIR 2022] Multi-CPR: A Multi Domain Chinese Dataset for Passage Retrieval
luyug/GC-DPR
Train Dense Passage Retriever (DPR) with a single GPU
jingtaozhan/RepCONC
WSDM'22 Best Paper: Learning Discrete Representations via Constrained Clustering for Effective and Efficient Dense Retrieval
microsoft/SimXNS
SimXNS is a research project for information retrieval. This repo contains official implementations by MSRA NLC team.
ml4bio/Dense-Homolog-Retrieval
Nature Biotechnology: Ultra-fast, sensitive detection of protein remote homologs using deep dense retrieval
DevSinghSachan/art
Code and models for the paper "Questions Are All You Need to Train a Dense Passage Retriever (TACL 2023)"
jingtaozhan/disentangled-retriever
An easy-to-use python toolkit for flexibly adapting various neural ranking models to any target domain.
sebastian-hofstaetter/tas-balanced-dense-retrieval
SIGIR 2021: Efficiently Teaching an Effective Dense Retriever with Balanced Topic Aware Sampling
McGill-NLP/topiocqa
Code and data for reproducing baselines for TopiOCQA, an open-domain conversational question-answering dataset
jingtaozhan/JPQ
CIKM'21: JPQ substantially improves the efficiency of Dense Retrieval with 30x compression ratio, 10x CPU speedup and 2x GPU speedup.
OpenMatch/COCO-DR
[EMNLP 2022] This is the code repo for our EMNLP‘22 paper "COCO-DR: Combating Distribution Shifts in Zero-Shot Dense Retrieval with Contrastive and Distributionally Robust Learning".
TryMoreGroup/TryMore-PaperReading
揣摩研习社关注自然语言和信息检索前沿技术,解读热门科技论文,分享实用科研工具,挖掘人工智能冰山之下的学术和应用价值!
voidism/EAR
Code for the ACL 2023 long paper - Expand, Rerank, and Retrieve: Query Reranking for Open-Domain Question Answering
PrithivirajDamodaran/SPLADERunner
Lite weight wrapper for the independent implementation of SPLADE++ models for search & retrieval pipelines. Models and Library created by Prithivi Da, For PRs and Collaboration checkout the readme.
Albert-Ma/COSTA
SIGIR'2022, Pre-train a Discriminative Text Encoder for Dense Retrieval via Contrastive Span Prediction
FreedomIntelligence/DPTDR
Code for COLING22 paper, DPTDR: Deep Prompt Tuning for Dense Passage Retrieval
drogozhang/LED
Source code of paper 'LED: Lexicon-Enlightened Dense Retriever for Large-Scale Retrieval' (WWW 2023)
MiuLab/PairDistill
Source code of our paper "PairDistill: Pairwise Relevance Distillation for Dense Retrieval", EMNLP 2024 Main.
maastrichtlawtech/gdsr
🕸️ A graph-augmented dense statute retriever. (EACL 2023)
yueyu1030/ReGen
[ACL'23 Findings] This is the code repo for our ACL'23 Findings paper "ReGen: Zero-Shot Text Classification via Training Data Generation with Progressive Dense Retrieval".
OpenMatch/ANCE-Tele
Code and data of the EMNLP 2022 Main Conference paper "Reduce Catastrophic Forgetting of Dense Retrieval Training with Teleportation Negatives".
jordane95/dual-cross-encoder
Dual Cross Encoder for Dense Retrieval
ArslanKAS/Large-Language-Models-with-Semantic-Search
Explore from keyword search to dense retrieval and reranking, which injects the intelligence of LLMs into your search system, making it faster and more effective.
cuongqn/recipe-qa
LLM-powered QA for food and recipes
dcarpintero/wikisearch
Multilingual Semantic Search with Reranking on a prepared large vectorized dataset comprising 10 million Wikipedia documents. It supports dense retrieval, keyword search, and hybrid search.
FreedomIntelligence/REMOP
Code for the paper: Modular Retrieval for Generalization and Interpretation.
amzn/extremely-efficient-query-encoder
efficient query encoding for dense retrieval