ChanLIM's Stars
milvus-io/milvus-lite
A lightweight version of Milvus
FlagOpen/FlagEmbedding
Retrieval and Retrieval-augmented LLMs
luyug/GradCache
Run Effective Large Batch Contrastive Learning Beyond GPU/TPU Memory Constraint
NVIDIA/TensorRT
NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.
amzn/extremely-efficient-query-encoder
Efficient query encoding for dense retrieval
irgroup/repro_eval
A Python Interface to Reproducibility Measures of System-Oriented IR Experiments
changyaochen/rbo
Implementation of Rank-biased Overlap
naklecha/llama3-from-scratch
Llama 3 implementation, one matrix multiplication at a time
ielab/PromptReps
Prompting Large Language Models to Generate Dense and Sparse Representations for Zero-Shot Document Retrieval
microsoft/MS-MARCO-Web-Search
A large-scale information-rich web dataset, featuring millions of real clicked query-document labels
eunseongc/SpaDE
Official implementation of SpaDE (CIKM'22)
goharbor/harbor
An open source trusted cloud native registry project that stores, signs, and scans content.
princeton-nlp/EntityQuestions
EMNLP'2021: Simple Entity-centric Questions Challenge Dense Retrievers https://arxiv.org/abs/2109.08535
bupt-ai-cz/PGDF
Sample Prior Guided Robust Model Learning to Suppress Noisy Labels
sebastian-hofstaetter/matchmaker
Training & evaluation library for text-based neural re-ranking and dense retrieval models built with PyTorch
thunlp/SOS4NLP
Survey of Surveys for Natural Language Processing (SOS4NLP)
jin530/SWalk
This is the official code for the WSDM 2022 paper: 'S-Walk: Accurate and Scalable Session-based Recommendation with Random Walks'.
ganeshjawahar/interpret_bert
Interpreting Bidirectional Encoder Representations from Transformers (BERT)
luyug/COIL
NAACL2021 - COIL Contextualized Lexical Retriever
Narabzad/Retrieval-Strategy-Selection
allenai/ir_datasets
Provides a common interface to many IR ranking datasets.
castorini/anserini
Anserini is a Lucene toolkit for reproducible information retrieval research
terrier-org/pyterrier
A Python framework for performing information retrieval experiments, building on http://terrier.org/
terrierteam/pyterrier_colbert
caiyinqiong/Semantic-Retrieval-Models
A curated list of awesome papers for Semantic Retrieval (TOIS Accepted: Semantic Models for the First-stage Retrieval: A Comprehensive Review).
maygodwithu/TRMD
castorini/pyserini
Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.
castorini/duobert
Multi-stage passage ranking: monoBERT + duoBERT
sebastian-hofstaetter/teaching
Open-Source Information Retrieval Courses @ TU Wien
stanford-futuredata/ColBERT
ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)