retrieval

There are 495 repositories under the retrieval topic.

VectifyAI/PageIndex
📑 PageIndex: Document Index for Reasoning-based RAG
Language: Python | 3.9k 29 20278
embeddings-benchmark/mteb
MTEB: Massive Text Embedding Benchmark
Language: Python | 3k 17 1.3k 501
qdrant/fastembed
Fast, Accurate, Lightweight Python library to make State of the Art Embedding
Language: Python | 2.5k 19 231164
apache/lucenenet
Apache Lucene.NET
Language: C# | 2.3k 165 395649
memodb-io/memobase
Profile-Based Long-Term Memory for AI Applications. Memobase handles user profiles, memory events, and evolving context — perfect for chatbots, companions, tutors, customer service bots, and all chat-based agents.
Language: Python | 2.3k 15 56170
intel/intel-extension-for-transformers
⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platforms⚡
Language: Python | 2.2k 25 166216
beir-cellar/beir
A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.
Language: Python | 2k 20 150222
shervinea/mit-15-003-data-science-tools
Study guides for MIT's 15.003 Data Science Tools
1.9k 64 3371
parthsarthi03/raptor
The official implementation of RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval
Language: Python | 1.5k 18 53194
superlinked/superlinked
Superlinked is a Python framework for AI Engineers building high-performance search & recommendation applications that combine structured and unstructured data.
Language: Jupyter Notebook | 1.4k 28 57110
xhluca/bm25s
Fast lexical search implementing BM25 in Python using Numpy, Numba and Scipy
Language: Python | 1.4k 5 5481
tensorlakeai/indexify
A realtime serving engine for Data-Intensive Generative AI Applications
Language: Rust | 1.1k 17 260139
ArrowLuo/CLIP4Clip
An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"
Language: Python | 1k 12 111137
lucidrains/RETRO-pytorch
Implementation of RETRO, Deepmind's Retrieval based Attention net, in Pytorch
Language: Python | 875 24 34110
Muennighoff/sgpt
SGPT: GPT Sentence Embeddings for Semantic Search
Language: Jupyter Notebook | 873 8 4253
epsilla-cloud/vectordb
Epsilla is a high performance Vector Database Management System
Language: C++ | 866 6 2742
NeumTry/NeumAI
Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale.
Language: Python | 864 8 1450
OpenBMB/VisRAG
Parsing-free RAG supported by VLMs
Language: Python | 853 11 6668
AnswerDotAI/byaldi
Use late-interaction multi-modal models such as ColPali in just a few lines of code.
Language: Python | 829 20 5992
michaelthwan/searchGPT
Grounded search engine (i.e. with source reference) based on LLM / ChatGPT / OpenAI API. It supports web search, file content search etc.
Language: Python | 704 11 5174
ContextualAI/gritlm
Generative Representational Instruction Tuning
Language: Jupyter Notebook | 678 9 5848
Anush008/fastembed-rs
Rust library for generating vector embeddings, reranking. Re-write of qdrant/fastembed.
Language: Rust | 652 7 8388
shamangary/awesome-local-global-descriptor
My personal note about local and global descriptor
650 56 1395
lucidrains/memorizing-transformers-pytorch
Implementation of Memorizing Transformers (ICLR 2022), attention net augmented with indexing and retrieval of memories using approximate nearest neighbors, in Pytorch
Language: Python | 634 10 1347
EdoardoBotta/RQ-VAE-Recommender
[Pytorch] Generative retrieval model using semantic IDs from "Recommender Systems with Generative Retrieval"
Language: Python | 613 2 4186
BUAADreamer/EasyRAG
Easy-to-Use RAG Framework; CCF AIOps International Challenge 2024 Top3 Solution
Language: Python | 593 3 1374
redis-developer/ArXivChatGuru
Use ArXiv ChatGuru to talk to research papers. This app uses LangChain, OpenAI, Streamlit, and Redis as a vector database/semantic cache.
Language: Python | 554 7 1374
DataScienceUIBK/Rankify
🔥 Rankify: A Comprehensive Python Toolkit for Retrieval, Re-Ranking, and Retrieval-Augmented Generation 🔥. Our toolkit integrates 40 pre-retrieved benchmark datasets and supports 7+ retrieval techniques, 24+ state-of-the-art Reranking models, and multiple RAG methods.
Language: Python | 520 12 1139
SapienzaNLP/relik
Retrieve, Read and LinK: Fast and Accurate Entity Linking and Relation Extraction on an Academic Budget (ACL 2024)
Language: Python | 470 6 3336
KarelDO/xmc.dspy
In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples.
Language: Python | 442 23 924
AkariAsai/learning_to_retrieve_reasoning_paths
The official implementation of ICLR 2020, "Learning to Retrieve Reasoning Paths over Wikipedia Graph for Question Answering".
Language: Python | 434 16 2966
Aquila-Network/aquila
An easy to use Neural Search Engine. Index latent vectors along with JSON metadata and do efficient k-NN search.
Language: HTML | 380 20 4225
tonywu71/colpali-cookbooks
Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. 👨🏻‍🍳
339 7 328
raphaelsty/cherche
Neural Search
Language: Python | 334 7 914
arcee-ai/DALM
Domain Adapted Language Modeling Toolkit - E2E RAG
Language: Python | 330 10 3145
LongmaoTeamTf/deep_recommenders
Deep Recommenders
Language: Python | 330 6 7108