retrieval
There are 462 repositories under the retrieval topic.
VectifyAI/PageIndex
📑 PageIndex: Document Index for Reasoning-based RAG
embeddings-benchmark/mteb
MTEB: Massive Text Embedding Benchmark
qdrant/fastembed
Fast, Accurate, Lightweight Python library to make State of the Art Embedding
apache/lucenenet
Apache Lucene.NET
memodb-io/memobase
Profile-Based Long-Term Memory for AI Applications. Memobase handles user profiles, memory events, and evolving context — perfect for chatbots, companions, tutors, customer service bots, and all chat-based agents.
intel/intel-extension-for-transformers
⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platforms⚡
beir-cellar/beir
A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.
shervinea/mit-15-003-data-science-tools
Study guides for MIT's 15.003 Data Science Tools
parthsarthi03/raptor
The official implementation of RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval
superlinked/superlinked
Superlinked is a Python framework for AI Engineers building high-performance search & recommendation applications that combine structured and unstructured data.
xhluca/bm25s
Fast lexical search implementing BM25 in Python using Numpy, Numba and Scipy
tensorlakeai/indexify
A realtime serving engine for Data-Intensive Generative AI Applications
ArrowLuo/CLIP4Clip
An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"
lucidrains/RETRO-pytorch
Implementation of RETRO, Deepmind's Retrieval based Attention net, in Pytorch
Muennighoff/sgpt
SGPT: GPT Sentence Embeddings for Semantic Search
epsilla-cloud/vectordb
Epsilla is a high performance Vector Database Management System
NeumTry/NeumAI
Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale.
OpenBMB/VisRAG
Parsing-free RAG supported by VLMs
AnswerDotAI/byaldi
Use late-interaction multi-modal models such as ColPali in just a few lines of code.
michaelthwan/searchGPT
Grounded search engine (i.e., with source references) based on LLM / ChatGPT / OpenAI API. It supports web search, file content search, etc.
ContextualAI/gritlm
Generative Representational Instruction Tuning
Anush008/fastembed-rs
Rust library for generating vector embeddings, reranking. Re-write of qdrant/fastembed.
shamangary/awesome-local-global-descriptor
My personal note about local and global descriptor
lucidrains/memorizing-transformers-pytorch
Implementation of Memorizing Transformers (ICLR 2022), attention net augmented with indexing and retrieval of memories using approximate nearest neighbors, in Pytorch
EdoardoBotta/RQ-VAE-Recommender
[Pytorch] Generative retrieval model using semantic IDs from "Recommender Systems with Generative Retrieval"
BUAADreamer/EasyRAG
Easy-to-Use RAG Framework; CCF AIOps International Challenge 2024 Top3 Solution
redis-developer/ArXivChatGuru
Use ArXiv ChatGuru to talk to research papers. This app uses LangChain, OpenAI, Streamlit, and Redis as a vector database/semantic cache.
DataScienceUIBK/Rankify
🔥 Rankify: A Comprehensive Python Toolkit for Retrieval, Re-Ranking, and Retrieval-Augmented Generation 🔥. Our toolkit integrates 40 pre-retrieved benchmark datasets and supports 7+ retrieval techniques, 24+ state-of-the-art Reranking models, and multiple RAG methods.
SapienzaNLP/relik
Retrieve, Read and LinK: Fast and Accurate Entity Linking and Relation Extraction on an Academic Budget (ACL 2024)
KarelDO/xmc.dspy
In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples.
AkariAsai/learning_to_retrieve_reasoning_paths
The official implementation of ICLR 2020, "Learning to Retrieve Reasoning Paths over Wikipedia Graph for Question Answering".
Aquila-Network/aquila
An easy to use Neural Search Engine. Index latent vectors along with JSON metadata and do efficient k-NN search.
tonywu71/colpali-cookbooks
Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. 👨🏻‍🍳
raphaelsty/cherche
Neural Search
arcee-ai/DALM
Domain Adapted Language Modeling Toolkit - E2E RAG
LongmaoTeamTf/deep_recommenders
Deep Recommenders