retrieval
There are 384 repositories under the retrieval topic.
chonkie-ai/chonkie
🦛 CHONK your texts with Chonkie ✨ - The no-nonsense RAG chunking library
apache/lucenenet
Apache Lucene.NET
embeddings-benchmark/mteb
MTEB: Massive Text Embedding Benchmark
intel/intel-extension-for-transformers
⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platforms⚡
shervinea/mit-15-003-data-science-tools
Study guides for MIT's 15.003 Data Science Tools
qdrant/fastembed
Fast, Accurate, Lightweight Python library to make State of the Art Embedding
beir-cellar/beir
A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.
parthsarthi03/raptor
The official implementation of RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval
xhluca/bm25s
Fast lexical search implementing BM25 in Python using Numpy, Numba and Scipy
tensorlakeai/indexify
A realtime serving engine for Data-Intensive Generative AI Applications
superlinked/superlinked
Superlinked is a Python framework for AI Engineers building high-performance search & recommendation applications that combine structured and unstructured data.
ArrowLuo/CLIP4Clip
An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"
Muennighoff/sgpt
SGPT: GPT Sentence Embeddings for Semantic Search
lucidrains/RETRO-pytorch
Implementation of RETRO, Deepmind's Retrieval based Attention net, in Pytorch
NeumTry/NeumAI
Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale.
epsilla-cloud/vectordb
Epsilla is a high performance Vector Database Management System
AnswerDotAI/byaldi
Use late-interaction multi-modal models such as ColPali in just a few lines of code.
michaelthwan/searchGPT
Grounded search engine (i.e. with source reference) based on LLM / ChatGPT / OpenAI API. It supports web search, file content search etc.
shamangary/awesome-local-global-descriptor
My personal notes on local and global descriptors
lucidrains/memorizing-transformers-pytorch
Implementation of Memorizing Transformers (ICLR 2022), attention net augmented with indexing and retrieval of memories using approximate nearest neighbors, in Pytorch
ContextualAI/gritlm
Generative Representational Instruction Tuning
OpenBMB/VisRAG
Parsing-free RAG supported by VLMs
redis-developer/ArXivChatGuru
Use ArXiv ChatGuru to talk to research papers. This app uses LangChain, OpenAI, Streamlit, and Redis as a vector database/semantic cache.
memodb-io/memobase
Profile-Based Long-Term Memory for AI Applications
AkariAsai/learning_to_retrieve_reasoning_paths
The official implementation of ICLR 2020, "Learning to Retrieve Reasoning Paths over Wikipedia Graph for Question Answering".
Anush008/fastembed-rs
Rust library for generating vector embeddings, reranking locally
KarelDO/xmc.dspy
In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples.
SapienzaNLP/relik
Retrieve, Read and LinK: Fast and Accurate Entity Linking and Relation Extraction on an Academic Budget (ACL 2024)
Aquila-Network/aquila
An easy to use Neural Search Engine. Index latent vectors along with JSON metadata and do efficient k-NN search.
raphaelsty/cherche
Neural Search
LongmaoTeamTf/deep_recommenders
Deep Recommenders
arcee-ai/DALM
Domain Adapted Language Modeling Toolkit - E2E RAG
BUAADreamer/EasyRAG
Easy-to-Use RAG Framework; CCF AIOps International Challenge 2024 Top3 Solution
tonywu71/colpali-cookbooks
Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. 👨🏻‍🍳
chao1224/MoleculeSTM
Multi-modal Molecule Structure-text Model for Text-based Editing and Retrieval, Nat Mach Intell 2023 (https://www.nature.com/articles/s42256-023-00759-6)
meinardmueller/libfmp
libfmp - Python package for teaching and learning Fundamentals of Music Processing (FMP)