retrieval
There are 303 repositories under the retrieval topic.
apache/lucenenet
Apache Lucene.NET
intel/intel-extension-for-transformers
⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platforms⚡
shervinea/mit-15-003-data-science-tools
Study guides for MIT's 15.003 Data Science Tools
embeddings-benchmark/mteb
MTEB: Massive Text Embedding Benchmark
beir-cellar/beir
A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.
SciPhi-AI/R2R
Build and deploy a fully-featured, observable, user-facing RAG backend in minutes.
qdrant/fastembed
Fast, accurate, lightweight Python library for generating state-of-the-art embeddings
epsilla-cloud/vectordb
Epsilla is a high performance Vector Database Management System. Try out hosted Epsilla at https://cloud.epsilla.com/
lucidrains/RETRO-pytorch
Implementation of RETRO, DeepMind's retrieval-based attention net, in PyTorch
Muennighoff/sgpt
SGPT: GPT Sentence Embeddings for Semantic Search
ArrowLuo/CLIP4Clip
An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"
NeumTry/NeumAI
Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale.
shamangary/awesome-local-global-descriptor
My personal notes about local and global descriptors
lucidrains/memorizing-transformers-pytorch
Implementation of Memorizing Transformers (ICLR 2022), attention net augmented with indexing and retrieval of memories using approximate nearest neighbors, in Pytorch
parthsarthi03/raptor
The official implementation of RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval
michaelthwan/searchGPT
Grounded search engine (i.e. with source reference) based on LLM / ChatGPT / OpenAI API. It supports web search, file content search etc.
tensorlakeai/indexify
A realtime indexing and structured extraction engine for unstructured data to build generative AI applications
redis-developer/ArXivChatGuru
Use ArXiv ChatGuru to talk to research papers. This app uses LangChain, OpenAI, Streamlit, and Redis as a vector database/semantic cache.
ContextualAI/gritlm
Generative Representational Instruction Tuning
AkariAsai/learning_to_retrieve_reasoning_paths
The official implementation of ICLR 2020, "Learning to Retrieve Reasoning Paths over Wikipedia Graph for Question Answering".
Aquila-Network/aquila
An easy to use Neural Search Engine. Index latent vectors along with JSON metadata and do efficient k-NN search.
LongmaoTeamTf/deep_recommenders
Deep Recommenders
KarelDO/xmc.dspy
In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples.
raphaelsty/cherche
Neural Search
arcee-ai/DALM
Domain Adapted Language Modeling Toolkit - E2E RAG
chao1224/MoleculeSTM
Multi-modal Molecule Structure-text Model for Text-based Editing and Retrieval, Nat Mach Intell 2023 (https://www.nature.com/articles/s42256-023-00759-6)
meinardmueller/libfmp
libfmp - Python package for teaching and learning Fundamentals of Music Processing (FMP)
Anush008/fastembed-rs
Library for generating vector embeddings and reranking. Based on Qdrant's FastEmbed.
m-bain/CondensedMovies
Story-Based Retrieval with Contextual Embeddings. Largest freely available movie video dataset. [ACCV'20]
rom1504/image_embeddings
Using efficientnet to provide embeddings for retrieval
mendableai/rag-arena
Open-source RAG evaluation through users' feedback
luyug/COIL
NAACL2021 - COIL Contextualized Lexical Retriever
ARM-DOE/ACT
Atmospheric data Community Toolkit - A Python-based toolkit for exploring and analyzing time series atmospheric datasets
protyposis/AudioAlign
Audio Synchronization and Analysis Tool
luyug/GC-DPR
Train Dense Passage Retriever (DPR) with a single GPU