Awesome Vector Search

This repo collects papers, documents, and code about vector search for anyone who wants to research it. We are continuously improving the project. Pull requests that add works (papers, repositories) the repo is missing are welcome.

Papers

last updated: 2025-09-24

2025

  • [arXiv] Graph-Based Approximate Nearest Neighbor Search Revisited: Theoretical Analysis and Optimization [Graph] [Theory]

  • [arXiv] SAQ: Pushing the Limits of Vector Quantization through Code Adjustment and Dimension Segmentation [Compression]

  • [arXiv] Towards the Distributed Large-scale k-NN Graph Construction by Graph Merge [Graph] [Distributed]

  • [arXiv] SINDI: an Efficient Index for Approximate Maximum Inner Product Search on Sparse Vectors [Sparse]

  • [arXiv] Filtered Approximate Nearest Neighbor Search: A Unified Benchmark and Systematic Experimental Study [Experiment, Analysis & Benchmark] [Filter] [code] GitHub Repo stars

  • [arXiv] DistributedANN: Efficient Scaling of a Single DiskANN Graph Across Thousands of Computers [Graph] [Distributed]

  • [arXiv] BAMG: A Block-Aware Monotonic Graph Index for Disk-Based Approximate Nearest Neighbor Search [Graph] [Hardware]

  • [VLDB] Select Edges Wisely: Monotonic Path Aware Graph Layout Optimization for Disk-Based ANN Search [code] GitHub Repo stars

  • [SIGMOD] WoW: A Window-to-Window Incremental Index for Range-Filtering Approximate Nearest Neighbor Search [Filter] [code] GitHub Repo stars

  • [arXiv] PGTuner: An Efficient Framework for Automatic and Transferable Configuration Tuning of Proximity Graphs [Tuning] [Graph]

  • [arXiv] TRIM: Accelerating High-Dimensional Vector Similarity Search with Enhanced Triangle-Inequality-Based Pruning [DCO]

  • [arXiv] Accelerating High-Dimensional Nearest Neighbor Search with Dynamic Query Preference [Tuning]

  • [SIGIR Tutorial] Neural Lexical Search with Learned Sparse Retrieval [Sparse]

  • [arXiv] CRINN: Contrastive Reinforcement Learning for Approximate Nearest Neighbor Search [Learning] [code] GitHub Repo stars

  • [arXiv] Balancing the Blend: An Experimental Analysis of Trade-offs in Hybrid Search [Hybrid] [code] GitHub Repo stars

  • [arXiv] Benchmarking Filtered Approximate Nearest Neighbor Search Algorithms on Transformer-based Embedding Vectors [Filter] [Survey] [code] GitHub Repo stars

  • [arXiv] CleANN: Efficient Full Dynamism in Graph-based Approximate Nearest Neighbor Search [Streaming]

  • [OSDI] Achieving Low-Latency Graph-Based Vector Search via Aligning Best-First Search Algorithm with SSD [Graph] [Hardware]

  • [VLDB] SIEVE: Effective Filtered Vector Search with Collection of Indexes [Filter] [code] GitHub Repo stars

  • [VLDB] Breaking the Storage-Compute Bottleneck in Billion-Scale ANNS: A GPU-Driven Asynchronous I/O Framework [Hardware] [code] GitHub Repo stars

  • [arXiv] Towards Efficient and Scalable Distributed Vector Search with RDMA [Hardware]

  • [arXiv] OneDB: A Distributed Multi-Metric Data Similarity Search System [Multimodal]

  • [arXiv] Towards Robustness: A Critique of Current Vector Database Assessments [Experiment]

  • [ISCA] REIS: A High-Performance and Energy-Efficient Retrieval System with In-Storage Processing [Hardware]

  • [KDD] Empowering Graph-based Approximate Nearest Neighbor Search with Adaptive Awareness Capabilities [Graph]

  • [SIGMOD] HARMONY: A Scalable Distributed Vector Database for High-Throughput Approximate Nearest Neighbor Search [Distributed]

  • [SIGMOD] Efficient Dynamic Indexing for Range Filtered Approximate Nearest Neighbor Search [Filter] [code] GitHub Repo stars

  • [SIGMOD] DIGRA: A Dynamic Graph Indexing for Approximate Nearest Neighbor Search with Range Filter [Filter]

  • [SIGMOD] MIRAGE-ANNS: Mixed Approach Graph-based Indexing for Approximate Nearest Neighbor Search [Graph] [code] GitHub Repo stars

  • [arXiv] EnhanceGraph: A Continuously Enhanced Graph-based Index for High-dimensional Approximate Nearest Neighbor Search [Graph] [code] GitHub Repo stars

  • [arXiv] LEANN: A Low-Storage Vector Index [Hardware]

  • [OSDI] Quake: Adaptive Indexing for Vector Search [Streaming] [code] GitHub Repo stars

  • [SIGMOD] VecFlow: A High-Performance Vector Data Management System for Filtered-Search on GPUs [Filter] [Hardware] [code] GitHub Repo stars

  • [arXiv] VIBE: Vector Index Benchmark for Embeddings [Benchmark] [code] GitHub Repo stars

  • [arXiv] Cosmos: A CXL-Based Full In-Memory System for Approximate Nearest Neighbor Search [Hardware]

  • [arXiv] HAKES: Scalable Vector Database for Embedding Search Service [Streaming] [code] GitHub Repo stars

  • [arXiv] Bang for the Buck: Vector Search on Cloud CPUs [Experiment]

  • [arXiv] Survey of Filtered Approximate Nearest Neighbor Search over the Vector-Scalar Hybrid Data [Survey] [Filter] [code] GitHub Repo stars

  • [arXiv] RetroInfer: A Vector-Storage Approach for Scalable Long-Context LLM Inference [KVCache] [code] GitHub Repo stars

  • [arXiv] Efficient Graph-Based Approximate Nearest Neighbor Search Achieving: Low Latency Without Throughput Loss [Graph] [Serving]

  • [arXiv] MINT: Multi-Vector Search Index Tuning [Tuning]

  • [arXiv] TurboQuant: Online Vector Quantization with Near-optimal Distortion Rate [Compression]

  • [SIGIR] Dynamic Superblock Pruning for Fast Learned Sparse Retrieval [Sparse] [code] GitHub Repo stars

  • [SIGIR] Stitching Inner Product and Euclidean Metrics for Topology-aware Maximum Inner Product Search [MIPS] [code] GitHub Repo stars

  • [SIGMOD] RWalks: Random Walks as Attribute Diffusers for Filtered Vector Search [Filter] [Graph]

  • [SIGMOD] AlayaDB: The Data Foundation for Efficient and Effective Long-context LLM Inference [KVCache]

  • [SIGMOD] MicroNN: An On-device Disk-resident Updatable Vector Database [Hardware]

  • [arXiv] ESG: Elastic Graphs for Range-Filtering Approximate k-Nearest Neighbor Search [Filter]

  • [arXiv] LIRA: A Learning-based Query-aware Partition Framework for Large-scale ANN Search [Tuning]

  • [arXiv] VSAG: An Optimized Search Framework for Graph-based Approximate Nearest Neighbor Search [Framework] [code] GitHub Repo stars

  • [VLDB] Maximum Inner Product is Query-Scaled Nearest Neighbor [MIPS] [code] GitHub Repo stars

  • [SIGMOD] PDX: A Data Layout for Vector Similarity Search [Hardware] [code] GitHub Repo stars

  • [ICML] Sort Before You Prune: Improved Worst-Case Guarantees of the DiskANN Family of Graphs [Graph]

  • [WWW] Scalable Overload-Aware Graph-Based Index Construction for 10-Billion-Scale Vector Similarity Search [Graph]

  • [SIGMOD] Accelerating Graph Indexing for ANNS on Modern CPUs [Graph] [code] GitHub Repo stars

  • [arXiv] In-Place Updates of a Graph Index for Streaming Approximate Nearest Neighbor Search [Streaming]

  • [ICML] Graph-Based Algorithms for Diverse Similarity Search [Filter] [Graph] [code] GitHub Repo stars

  • [arXiv] Inference-time sparse attention with asymmetric indexing [KVCache]

  • [SIGMOD] DEG: Efficient Hybrid Vector Search Using the Dynamic Edge Navigation Graph [Filter] [code] GitHub Repo stars

  • [arXiv] Exploiting Sparsity for Long Context Inference: Million Token Contexts on Commodity GPUs [KVCache] [code] GitHub Repo stars

  • [arXiv] Scalable k-Means Clustering for Large k via Seeded Approximate Nearest-Neighbor Search [Clustering] [code] GitHub Repo stars

  • [SIGMOD] Graph-Based Vector Search: An Experimental Evaluation of the State-of-the-Art [Graph]

  • [arXiv] Lossless Compression of Vector IDs for Approximate Nearest Neighbor Search [Compression] [code] GitHub Repo stars

  • [ICLR] Qinco2: Vector Compression and Search with Improved Implicit Neural Codebooks [Compression] [code] GitHub Repo stars

  • [ICDE] Boosting Accuracy and Efficiency for Vector Retrieval with Local Scaling Graph [Graph]

2024

  • [VLDB] UNIFY: Unified Index for Range Filtered Approximate Nearest Neighbors Search [Filter] [code] GitHub Repo stars

  • [SIGMOD] Navigating Labels and Vectors: A Unified Approach to Filtered Approximate Nearest Neighbor Search [Filter] [code] GitHub Repo stars

  • [VLDB] Efficient Data-aware Distance Comparison Operations for High-Dimensional Approximate Nearest Neighbor Search [DCO] [code] GitHub Repo stars

  • [SIGMOD] Subspace Collision: An Efficient and Accurate Framework for High-dimensional Approximate Nearest Neighbor Search [DCO]

  • [SIGMOD] SymphonyQG: Towards Symphonious Integration of Quantization and Graph for Approximate Nearest Neighbor Search [Graph] [code] GitHub Repo stars

  • [arXiv] Efficient and Effective Retrieval of Dense-Sparse Hybrid Vectors using Graph-based Approximate Nearest Neighbor Search [Hybrid] [Graph]

  • [NeurIPS] LoRANN: Low-Rank Matrix Factorization for Approximate Nearest Neighbor Search [Compression] [code] GitHub Repo stars

  • [VLDB] Revisiting the Index Construction of Proximity Graph-Based Approximate Nearest Neighbor Search [Graph] [code] GitHub Repo stars

  • [FAST] FusionANNS: An Efficient CPU/GPU Cooperative Processing Architecture for Billion-scale Approximate Nearest Neighbor Search

  • [arXiv] RetrievalAttention: Accelerating Long-Context LLM Inference via Vector Retrieval [KVCache]

  • [SIGMOD] Practical and Asymptotically Optimal Quantization of High-Dimensional Vectors in Euclidean Space for Approximate Nearest Neighbor Search [Compression] [code] GitHub Repo stars

  • [SIGMOD] iRangeGraph: Improvising Range-dedicated Graphs for Range-filtering Nearest Neighbor Search [Filter] [code] GitHub Repo stars

  • [VLDB] Steiner-Hardness: A Query Hardness Measure for Graph-Based ANN Indexes [Graph]

  • [VLDB] RoarGraph: A Projected Bipartite Graph for Efficient Cross-Modal Approximate Nearest Neighbor Search [Graph] [code] GitHub Repo stars

  • [VLDB] SingleStore-V: An Integrated Vector Database System in SingleStore [Database]

  • [CIKM] Pairing Clustered Inverted Indexes with kNN Graphs for Fast Approximate Retrieval over Learned Sparse Representations [Sparse]

  • [CIKM] A Real-Time Adaptive Multi-Stream GPU System for Online Approximate Nearest Neighborhood Search [Hardware] [Streaming]

  • [SIGIR] Efficient Approximate Nearest Neighbor Search on a Raspberry Pi [Hardware]

  • [ATC] Scalable Billion-point Approximate Nearest Neighbor Search Using SmartSSDs [Hardware]

  • [arXiv] CANDY: A Benchmark for Continuous Approximate Nearest Neighbor Search with Dynamic Data Ingestion [Benchmark] [Streaming] [code] GitHub Repo stars

  • [SIGMOD] Vexless: A Serverless Vector Data Management System Using Cloud Functions [Serverless]

  • [SIGMOD] RaBitQ: Quantizing High-Dimensional Vectors with a Theoretical Error Bound for Approximate Nearest Neighbor Search [Compression] [code] GitHub Repo stars

  • [VLDB] DET-LSH: A Locality-Sensitive Hashing Scheme with Dynamic Encoding Tree for Approximate Nearest Neighbor Search [LSH] [code] GitHub Repo stars

  • [ICDE] Effective and General Distance Computation for Approximate Nearest Neighbor Search [DCO] [code] GitHub Repo stars

  • [arXiv] AiSAQ: All-in-Storage ANNS with Product Quantization for DRAM-free Information Retrieval [Hardware] [code] GitHub Repo stars

  • [SIGIR] Efficient Inverted Indexes for Approximate Retrieval over Learned Sparse Representations [Sparse] [code] GitHub Repo stars

  • [SIGMOD] GTS: GPU-based Tree Index for Fast Similarity Search [Hardware] [code] GitHub Repo stars

  • [arXiv] Dimensionality-Reduction Techniques for Approximate Nearest Neighbor Search: A Survey and Evaluation [Survey]

  • [SIGMOD] ACORN: Performant and Predicate-Agnostic Search Over Vector Embeddings and Structured Data [Filter]

  • [ICML] Probabilistic Routing for Graph-Based Approximate Nearest Neighbor Search [Graph] [code] GitHub Repo stars

  • [ICML] Approximate Nearest Neighbor Search with Window Filters [Filter] [code] GitHub Repo stars

  • [ICML] Residual Quantization with Implicit Neural Codebooks [Compression] [code] GitHub Repo stars

  • [arXiv] BANG: Billion-Scale Approximate Nearest Neighbor Search using a Single GPU [Hardware] [code] GitHub Repo stars

  • [arXiv] Curator: Efficient Indexing for Multi-Tenant Vector Databases [Multitenancy]

  • [SIGMOD] Starling: An I/O-Efficient Disk-Resident Graph Index Framework for High-Dimensional Vector Similarity Search on Data Segment [Hardware] [Graph] [code] GitHub Repo stars

2023

  • [NeurIPS] An Efficient and Robust Framework for Approximate Nearest Neighbor Search with Attribute Constraint [Filter]

  • [ICDE] MUST: An Effective and Scalable Framework for Multimodal Search of Target Modality [Multimodal] [code] GitHub Repo stars

  • [MM] Relative NN-Descent: A Fast Index Construction for Graph-Based Approximate Nearest Neighbor Search [Graph] [code] GitHub Repo stars

  • [VLDB] Chameleon: a Heterogeneous and Disaggregated Accelerator System for Retrieval-Augmented Language Models [Hardware] [code] GitHub Repo stars

  • [WWW] Filtered-DiskANN: Graph Algorithms for Approximate Nearest Neighbor Search with Filters [Filter]

  • [ATC] CXL-ANNS: Software-Hardware Collaborative Memory Disaggregation and Computation for Billion-Scale Approximate Nearest Neighbor Search [Hardware] [Distributed]

  • [SOSP] SPFresh: Incremental In-Place Update for Billion-Scale Vector Search [Streaming]

  • [OSDI] VBASE: Unifying Online Vector Similarity Search and Relational Queries via Relaxed Monotonicity [Filter] [code] GitHub Repo stars

  • [arXiv] CAPS: A Practical Partition Index for Filtered Similarity Search [Filter] [code] GitHub Repo stars