similarity-search

There are 437 repositories under the similarity-search topic.

  • qdrant/qdrant

    Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/

    Language:Rust22.8k1301.4k1.6k
  • typesense/typesense

    Open Source alternative to Algolia + Pinecone and an Easier-to-Use alternative to ElasticSearch ⚡ 🔍 ✨ Fast, typo tolerant, in-memory fuzzy Search Engine for building delightful search experiences

    Language:C++22.6k1321.6k707
  • weaviate/weaviate

    Weaviate is an open-source vector database that stores both objects and vectors, combining vector search and structured filtering with the fault tolerance and scalability of a cloud-native database.

    Language:Go13k1292.5k910
  • zilliztech/GPTCache

    Semantic cache for LLMs. Fully integrated with LangChain and llama_index.

    Language:Python7.5k56175530
  • paradedb/paradedb

    Postgres for Search and Analytics

    Language:Rust6.9k43550234
  • lancedb/lancedb

    Developer-friendly, embedded retrieval engine for multimodal AI. Search More; Manage Less.

    Language:Rust6k33931432
  • GerevAI/gerev

    🧠 AI-powered enterprise search engine 🔎

    Language:Python2.8k2039182
  • hora-search/hora

    🚀 Efficient approximate nearest neighbor search algorithm collection library written in Rust 🦀.

    Language:Rust2.6k483075
  • unum-cloud/usearch

    Fast Open-Source Search & Clustering engine × for Vectors & 🔜 Strings × in C++, C, Python, JavaScript, Rust, Java, Objective-C, Swift, C#, GoLang, and Wolfram 🔍

    Language:C++2.6k34192172
  • sherlockchou86/VideoPipe

    A cross-platform video structuring (video analysis) framework. If you find it helpful, please give it a star :)

    Language:C++1.6k2633235
  • vdaas/vald

    Vald. A Highly Scalable Distributed Vector Search Engine

    Language:Go1.6k1810481
  • datastax/jvector

    JVector: the most advanced embedded vector search engine

    Language:Java1.6k32138122
  • currentslab/awesome-vector-search

    Collections of vector search related libraries, service and research papers

  • ashvardanian/SimSIMD

    Up to 200x Faster Dot Products & Similarity Metrics — for Python, Rust, C, JS, and Swift, supporting f64, f32, f16 real & complex, i8, and bit vectors using SIMD for both AVX2, AVX-512, NEON, SVE, & SVE2 📐

    Language:C1.3k2010475
  • tensorflow/similarity

    TensorFlow Similarity is a python package focused on making similarity learning quick and easy.

    Language:Python1k27190106
  • myscale/MyScaleDB

    A @ClickHouse fork that supports high-performance vector search and full-text search.

    Language:C++947141657
  • tantaraio/voy

    🕸️🦀 A WASM vector similarity search written in Rust

    Language:Rust940144235
  • shibing624/similarities

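    Similarities: a toolkit for similarity calculation and semantic search. Supports text-to-text, text-to-image, and image-to-image search over hundreds of millions of records; developed in Python 3 and ready to use out of the box.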

    Language:Python84084181
  • qdrant/quaterion

    Blazing fast framework for fine-tuning similarity learning models

    Language:Python65898145
  • ekzhu/SetSimilaritySearch

    All-pair set similarity search on millions of sets in Python and on a laptop

    Language:Python592181041
  • thulab/DeepHash

    An Open-Source Package for Deep Learning to Hash (DeepHash)

    Language:Python5612256126
  • hhblaze/DBreeze

    C# .NET NOSQL ( key value store embedded ) ACID multi-paradigm database management system.

    Language:C#551427260
  • ChunelFeng/caiss

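    A simple, easy-to-use, cross-platform, multi-language, high-performance retrieval engine for similar vectors, similar words, and similar sentences. Stars and forks are welcome. Build together! Power another!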

    Language:C++54091165
  • ArcadeData/arcadedb

    ArcadeDB Multi-Model Database, one DBMS that supports SQL, Cypher, Gremlin, HTTP/JSON, MongoDB and Redis. ArcadeDB is a conceptual fork of OrientDB, the first Multi-Model DBMS. ArcadeDB supports Vector Embeddings.

    Language:Java5381677268
  • qdrant/awesome-metric-learning

    😎 A curated list of awesome practical Metric Learning and its applications

  • CLUEbenchmark/KgCLUE

    KgCLUE: large-scale open-source Chinese knowledge graph question answering

    Language:Python44541360
  • m1guelpf/tinyvector

    A tiny embedding database in pure Rust.

    Language:Rust3998220
  • alexklibisz/elastiknn

    Elasticsearch plugin for nearest neighbor search. Store vectors and run similarity search using exact and approximate algorithms.

    Language:Scala379714149
  • Aquila-Network/aquila

    An easy to use Neural Search Engine. Index latent vectors along with JSON metadata and do efficient k-NN search.

    Language:HTML377204225
  • rapidsai/cuvs

    cuVS - a library for vector search and clustering on the GPU

    Language:Cuda3581626292
  • edwinkys/oasysdb

    In-memory vector store with efficient read and write performance for semantic caching and retrieval system. Redis for Semantic Caching.

    Language:Rust35733913
  • caoyue10/DeepHash-Papers

    Must-read papers on deep learning to hash (DeepHash)

  • derrickburns/generalized-kmeans-clustering

    Spark library for generalized K-Means clustering. Supports general Bregman divergences. Suitable for clustering probabilistic data, time series data, high dimensional data, and very large data.

    Language:HTML30016550
  • fzliu/radient

    Radient turns many data types (not just text) into vectors for similarity search, RAG, regression analysis, and more.

    Language:Python2753111
  • dangkhoasdc/awesome-vector-database

    A curated list of awesome works related to high dimensional structure/vector search & database

  • vioshyvo/mrpt

    Fast and lightweight header-only C++ library (with Python bindings) for approximate nearest neighbor search

    Language:C++259131647