embeddings-similarity

There are 61 repositories under embeddings-similarity topic.

  • qdrant

    qdrant/qdrant

    Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/

    Language:Rust26k1461.6k1.8k
  • featureform

    featureform/featureform

    The Virtual Feature Store. Turn your existing data infrastructure into a feature store.

    Language:Go1.9k1515999
  • epsilla-cloud/vectordb

    Epsilla is a high performance Vector Database Management System

    Language:C++86362740
  • m1guelpf/tinyvector

    A tiny embedding database in pure Rust.

    Language:Rust4198220
  • embedding_studio

    EulerSearch/embedding_studio

    Embedding Studio is a framework which allows you transform your Vector Database into a feature-rich Search Engine.

    Language:Python382665
  • marcominerva/ChatGptNet

    A ChatGPT integration library for .NET, supporting both OpenAI and Azure OpenAI Service

    Language:C#316156038
  • dangkhoasdc/awesome-vector-database

    A curated list of awesome works related to high dimensional structure/vector search & database

  • pentoai/vectory

    Vectory provides a collection of tools to track and compare embedding versions.

    Language:Python71550
  • louisbrulenaudet/ragoon

    High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡

    Language:Jupyter Notebook67517
  • farouqzaib/mettis

    Vector Database implemented in Golang with support for full-text and vector search as well as fault tolerance via Raft.

    Language:Go61204
  • handrew/gpt-memory

    Using embeddings to create memory.

    Language:Python31210
  • fm1320/song-vibe

    AI song recommendations based on the feel of a song

    Language:Python22130
  • tdeshazo/passage-probe

    A command-line tool to index and perform hybrid semantic & lexical search over text files

    Language:Python15
  • Babelscape/CroCoAlign

    A Cross-Lingual, Context-Aware and Fully-Neural Sentence Alignment System for Long Texts.

    Language:Python9320
  • MahmoudAbdelRahman/build2Vec

    Building representation in the vector space

    Language:Python9101
  • spartypkp/legalAI

    LegalAI is a passion project which explores and simplifies the complexities of obtaining legal information using LLMs.

    Language:Jupyter Notebook9101
  • TeamEpicProjects/Practical-LLM-and-GPT-Applications

    The repository is aimed at providing practical examples and resources for developers and researchers interested in applying LM and GPT models to real-world NLP problems.

    Language:Jupyter Notebook9102
  • France-Travail/embcompare

    A simple python tool for embedding comparison

    Language:Python7810
  • manthan89-py/Real-Time-Social-Media-Content-Retrievel-System

    The Real Time Social Media Content Retrieval System fetches real-time LinkedIn posts based on user queries, offering multiple post retrieval and customization options. Although initially focused on LinkedIn, it can be expanded to incorporate other social media platforms, facilitating cross-channel post similarity searches.

    Language:Python7121
  • gururaser/RoleRadar

    RoleRadar turns free-form requests like “Data Analyst roles in New York with SQL experience.” into structured filters and semantic-vector queries, delivering spot-on matches in seconds.

    Language:Jupyter Notebook6
  • Masetto96/music-collection-analyzer

    An essentia-based tool for extracting features from a collection of audio files. Two simple user interfaces, to create playlists and explore track similarities based on extracted audio features and embeddings.

    Language:Python4101
  • eu90h/semantic-dictionary

    A Python dictionary that uses semantic similarity for key matching instead of exact matches. This library allows you to retrieve values using keys that are semantically similar to the ones stored, making it ideal for natural language interfaces, etc.

    Language:Python3
  • lablab-ai/Vector-Similarity-Search-with-Redis-Quickstart-Notebook

    Vector similarity can be used to find similar products, articles and much more. In this tutorial, we will show you how to use Redis to index and search for similar vectors

    Language:Jupyter Notebook3303
  • MohammedAly22/Sentiment-Analysis-for-Homonyms-Problem

    A comprehensive examination is conducted to assess the influence of homonyms in sentiment analysis, employing two distinct techniques: fixed embeddings (LSTM) and contextualized embeddings (DistilBERT).

    Language:Jupyter Notebook3200
  • glove.c

    shubham0204/glove.c

    Simple, cross-platform port of GloVe embeddings, written in C

    Language:C3100
  • hubmapconsortium/asctb-ct-label-mapper

    asctb-ct-label-mapper: A package to recommend controlled vocabulary for annotations of scRNA-seq datasets. and thereby enable cross-dataset or cross-experiment comparison of annotations.

    Language:Python2431
  • jaypinho/transcript-accuracy

    A Streamlit app to evaluate the accuracy of automatic speech recognition (ASR) transcription services.

    Language:Python2101
  • Med-Karim-Ben-Boubaker/localume

    Localume is a powerful desktop application that enables semantic search across your documents using advanced vector embeddings and retrieval technology. The application monitors specified directories in real-time, automatically indexing new and modified files to maintain an up-to-date searchable database.

    Language:Python20
  • petermchale/llm-powered-applications

    Orchestrating the interaction between users and Large Language Models

    Language:Jupyter Notebook2101
  • sheriff1max/recs-searcher

    Python library for correcting registry and spelling errors in user input when comparing with a database of texts.

    Language:Python21500
  • cgast/embird

    An open-source project for crawling RSS feeds and websites, extracting news content, and storing it with vector embeddings for semantic search, clustering and visualization..

    Language:Python10
  • harehimself/pinecone-lab

    Experimenting with Pinecone as vector data continues to take center stage in AI-native systems. The purpose of this project is to explore the core capabilities, benchmark performance across different embedding models, and better understand what is possible with vector search in production environments.

    Language:Python1
  • korupolujayanth2004/Rag

    RAG Mini Project — Retrieval‑Augmented Generation chatbot with FastAPI backend (Docker on Hugging Face Spaces) and Streamlit frontend (Render), featuring document ingestion, vector search, and LLM‑powered answers

    Language:Python1
  • MinLee0210/evento

    Building an Event Retrieval System from Visual Data participating in Ho Chi Minh's AI Challenge in 2024

    Language:Python110
  • WetroLearn-TextEmbeddings

    wetrocloud/WetroLearn-TextEmbeddings

    A Streamlit app to visualize text similarity using embeddings and cosine distance. Compare and analyze texts interactively!

    Language:Python100