semantic-search

There are 462 repositories under the semantic-search topic.

  • generative-ai-for-beginners

    microsoft/generative-ai-for-beginners

    18 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/

    Language:Jupyter Notebook46.8k4168625.5k
  • mindsdb/mindsdb

    The platform for customizing AI from enterprise data

    Language:Python21.8k3813.8k2.9k
  • typesense/typesense

    Open Source alternative to Algolia + Pinecone and an Easier-to-Use alternative to ElasticSearch ⚡ 🔍 ✨ Fast, typo tolerant, in-memory fuzzy Search Engine for building delightful search experiences

    Language:C++18.4k1181.3k569
  • haystack

    deepset-ai/haystack

    🔍 LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.

    Language:Python14.2k1293.3k1.7k
  • khoj

    khoj-ai/khoj

    Your AI second brain. Get answers to your questions, whether they be online or in your own notes. Use online AI models (e.g. gpt4) or private, local LLMs (e.g. llama3). Self-host locally or use our cloud instance. Access from Obsidian, Emacs, Desktop app, Web or WhatsApp.

    Language:Python11.1k62393549
  • weaviate

    weaviate/weaviate

    Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of a cloud-native database.

    Language:Go10k1112.3k678
  • txtai

    neuml/txtai

    💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows

    Language:Python7.2k79696509
  • zilliztech/GPTCache

    Semantic cache for LLMs. Fully integrated with LangChain and llama_index.

    Language:Python6.6k57155459
  • superduperdb

    SuperDuperDB/superduperdb

    🔮 SuperDuperDB: Bring AI to your database! Build, deploy and manage any AI application directly with your existing data infrastructure, without moving your data. Including streaming inference, scalable model training and vector search.

    Language:Python4.4k391.1k435
  • marqo

    marqo-ai/marqo

    Unified embedding generation and search engine. Also available on cloud - cloud.marqo.ai

    Language:Python4.2k35233180
  • lancedb/lancedb

    Developer-friendly, serverless vector database for AI applications. Easily add long-term memory to your LLM apps!

    Language:Python3.3k26556218
  • gmpetrov/databerry

    The no-code platform for building custom LLM Agents

    Language:TypeScript2.9k31214386
  • ddangelov/Top2Vec

    Top2Vec learns jointly embedded topic, document and word vectors.

    Language:Python2.9k38327368
  • docarray

    docarray/docarray

    Represent, send, store and search multimodal data

    Language:Python2.8k43637223
  • pinecone-io/examples

    Jupyter Notebooks to help you get hands-on with Pinecone vector databases

    Language:Jupyter Notebook2.5k5944943
  • mazzzystar/Queryable

    Run OpenAI's CLIP model on iOS to search photos.

    Language:Swift2.5k1629392
  • semantra

    freedmand/semantra

    Multi-tool for semantic search

    Language:Python2.3k3257133
  • rom1504/clip-retrieval

    Easily compute clip embeddings and build a clip retrieval system with them

    Language:Jupyter Notebook2.2k23221199
  • awesome-generative-ai

    filipecalegario/awesome-generative-ai

    A curated list of Generative AI tools, works, models, and references

  • usearch

    unum-cloud/usearch

    Fast Open-Source Search & Clustering engine × for Vectors & 🔜 Strings × in C++, C, Python, JavaScript, Rust, Java, Objective-C, Swift, C#, GoLang, and Wolfram 🔍

    Language:C++1.8k22119108
  • embeddings-benchmark/mteb

    MTEB: Massive Text Embedding Benchmark

    Language:Python1.5k8280195
  • NotJoeMartinez/yt-fts

    YouTube Full Text Search - Search all of a YouTube channel from the command line

    Language:Python1.4k107356
  • kernel-memory

    microsoft/kernel-memory

    RAG architecture: index and query any data using LLM and natural language, track sources, show citations, asynchronous memory patterns.

    Language:C#1.3k35246242
  • gnes

    gnes-ai/gnes

    GNES is Generic Neural Elastic Search, a cloud-native semantic search system based on deep neural networks.

    Language:Python1.3k5423212
  • frutik/awesome-search

    Awesome Search - this is all about the (e-commerce, but not only) search and its awesomeness

    Language:HTML1.2k6214105
  • model-zoo/shift-ctrl-f

    🔎 Search the information available on a webpage using natural language instead of an exact string match.

    Language:JavaScript1.1k18946
  • fastRAG

    IntelLabs/fastRAG

    Efficient Retrieval Augmentation and Generation Framework

    Language:Python1k112590
  • uform

    unum-cloud/uform

    Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and 🔜 video, up to 5x faster than OpenAI CLIP and LLaVA 🖼️ & 🖋️

    Language:Python933132456
  • aws-genai-llm-chatbot

    aws-samples/aws-genai-llm-chatbot

    A modular and comprehensive solution to deploy a Multi-LLM and Multi-RAG powered chatbot (Amazon Bedrock, Anthropic, HuggingFace, OpenAI, Meta, AI21, Cohere) using AWS CDK on AWS

    Language:TypeScript93019260265
  • Dicklesworthstone/swiss_army_llama

    A FastAPI service for semantic text search using precomputed embeddings and advanced similarity measures, with built-in support for various file types through textract.

    Language:Python88813648
  • Muennighoff/sgpt

    SGPT: GPT Sentence Embeddings for Semantic Search

    Language:Jupyter Notebook81684251
  • hayabhay/frogbase

    Transform audio-visual content into navigable knowledge.

    Language:Python772174591
  • primeqa/primeqa

    The prime repository for state-of-the-art Multilingual Question Answering research and development.

    Language:Python7072832658
  • koursaros-ai/nboost

    NBoost is a scalable, search-api-boosting platform for deploying transformer models to improve the relevance of search results on different platforms (e.g. Elasticsearch)

    Language:Python673177969
  • hamelsmu/code_search

    Code For Medium Article: "How To Create Natural Language Semantic Search for Arbitrary Objects With Deep Learning"

    Language:Jupyter Notebook4902423139
  • examples

    jina-ai/examples

    Jina examples and demos to help you get started

    Language:Python45237162142