vector-embeddings

There are 92 repositories under the vector-embeddings topic.

  • kantord/SeaGOAT

    local-first semantic code search engine

    Language: Python
  • ZachNagengast/similarity-search-kit

    🔎 SimilaritySearchKit is a Swift package providing on-device text embeddings and semantic search functionality for iOS and macOS applications.

    Language: Swift
  • bennyschmidt/next-token-prediction

    Next-token prediction in JavaScript - build fast language and diffusion models.

    Language: JavaScript
  • vdutts7/youtube-gpt

    YouTubeGPT • AI Chat with 100+ videos ft. YouTuber Marques Brownlee (@ MKBHD) ⚡️🔴🤖💬

    Language: TypeScript
  • cwida/PDX

    ⚑ Faster similarity search with PDX: A vertical data layout for vectors

    Language: C++
  • fpgmaas/pypi-scout

    Find Python Packages on PyPI with the help of vector embeddings

    Language: Python
  • vdutts7/constitutionGPT

    AI chat over the US Constitution 📜 💬 🇺🇸

    Language: TypeScript
  • alash3al/vecdb

    a vector embedding database with multiple storage engines and AI embedding integrations

    Language: Go
  • vdutts7/ai-mreflow

    YouTubeGPT • AI Chat with 100+ videos ft. YouTuber Matt Wolfe (@mreflow) 🐺🟣🤖💬

    Language: TypeScript
  • madeyexz/markdown-file-query

    Semantic QA with a markdown database: Query any markdown file using vector embedding, Pinecone vector database and GPT (langchain). A weaker version of privateGPT

    Language: Python
  • vdutts7/cs186-ai-chat

    UC Berkeley CS186 AI Chatbot 🤖 🖥️ 🐻

    Language: TypeScript
  • vdutts7/chatBTC

    AI Chat with The ₿itcoin Whitepaper

    Language: TypeScript
  • dead8309/ai-rag-crawler

    AI pipeline built with the honc and workers-ai. vector embeddings, web scraping and processing with Cloudflare Workflows (beta)

    Language: TypeScript
  • Govind-S-B/pdf-to-text-chroma-search

    Python scripts that convert PDF files to text, split them into chunks, and store their vector representations using GPT4All embeddings in a Chroma DB. A separate script queries the Chroma DB for similarity search based on user input.

    Language: Python
  • serp-ai/V3CTRON-vector-database-embedding-neural-search-retrieval-chatgpt-plugin

    V3CTRON | Vector Embeddings Data Retrieval | ChatGPT Plugin

    Language: Python
  • kanugurajesh/SoulCare

    SoulCare is a mental health app using NLP to analyze social media sentiment, track symptoms, and offer AI-driven support with personalized reports, document uploads, and symptom-based prioritization.

    Language: TypeScript
  • vdutts7/ee16b-ai-chat

    UC Berkeley EE16B AI Chatbot 🤖 🖥️ 🐻

    Language: TypeScript
  • liuliuOD/Documentation-Embedding

    This tool provides a fast and efficient way to convert text into vector embeddings and store them in the Qdrant search engine. Built with Rust, this tool is designed to handle large datasets and deliver lightning-fast search results.

    Language: Rust
  • berecat/openai-pinecone-search

    Semantic search with OpenAI's embeddings stored in Pinecone (vector database)

    Language: TypeScript
  • deadbits/vector-embedding-api

    Flask API for generating text embeddings using OpenAI or sentence_transformers

    Language: Python
  • abhishekHegde2000/ai-note-app

    Developed an AI-powered note-taking application using Next.js 14, ChatGPT API, vector embeddings, Pinecone, TailwindCSS, Shadcn UI, and TypeScript.

    Language: TypeScript
  • monish-prabhu/Intra-Search

    A tool for performing semantic search within pdf documents leveraging sentence transformers.

    Language: Python
  • Snehil-Shah/Multimodal-Image-Search-Engine

    Text to Image & Reverse Image Search Engine built upon Vector Similarity Search utilizing CLIP VL-Transformer for Semantic Embeddings & Qdrant as the Vector-Store

    Language: Jupyter Notebook
  • angus-spence/loc2vec

    Learning semantic embeddings from OSM data: A Pytorch implementation of the loc2vec general method outlined in: https://sentiance.com/loc2vec-learning-location-embeddings-w-triplet-loss-networks.

    Language: Python
  • m-abdelwahab/build-smarter-chatbots-workshop

    Build an AI-powered chatbot that is able to access external data to provide the most accurate answer

    Language: TypeScript
  • Dr-Hutchinson/nicolay

    Nicolay is a digital history experiment that uses artificial intelligence to explore the speeches of Abraham Lincoln.

    Language: Python
  • piyush-eon/ai-portfolio-nextjs

    Build an AI-driven portfolio app with Next.js and Tailwind CSS. We will learn advanced AI technologies like vector embeddings and vector databases, along with how to work with OpenAI's APIs. This is a good project for impressing recruiters and showcasing your skill set.

    Language: JavaScript
  • baronet2/Bike2Vec

    Vector Embedding Representations of Road Cycling Riders and Races

    Language: Jupyter Notebook
  • pashpashpash/python-rag-scaffold

    A comprehensive RAG FastAPI service that handles document uploads and retrievals, built with Python. Uses PyMuPDF for document processing, turbopuffer for vector storage, OpenAI for models, and cohere for reranking.

    Language: Python
  • worldbeater/code-vecs

    Code for the methods and algorithms described in the paper "Analysis of Program Representations Based on Abstract Syntax Trees and Higher-Order Markov Chains for Source Code Classification Task"

    Language: Jupyter Notebook
  • sfoteini/image-vector-search-azure-postgresql

    Image Vector Similarity Search with Azure AI Vision (Florence model) and Azure Cosmos DB for PostgreSQL

    Language: Jupyter Notebook
  • sueszli/vector-database-benchmark

    paper: vecdb benchmark stats for dec 2023

  • itsbariscan/ContextBridge-Semantic-Internal-Link-Tool

    ContextBridge-Semantic-Internal-Link-Tool is an advanced Python script designed to enhance website structure and user experience by identifying and suggesting intelligent internal linking opportunities.

    Language: Python
  • itsbariscan/SEO-Semantix

    The SEO Content Analyzer is a sophisticated Python script designed to perform in-depth semantic analysis of content for SEO purposes.

    Language: Python
  • kaloslazo/PyFuseDB

    Database system that combines structured data retrieval through inverted indexes with unstructured data (images, audio) search using multidimensional vector embeddings, all within a unified platform.

    Language: Python
  • KristianMSchmidt/semantic-art-search

    Semantic Art Search – Discover art through meaning, not just keywords.

    Language: Python