vector-embeddings

There are 156 repositories under vector-embeddings topic.

  • SeaGOAT

    kantord/SeaGOAT

    local-first semantic code search engine

    Language:Python1.2k67983
  • ZachNagengast/similarity-search-kit

    🔎 SimilaritySearchKit is a Swift package providing on-device text embeddings and semantic search functionality for iOS and macOS applications.

    Language:Swift493161846
  • dev-diaries41/smartscan

    Search images and videos offline using text or by reverse image search with on-device AI.

    Language:Kotlin33214
  • oldjs/web-code-agent

    Browser-local AI code Q&A engine. Chat with your codebase, ensure privacy, and optimize LLM collaboration.

    Language:TypeScript2672021
  • bennyschmidt/next-token-prediction

    Next-token prediction in JavaScript — build fast language and diffusion models.

    Language:JavaScript143406
  • youtube-gpt

    vdutts7/youtube-gpt

    YouTubeGPT • AI Chat with 100+ videos ft. YouTuber Marques Brownlee (@ MKBHD) ⚡️🔴🤖💬

    Language:TypeScript682017
  • fpgmaas/pypi-scout

    Find Python Packages on PyPI with the help of vector embeddings

    Language:Python47141
  • vdutts7/constitutionGPT

    AI chat over the US Constitution 📜 💬 🇺🇸

    Language:TypeScript44205
  • ai-mreflow

    vdutts7/ai-mreflow

    YouTubeGPT • AI Chat with 100+ videos ft. YouTuber Matt Wolfe (@mreflow) 🐺🟣🤖💬

    Language:TypeScript34103
  • alash3al/vecdb

    a vector embedding database with multiple storage engines and AI embedding integrations

    Language:Go33103
  • madeyexz/markdown-file-query

    Semantic QA with a markdown database: Query any markdown file using vector embedding, Pinecone vector database and GPT (langchain). A weaker version of privateGPT

    Language:Python32104
  • vdutts7/cs186-ai-chat

    UC Berkeley CS186 AI Chatbot 🤖 🖥️ 🐻

    Language:TypeScript31105
  • Rudra-DB/rudradb-opin-examples

    rudradb-opin-examples is for example implementations of the pip install rudradb-opin

  • vdutts7/chatBTC

    AI Chat with The ₿itcoin Whitepaper

    Language:TypeScript27104
  • dead8309/ai-rag-crawler

    AI pipeline built with the honc and workers-ai. vector embeddings, web scraping and processing with Cloudflare Workflows (beta)

    Language:TypeScript26201
  • sashakolpakov/dire-jax

    DImensionality REduction in JAX

    Language:Python242
  • Govind-S-B/pdf-to-text-chroma-search

    Python scripts that converts PDF files to text, splits them into chunks, and stores their vector representations using GPT4All embeddings in a Chroma DB. It also provides a script to query the Chroma DB for similarity search based on user input.

    Language:Python23107
  • serp-ai/V3CTRON-vector-database-embedding-neural-search-retrieval-chatgpt-plugin

    V3CTRON | Vector Embeddings Data Retrieval | ChatGPT Plugin

    Language:Python22208
  • kanugurajesh/SoulCare

    SoulCare is a mental health app using NLP to analyze social media sentiment, track symptoms, and offer AI-driven support with personalized reports, document uploads, and symptom-based prioritization.

    Language:TypeScript20101
  • abhishekHegde2000/ai-note-app

    Developed an AI-powered note-taking application using Next.js 14, ChatGPT API, vector embeddings, Pinecone, TailwindCSS, Shadcn UI, and TypeScript.

    Language:TypeScript18213
  • Snehil-Shah/Multimodal-Image-Search-Engine

    Text to Image & Reverse Image Search Engine built upon Vector Similarity Search utilizing CLIP VL-Transformer for Semantic Embeddings & Qdrant as the Vector-Store

    Language:Jupyter Notebook16204
  • vdutts7/ee16b-ai-chat

    UC Berkeley EE16B AI Chatbot 🤖 🖥️ 🐻

    Language:TypeScript16107
  • AutomatosAI/automatos-ai

    Automatos AI: Open-source platform for advanced context engineering and multi-agent orchestration in enterprise automation. Built on frontier research in RAG, vector embeddings, cognitive tools, emergent symbols, and neural field theory—powered by FastAPI, Next.js, and PostgreSQL.

    Language:Python15
  • liuliuOD/Documentation-Embedding

    This tool provides a fast and efficient way to convert text into vector embeddings and store them in the Qdrant search engine. Built with Rust, this tool is designed to handle large datasets and deliver lightning-fast search results.

    Language:Rust15100
  • monish-prabhu/Intra-Search

    A tool for performing semantic search within pdf documents leveraging sentence transformers.

    Language:Python15103
  • berecat/openai-pinecone-search

    Semantic search with openai's embeddings stored to pineconedb (vector database)

    Language:TypeScript14102
  • deadbits/vector-embedding-api

    Flask API for generating text embeddings using OpenAI or sentence_transformers

    Language:Python14251
  • TwitterCelebrityMatcher

    ahmedshahriar/TwitterCelebrityMatcher

    Match celebrity users with their respective tweets by making use of Semantic Textual Similarity on over 900+ celebrity users' 2.5 million+ scraped tweets utilizing SBERT, streamlit, tweepy and FastAPI

    Language:Python13202
  • angus-spence/loc2vec

    Learning semantic embeddings from OSM data: A Pytorch implementation of the loc2vec general method outlined in: https://sentiance.com/loc2vec-learning-location-embeddings-w-triplet-loss-networks.

    Language:Python11101
  • KristianMSchmidt/semantic-art-search

    Semantic Art Search – Explore art through meaning-driven search

    Language:Python11
  • piyush-eon/ai-portfolio-nextjs

    Build an AI Driven Portfolio App with NextJS and Tailwind CSS. We will learn advance AI Technologies like vector embedding and vector databases along with how to work with Open AI's APIs. This is an amazing project to impress recruiters a lot and showcase your skillset.

    Language:JavaScript9107
  • build-smarter-chatbots-workshop

    m-abdelwahab/build-smarter-chatbots-workshop

    Build an AI-powered chatbot that is able to access external data to provide the most accurate answer

    Language:TypeScript8101
  • baronet2/Bike2Vec

    Vector Embedding Representations of Road Cycling Riders and Races

    Language:Jupyter Notebook6300
  • Dr-Hutchinson/nicolay

    Nicolay is a digital history experiment that uses artificial intelligence to explore the speeches of Abraham Lincoln.

    Language:Python6300
  • dev-diaries41/smartscan-cli

    A Linux CLI tool powered by CLIP that enables comparison and automated organization of image and text files based on content similarity

    Language:Python50
  • worldbeater/code-vecs

    Code for the methods and algorithms described in the paper "Analysis of Program Representations Based on Abstract Syntax Trees and Higher-Order Markov Chains for Source Code Classification Task"

    Language:Jupyter Notebook5100