embeddings

There are 2881 repositories under embeddings topic.

  • supabase

    supabase/supabase

    The Postgres development platform. Supabase gives you a dedicated Postgres database to build your web, mobile, and AI applications.

    Language:TypeScript88.6k5984.7k9.8k
  • chroma-core/chroma

    Open-source search and retrieval database for AI applications.

    Language:Rust23.3k1051.4k1.8k
  • Embedding/Chinese-Word-Vectors

    100+ Chinese Word Vectors 上百种预训练中文词向量

    Language:Python12.1k2831682.3k
  • h2oai/h2ogpt

    Private chat with local GPT with document, images, video, etc. 100% private, Apache 2.0. Supports oLLaMa, Mixtral, llama.cpp, and more. Demo: https://gpt.h2o.ai/ https://gpt-docs.h2o.ai/

    Language:Python11.9k1561.2k1.3k
  • txtai

    neuml/txtai

    💡 All-in-one open-source AI framework for semantic search, LLM orchestration and language model workflows

    Language:Python11.6k111916740
  • FlagOpen/FlagEmbedding

    Retrieval and Retrieval-augmented LLMs

    Language:Python10.5k531.2k782
  • langchain4j/langchain4j

    LangChain4j is an open-source Java library that simplifies the integration of LLMs into Java applications through a unified API, providing access to popular LLMs and vector databases. It makes implementing RAG, tool calling (including support for MCP), and agents easy. LangChain4j integrates seamlessly with various enterprise Java frameworks.

    Language:Java9k1061.6k1.7k
  • apache/seatunnel

    SeaTunnel is a multimodal, high-performance, distributed, massive data integration tool.

    Language:Java8.8k1724.2k2.1k
  • postgresml/postgresml

    Postgres with GPUs for ML/AI apps.

    Language:Rust6.5k55256337
  • pytorch-metric-learning

    KevinMusgrave/pytorch-metric-learning

    The easiest way to use deep metric learning in your application. Modular, flexible, and extensible. Written in PyTorch.

    Language:Python6.2k62523665
  • Tencent/WeKnora

    LLM-powered framework for deep document understanding, semantic retrieval, and context-aware answers using RAG paradigm.

    Language:Go5.6k37206599
  • lancedb/lance

    Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, and PyTorch with more integrations coming..

    Language:Rust5.4k511.6k462
  • text2vec

    shibing624/text2vec

    text2vec, text to vector. 文本向量表征工具,把文本转化为向量矩阵,实现了Word2Vec、RankBM25、Sentence-BERT、CoSENT等文本表征、文本相似度计算模型,开箱即用。

    Language:Python4.8k31154418
  • AutoRAG

    Marker-Inc-Korea/AutoRAG

    AutoRAG: An Open-Source Framework for Retrieval-Augmented Generation (RAG) Evaluation & Optimization with AutoML-Style Automation

    Language:Python4.3k33627338
  • obsidian-smart-connections

    brianpetro/obsidian-smart-connections

    Chat with your notes & see links to related content with AI embeddings. Use local models or 100+ via APIs like Claude, Gemini, ChatGPT & Llama 3

    Language:JavaScript4.1k41826246
  • huggingface/text-embeddings-inference

    A blazing fast inference solution for text embeddings models

    Language:Rust4k39326307
  • lightly-ai/lightly

    A python library for self-supervised learning on images.

    Language:Python3.5k28589310
  • tensorflow/hub

    A library for transfer learning by reusing parts of TensorFlow models.

    Language:Python3.5k1507101.6k
  • towhee-io/towhee

    Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.

    Language:Python3.4k29670262
  • awesome-generative-ai

    filipecalegario/awesome-generative-ai

    A curated list of Generative AI tools, works, models, and references

  • hegelai/prompttools

    Open-source tools for prompt testing and experimentation, with support for both LLMs (e.g. OpenAI, LLaMA) and vector databases (e.g. Chroma, Weaviate, LanceDB).

    Language:Python2.9k3060246
  • crmne/ruby_llm

    One beautiful Ruby API for OpenAI, Anthropic, Gemini, Bedrock, OpenRouter, DeepSeek, Ollama, VertexAI, Perplexity, Mistral, GPUStack & OpenAI compatible APIs. Chat, Vision, Audio, PDF, Images, Embeddings, Tools, Streaming & Rails integration.

    Language:Ruby2.9k21185254
  • ml-surveys

    eugeneyan/ml-surveys

    📋 Survey papers summarizing advances in deep learning, NLP, CV, graphs, reinforcement learning, recommendations, graphs, etc.

  • SamurAIGPT/EmbedAI

    An app to interact privately with your documents using the power of GPT, 100% privately, no data leaks

    Language:JavaScript2.8k3569299
  • datachain

    iterative/datachain

    ETL, Analytics, Versioning for Unstructured Data

    Language:Python2.7k17347124
  • qdrant/fastembed

    Fast, Accurate, Lightweight Python library to make State of the Art Embedding

    Language:Python2.4k17195154
  • axinc-ai/ailia-models

    The collection of pre-trained, state-of-the-art AI models for ailia SDK

    Language:Python2.3k53806347
  • milvus-io/bootcamp

    Dealing with all unstructured data, such as reverse image search, audio search, molecular search, video analysis, question and answer systems, NLP, etc.

    Language:Jupyter Notebook2.2k35266645
  • PetrochukM/PyTorch-NLP

    Basic Utilities for PyTorch Natural Language Processing (NLP)

    Language:Python2.2k5469255
  • vearch

    vearch/vearch

    Distributed vector search for AI-native applications

    Language:Go2.2k76585348
  • google/generative-ai-docs

    Documentation for Google's Gen AI site - including the Gemini API and Gemma

    Language:Jupyter Notebook2.1k78126679
  • vector-admin

    Mintplex-Labs/vector-admin

    The universal tool suite for vector database management. Manage Pinecone, Chroma, Qdrant, Weaviate and more vector databases with ease.

    Language:TypeScript2k2794323
  • xlang-ai/instructor-embedding

    [ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddings

    Language:Python2k18113155
  • featureform

    featureform/featureform

    The Virtual Feature Store. Turn your existing data infrastructure into a feature store.

    Language:Go1.9k1415799
  • lilianweng/stock-rnn

    Predict stock market prices using RNN model with multilayer LSTM cells + optional multi-stock embeddings.

    Language:Python1.9k11528681
  • Kav-K/GPTDiscord

    A robust, all-in-one GPT interface for Discord. ChatGPT-style conversations, image generation, AI-moderation, custom indexes/knowledgebase, youtube summarizer, and more!

    Language:Python1.9k29238293