embeddings

There are 2278 repositories under embeddings topic.

  • supabase

    supabase/supabase

    The open source Firebase alternative. Supabase gives you a dedicated Postgres database to build your web, mobile, and AI applications.

    Language:TypeScript79.9k5514.2k8.1k
  • mem0ai/mem0

    The Memory layer for AI Agents

    Language:Python26.9k1447842.6k
  • chroma-core/chroma

    the AI-native open-source embedding database

    Language:Rust19k1021.3k1.5k
  • Embedding/Chinese-Word-Vectors

    100+ Chinese Word Vectors 上百种预训练中文词向量

    Language:Python12k2831682.3k
  • h2oai/h2ogpt

    Private chat with local GPT with document, images, video, etc. 100% private, Apache 2.0. Supports oLLaMa, Mixtral, llama.cpp, and more. Demo: https://gpt.h2o.ai/ https://gpt-docs.h2o.ai/

    Language:Python11.7k1561.2k1.3k
  • txtai

    neuml/txtai

    💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows

    Language:Python10.6k105852676
  • FlagOpen/FlagEmbedding

    Retrieval and Retrieval-augmented LLMs

    Language:Python9.1k541.2k657
  • langchain4j/langchain4j

    Java version of LangChain

    Language:Java6.7k921.2k1.3k
  • postgresml/postgresml

    Postgres with GPUs for ML/AI apps.

    Language:Rust6.2k55256314
  • pytorch-metric-learning

    KevinMusgrave/pytorch-metric-learning

    The easiest way to use deep metric learning in your application. Modular, flexible, and extensible. Written in PyTorch.

    Language:Python6.1k62521657
  • text2vec

    shibing624/text2vec

    text2vec, text to vector. 文本向量表征工具,把文本转化为向量矩阵,实现了Word2Vec、RankBM25、Sentence-BERT、CoSENT等文本表征、文本相似度计算模型,开箱即用。

    Language:Python4.7k31154407
  • lancedb/lance

    Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, and PyTorch with more integrations coming..

    Language:Rust4.3k471.2k269
  • AutoRAG

    Marker-Inc-Korea/AutoRAG

    AutoRAG: An Open-Source Framework for Retrieval-Augmented Generation (RAG) Evaluation & Optimization with AutoML-Style Automation

    Language:Python3.7k31604291
  • tensorflow/hub

    A library for transfer learning by reusing parts of TensorFlow models.

    Language:Python3.5k1537061.7k
  • obsidian-smart-connections

    brianpetro/obsidian-smart-connections

    Chat with your notes & see links to related content with AI embeddings. Use local models or 100+ via APIs like Claude, Gemini, ChatGPT & Llama 3

    Language:JavaScript3.5k37686201
  • huggingface/text-embeddings-inference

    A blazing fast inference solution for text embeddings models

    Language:Rust3.4k38315233
  • towhee-io/towhee

    Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.

    Language:Python3.3k29671258
  • lightly-ai/lightly

    A python library for self-supervised learning on images.

    Language:Python3.3k28588291
  • ml-surveys

    eugeneyan/ml-surveys

    📋 Survey papers summarizing advances in deep learning, NLP, CV, graphs, reinforcement learning, recommendations, graphs, etc.

  • hegelai/prompttools

    Open-source tools for prompt testing and experimentation, with support for both LLMs (e.g. OpenAI, LLaMA) and vector databases (e.g. Chroma, Weaviate, LanceDB).

    Language:Python2.8k3060237
  • SamurAIGPT/EmbedAI

    An app to interact privately with your documents using the power of GPT, 100% privately, no data leaks

    Language:JavaScript2.8k3569300
  • awesome-generative-ai

    filipecalegario/awesome-generative-ai

    A curated list of Generative AI tools, works, models, and references

  • datachain

    iterative/datachain

    ETL, Analytics, Versioning for Unstructured Data

    Language:Python2.5k17248107
  • PetrochukM/PyTorch-NLP

    Basic Utilities for PyTorch Natural Language Processing (NLP)

    Language:Python2.2k5469256
  • axinc-ai/ailia-models

    The collection of pre-trained, state-of-the-art AI models for ailia SDK

    Language:Python2.2k52784341
  • vearch

    vearch/vearch

    Distributed vector search for AI-native applications

    Language:Go2.1k76582337
  • google/generative-ai-docs

    Documentation for Google's Gen AI site - including the Gemini API and Gemma

    Language:Jupyter Notebook1.9k73124676
  • xlang-ai/instructor-embedding

    [ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddings

    Language:Python1.9k18112145
  • qdrant/fastembed

    Fast, Accurate, Lightweight Python library to make State of the Art Embedding

    Language:Python1.9k16194127
  • featureform

    featureform/featureform

    The Virtual Feature Store. Turn your existing data infrastructure into a feature store.

    Language:Jupyter Notebook1.9k1415797
  • lilianweng/stock-rnn

    Predict stock market prices using RNN model with multilayer LSTM cells + optional multi-stock embeddings.

    Language:Python1.9k11528677
  • Kav-K/GPTDiscord

    A robust, all-in-one GPT interface for Discord. ChatGPT-style conversations, image generation, AI-moderation, custom indexes/knowledgebase, youtube summarizer, and more!

    Language:Python1.8k30238293
  • crmne/ruby_llm

    A delightful Ruby way to work with AI. No configuration madness, no complex callbacks, no handler hell – just beautiful, expressive Ruby code.

    Language:Ruby1.8k173369
  • yongzhuo/Keras-TextClassification

    中文长文本分类、短句子分类、多标签分类、两句子相似度(Chinese Text Classification of Keras NLP, multi-label classify, or sentence classify, long or short),字词句向量嵌入层(embeddings)和网络层(graph)构建基类,FastText,TextCNN,CharCNN,TextRNN, RCNN, DCNN, DPCNN, VDCNN, CRNN, Bert, Xlnet, Albert, Attention, DeepMoji, HAN, 胶囊网络-CapsuleNet, Transformer-encode, Seq2seq, SWEM, LEAM, TextGCN

    Language:Python1.8k3388404
  • Hironsan/awesome-embedding-models

    A curated list of awesome embedding models tutorials, projects and communities.

    Language:Jupyter Notebook1.8k1064252
  • vector-admin

    Mintplex-Labs/vector-admin

    The universal tool suite for vector database management. Manage Pinecone, Chroma, Qdrant, Weaviate and more vector databases with ease.

    Language:TypeScript1.8k2694285