embeddings
There are 2881 repositories under embeddings topic.
supabase/supabase
The Postgres development platform. Supabase gives you a dedicated Postgres database to build your web, mobile, and AI applications.
chroma-core/chroma
Open-source search and retrieval database for AI applications.
Embedding/Chinese-Word-Vectors
100+ Chinese Word Vectors 上百种预训练中文词向量
h2oai/h2ogpt
Private chat with local GPT with document, images, video, etc. 100% private, Apache 2.0. Supports oLLaMa, Mixtral, llama.cpp, and more. Demo: https://gpt.h2o.ai/ https://gpt-docs.h2o.ai/
neuml/txtai
💡 All-in-one open-source AI framework for semantic search, LLM orchestration and language model workflows
FlagOpen/FlagEmbedding
Retrieval and Retrieval-augmented LLMs
langchain4j/langchain4j
LangChain4j is an open-source Java library that simplifies the integration of LLMs into Java applications through a unified API, providing access to popular LLMs and vector databases. It makes implementing RAG, tool calling (including support for MCP), and agents easy. LangChain4j integrates seamlessly with various enterprise Java frameworks.
apache/seatunnel
SeaTunnel is a multimodal, high-performance, distributed, massive data integration tool.
postgresml/postgresml
Postgres with GPUs for ML/AI apps.
KevinMusgrave/pytorch-metric-learning
The easiest way to use deep metric learning in your application. Modular, flexible, and extensible. Written in PyTorch.
Tencent/WeKnora
LLM-powered framework for deep document understanding, semantic retrieval, and context-aware answers using RAG paradigm.
lancedb/lance
Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, and PyTorch with more integrations coming..
shibing624/text2vec
text2vec, text to vector. 文本向量表征工具,把文本转化为向量矩阵,实现了Word2Vec、RankBM25、Sentence-BERT、CoSENT等文本表征、文本相似度计算模型,开箱即用。
Marker-Inc-Korea/AutoRAG
AutoRAG: An Open-Source Framework for Retrieval-Augmented Generation (RAG) Evaluation & Optimization with AutoML-Style Automation
brianpetro/obsidian-smart-connections
Chat with your notes & see links to related content with AI embeddings. Use local models or 100+ via APIs like Claude, Gemini, ChatGPT & Llama 3
huggingface/text-embeddings-inference
A blazing fast inference solution for text embeddings models
lightly-ai/lightly
A python library for self-supervised learning on images.
tensorflow/hub
A library for transfer learning by reusing parts of TensorFlow models.
towhee-io/towhee
Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.
filipecalegario/awesome-generative-ai
A curated list of Generative AI tools, works, models, and references
hegelai/prompttools
Open-source tools for prompt testing and experimentation, with support for both LLMs (e.g. OpenAI, LLaMA) and vector databases (e.g. Chroma, Weaviate, LanceDB).
crmne/ruby_llm
One beautiful Ruby API for OpenAI, Anthropic, Gemini, Bedrock, OpenRouter, DeepSeek, Ollama, VertexAI, Perplexity, Mistral, GPUStack & OpenAI compatible APIs. Chat, Vision, Audio, PDF, Images, Embeddings, Tools, Streaming & Rails integration.
eugeneyan/ml-surveys
📋 Survey papers summarizing advances in deep learning, NLP, CV, graphs, reinforcement learning, recommendations, graphs, etc.
SamurAIGPT/EmbedAI
An app to interact privately with your documents using the power of GPT, 100% privately, no data leaks
iterative/datachain
ETL, Analytics, Versioning for Unstructured Data
qdrant/fastembed
Fast, Accurate, Lightweight Python library to make State of the Art Embedding
axinc-ai/ailia-models
The collection of pre-trained, state-of-the-art AI models for ailia SDK
milvus-io/bootcamp
Dealing with all unstructured data, such as reverse image search, audio search, molecular search, video analysis, question and answer systems, NLP, etc.
PetrochukM/PyTorch-NLP
Basic Utilities for PyTorch Natural Language Processing (NLP)
vearch/vearch
Distributed vector search for AI-native applications
google/generative-ai-docs
Documentation for Google's Gen AI site - including the Gemini API and Gemma
Mintplex-Labs/vector-admin
The universal tool suite for vector database management. Manage Pinecone, Chroma, Qdrant, Weaviate and more vector databases with ease.
xlang-ai/instructor-embedding
[ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddings
featureform/featureform
The Virtual Feature Store. Turn your existing data infrastructure into a feature store.
lilianweng/stock-rnn
Predict stock market prices using RNN model with multilayer LSTM cells + optional multi-stock embeddings.
Kav-K/GPTDiscord
A robust, all-in-one GPT interface for Discord. ChatGPT-style conversations, image generation, AI-moderation, custom indexes/knowledgebase, youtube summarizer, and more!