text-embedding
There are 133 repositories under text-embedding topic.
embeddings-benchmark/mteb
MTEB: Massive Text Embedding Benchmark
xlang-ai/instructor-embedding
[ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddings
Muennighoff/sgpt
SGPT: GPT Sentence Embeddings for Semantic Search
ContextualAI/gritlm
Generative Representational Instruction Tuning
SeanLee97/AnglE
Train and Infer Powerful Sentence Embeddings with AnglE | 🔥 SOTA on STS and MTEB Leaderboard
staoxiao/RetroMAE
Codebase for RetroMAE and beyond.
hugochan/KATE
Code & data accompanying the KDD 2017 paper "KATE: K-Competitive Autoencoder for Text"
VectorInstitute/vector-inference
Efficient LLM inference on Slurm clusters using vLLM.
milosgajdos/go-embeddings
Go module for fetching embeddings from embeddings providers
alash3al/vecdb
a vector embedding database with multiple storage engines and AI embedding integrations
mehdidc/DALLE_clip_score
Simple script to compute CLIP-based scores given a DALL-e trained model.
AstraBert/SenTrEv
Simple customizable evaluation for text retrieval performance of Sentence Transformers embedders on PDFs
lakeraai/canica
A text embedding viewer for the Jupyter environment
kidist-amde/amharic-ir-benchmarks
Official codebase for the ACL 2025 Findings paper: Optimized Text Embedding Models and Benchmarks for Amharic Passage Retrieval.
cloudera/CML_AMP_Few-Shot_Text_Classification
Perform topic classification on news articles in several limited-labeled data regimes.
izhx/uni-rep
Code for embedding and retrieval research.
snapfinger/HSTU-BLaIR
HSTU-BLaIR: Lightweight Contrastive Text Embedding for Generative Recommender 🌱
mehdidc/clip_rerank
Simple script to re-rank images using OpenAI's CLIP https://github.com/openai/CLIP.
amazon-science/text_generation_diffusion_llm_topic
Topic Embedding, Text Generation and Modeling using diffusion
NhanPhamThanh-IT/ML-Article-Classifier
🧠 ML-Article-Classifier is a modular Python project for classifying articles using advanced NLP techniques. It features sentence embeddings, clustering, and classification utilities, with Jupyter notebook demos, extensible helper functions, and best practices for research and production use.
deadbits/vector-embedding-api
Flask API for generating text embeddings using OpenAI or sentence_transformers
nhattruongpham/mmser
SERVER: Multi-modal Speech Emotion Recognition using Transformer-based and Vision-based Embeddings
UBOS-tech/node-red-contrib-openai-ubos
A Node-RED node that interacts with OpenAI machine learning models to generate text like ChatGPT
brainsqueeze/text2vec
Contextual embedding for text blobs.
leungkimming/SK-DocumentSearch
Using Semantic Kernel to obtain answer from a PDF document, with embeddings stored in Redis and HuggingFace to create embeddings.
easonlai/product_recommendations_with_gpt
I have improved the demo by using Azure OpenAI’s Embedding model (text-embedding-ada-002), which has a powerful word embedding capability. This model can also vectorize product key phrases and recommend products based on cosine similarity, but with better results. You can find the updated repo here.
rafalposwiata/pl-mteb
PL-MTEB: Polish Massive Text Embedding Benchmark
sjy-dv/mind-x
Mind-X is my intelligent alter ego that understands me the best. It assists with and resolves my bothersome tasks, growing in real-time as a next-generation PersonAI system.
Aldenhovel/image-retrieval
An image retrieval engine . 图像检索系统。
jarvis0/image-search
🌄 Search images through text by writing a caption or a description. You will be intelligently assisted while typing.
Koldim2001/RAG_LLM
Чат-бот с LLL + RAG
bysiber/emimg-GUI
Image Steganography GUI | Easily Hide Text Files within Images with User-Friendly GUI | Pyton Tool
memfreeme/fast-embedding
Fast text embedding service
yanliang12/bert_text_embedding
Embedding a text to a vector by pre-trained BERT word embeddings and pooling layers, for the pur[ose of text similarity measuring
rosette-api/csharp
Babel Street Analytics Client Library for C#
nmdra/Semantic-Search
A semantic search system built with PostgreSQL and pgvector, powered by Gemini for generating text embeddings..