text-embeddings

There are 69 repositories under text-embeddings topic.

  • michaelfeil/infinity

    Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali

    Language:Python2.4k20231164
  • linkedin/detext

    DeText: A Deep Neural Text Understanding Framework for Ranking and Classification Tasks

    Language:Python1.3k3617136
  • nomic-ai/contrastors

    Train Models Contrastively in Pytorch

    Language:Python747144158
  • ZachNagengast/similarity-search-kit

    🔎 SimilaritySearchKit is a Swift package providing on-device text embeddings and semantic search functionality for iOS and macOS applications.

    Language:Swift488161845
  • yusufhilmi/client-vector-search

    A client side vector search library that can embed, store, search, and cache vectors. Works on the browser and node. It outperforms OpenAI's text-embedding-ada-002 and is way faster than Pinecone and other VectorDBs.

    Language:TypeScript2175615
  • rag-wtf/open-text-embeddings

    Open Source Text Embedding Models with OpenAI Compatible API

    Language:Python16022
  • milosgajdos/go-embeddings

    Go module for fetching embeddings from embeddings providers

    Language:Go53200
  • md-experiments/picture_text

    Interactive tree-maps with SBERT & Hierarchical Clustering (HAC)

    Language:Python30309
  • Sid2697/Word-recognition-EmbedNet-CAB

    Code implementation for our ICPR, 2020 paper titled "Improving Word Recognition using Multiple Hypotheses and Deep Embeddings"

    Language:Python21305
  • lakeraai/canica

    A text embedding viewer for the Jupyter environment

    Language:TypeScript20301
  • aws-samples/text-embeddings-pipeline-for-rag

    A pipeline to convert contextual knowledge stored in documents and databases into text embeddings, and store them in a vector store

    Language:TypeScript19101
  • amazon-science/text_generation_diffusion_llm_topic

    Topic Embedding, Text Generation and Modeling using diffusion

    Language:Python15314
  • deadbits/vector-embedding-api

    Flask API for generating text embeddings using OpenAI or sentence_transformers

    Language:Python14251
  • vinaykanigicherla/amazon_reviews_sentiment

    Sentiment Analysis on the Amazon Reviews Dataset using BERT-based transfer learning approach.

    Language:Jupyter Notebook10101
  • BjornMelin/stardex

    🌟 Stardex: Explore GitHub Stars Intelligently. Stardex is a powerful web app that lets you search, filter, and cluster any GitHub user's starred repositories. Discover hidden patterns and find your next favorite project with intelligent, AI-powered exploration.

    Language:TypeScript9180
  • easonlai/product_recommendations_with_gpt

    I have improved the demo by using Azure OpenAI’s Embedding model (text-embedding-ada-002), which has a powerful word embedding capability. This model can also vectorize product key phrases and recommend products based on cosine similarity, but with better results. You can find the updated repo here.

    Language:Jupyter Notebook9106
  • Navy10021/KRLawGPT

    KRLawGPT : Generative Pre-trained Transformer for producing Korean Legal Text

    Language:Python9101
  • lh0x00/docsifer

    Docsifer is a powerful tool for converting various data formats into Markdown for applications such as indexing, text analysis, and more. It supports PDF, PowerPoint, Word, Excel, Images, Audio, HTML, and other text-based formats, and leverages LLMs to enhance performance.

    Language:Python8100
  • mind-x

    sjy-dv/mind-x

    Mind-X is my intelligent alter ego that understands me the best. It assists with and resolves my bothersome tasks, growing in real-time as a next-generation PersonAI system.

    Language:Go8100
  • tlack/semantics

    Semantic similarity via text embeddings in Elixir - powered by SentenceTransformers by SBert.net

    Language:Elixir8201
  • lh0x00/lightweight-embeddings

    LightweightEmbeddings is a fast, free, and unlimited API service for multilingual embeddings and reranking, with support for both text and images and guaranteed uptime.

    Language:Python6102
  • zer0int/CLIP-gradient-ascent-embeddings

    Use CLIP to create matching texts + embeddings for given images; useful for XAI, adversarial training

    Language:Python610
  • amirmasoudaz/chatgpt-history-search

    A Python-based search engine for OpenAI's ChatGPT conversation history, enabling efficient semantic search and interactive engagement with archived chats using text embeddings

    Language:Python5101
  • themaximalist/embedding.js

    Easy embeddings for LLMs like gpt-3.5-turbo and gpt-4 using text-embedding-ada-002

    Language:JavaScript5200
  • cjboy76/askpdf

    Read PDF with AI.

    Language:Vue4100
  • dice-group/GATES

    Graph Attention Networks for Entity Summarization is the model that applies deep learning on graphs and ensemble learning on entity summarization tasks.

    Language:Python4610
  • ksm26/Understanding-and-Applying-Text-Embeddings

    Dive into the world of text embeddings. This course will guide you through leveraging text embeddings to enhance various natural language processing (NLP) tasks.

    Language:Jupyter Notebook4106
  • theatina/Stress_Detection

    M.Sc. mini project for NLP class (M908)

    Language:Python4101
  • damoncrockett/embeddingworld

    Fully client side web app for visualizing text embeddings

    Language:JavaScript30
  • salgadev/dolly-expert-lite

    A lightweight Dolly-v2 powered assistant that can answer domain-specific questions and keep a conversation. It's expert systems in the era of LLMs.

    Language:Jupyter Notebook3102
  • gurbaj5124871/rag-app-deepseek

    A RAG (Retrieval-Augmented Generation) application which combines retrieval-based and generative approaches to improve the accuracy and relevance of AI-generated responses.

    Language:Python2100
  • joshle298/Debrief

    A news insight summarizer designed to cut the noise out of media consumption (using LLMs & text-embeddings)

    Language:JavaScript2201
  • ksmin23/semantic-vector-search-with-sagemaker-pgvector

    A search application using Aurora Postgresql and pgvector for an online retail store product catalog

    Language:Jupyter Notebook210
  • mahadev0811/CollegeChatbot

    This project is a Q&A chatbot designed for the Global Academy of Technology (GAT), utilizing LLMs, Embeddings, RAG techniques and Prompt Engineering to provide accurate and context-aware responses to user queries about the college.

    Language:Jupyter Notebook2100
  • seonglae/tei

    Text Embeddings Inference (TEI)'s unofficial python wrapper library for batch processing with asyncio

    Language:Python2101
  • zeno129/DYANE

    DYnamic Attributed Node rolEs (DYANE) is an attributed dynamic-network generative model based on temporal motifs and attributed node behavior.

    Language:Python2100