embedding-models

There are 156 repositories under the embedding-models topic.

  • Separius/awesome-sentence-embedding

    A curated list of pretrained sentence and word embedding models

    Language:Python2.2k7819261
  • Hironsan/awesome-embedding-models

    A curated list of awesome embedding models tutorials, projects and communities.

    Language:Jupyter Notebook1.8k1084252
  • Sujit-O/pykg2vec

    Python library for knowledge graph embedding and representation learning.

    Language:Python6081398111
  • ContextualAI/gritlm

    Generative Representational Instruction Tuning

    Language:Jupyter Notebook56794840
  • marl/openl3

    OpenL3: Open-source deep audio and image embeddings

    Language:Jupyter Notebook468116558
  • StarlightSearch/EmbedAnything

    A minimalist yet highly performant, lightweight, lightning-fast, multisource, multimodal, and local embedding solution, built in Rust.

    Language:Rust30461625
  • BBC-Esq/VectorDB-Plugin-for-LM-Studio

    Plugin that lets you ask questions about your documents including audio and video files.

    Language:Python291625736
  • mana-ysh/knowledge-graph-embeddings

    Implementations of Embedding-based methods for Knowledge Base Completion tasks

    Language:Python257251363
  • CVxTz/image_search_engine

    Image search engine

    Language:Python23315740
  • lgalke/vec4ir

    Word Embeddings for Information Retrieval

    Language:Python22613642
  • spcl/ncc

    Neural Code Comprehension: A Learnable Representation of Code Semantics

    Language:Python206133151
  • akutuzov/webvectors

    Web-ify your word2vec: framework to serve distributional semantic models online

    Language:Python197123748
  • yusufhilmi/client-vector-search

    A client side vector search library that can embed, store, search, and cache vectors. Works on the browser and node. It outperforms OpenAI's text-embedding-ada-002 and is way faster than Pinecone and other VectorDBs.

    Language:TypeScript1705510
  • jgraving/selfsne

    Self-Supervised Noise Embeddings (Self-SNE)

    Language:Jupyter Notebook15832013
  • formath/tensorflow-predictor-cpp

    TensorFlow prediction using the C++ API

    Language:Python14981158
  • p768lwy3/torecsys

    ToR[e]cSys is a PyTorch framework for implementing recommendation system algorithms, including but not limited to click-through-rate (CTR) prediction, learning-to-rank (LTR), and matrix/tensor embedding. The project's objective is to develop an ecosystem for experimenting with, sharing, reproducing, and deploying models in real-world settings smoothly and easily.

    Language:Python1015317
  • shamspias/langchain-chat

    langchain-chat is an AI-driven Q&A system that leverages OpenAI's GPT-4 model and FAISS for efficient document indexing. It loads and splits documents from websites or PDFs, remembers conversations, and provides accurate, context-aware answers based on the indexed data. Easy to set up and extend.

    Language:Python868117
  • Denis2054/RAG-Driven-Generative-AI

    This repository provides programs to build Retrieval Augmented Generation (RAG) code for Generative AI with LlamaIndex, Deep Lake, and Pinecone leveraging the power of OpenAI and Hugging Face models for generation and evaluation.

    Language:Jupyter Notebook836028
  • kaushalshetty/Positional-Encoding

    Encoding position with the word embeddings.

    Language:Jupyter Notebook836213
  • ikergarcia1996/MetaVec

    A monolingual and cross-lingual meta-embedding generation and evaluation framework

    Language:Python80415
  • D2KLab/entity2vec

    Generates a set of property-specific entity embeddings from knowledge graphs using node2vec

    Language:Python7711324
  • ART-Group-it/KERMIT

    🐸 KERMIT - A lightweight library to encode and interpret Universal Syntactic Embeddings

    Language:JavaScript58719
  • datquocnguyen/STransE

    STransE: a novel embedding model of entities and relationships in knowledge bases (NAACL 2016)

    Language:C++557216
  • RoyZhengGao/edge2vec

    Learning node representation using edge semantics

    Language:Python525722
  • Glaciohound/VCML

    PyTorch implementation of the paper "Visual Concept-Metaconcept Learner", NeurIPS 2019

    Language:Python48388
  • nsrinidhibhat/gradio_RAG

    Code and resources showcasing the Retrieval-Augmented Generation (RAG) technique, a solution for enhancing data freshness in Large Language Models (LLMs). Incorporate up-to-date external knowledge into LLM-generated responses. Additionally, this repository includes a Gradio-based user interface for seamless model deployment.

    Language:Python422014
  • alisonbma/aiSFX

    Representation Learning for the Automatic Indexing of Sound Effects Libraries (ISMIR 2022): Deep audio embeddings pre-trained on UCS & Non-UCS-compliant datasets.

    Language:Python41424
  • su-park/mteb_ko_leaderboard

    Korean text embedding model leaderboard

  • worldbank/GISTEmbed

    GISTEmbed: Guided In-sample Selection of Training Negatives for Text Embeddings

    Language:Python37421
  • databricks-industry-solutions/product-search

    Semantic product search on Databricks

    Language:Python291214
  • thustorage/PetPS

    PetPS: Supporting Huge Embedding Models with Tiered Memory

    Language:C++29302
  • Wang-Yu-Qing/UTPM

    Code for paper: Learning to Build User-tag Profile in Recommendation System

    Language:Python29295
  • BoYanSTKO/place2vec

    Place2Vec ground truth dataset

  • maxscheurer/cppe

    C++ and Python library for Polarizable Embedding

    Language:C++225175
  • UWNETLAB/dcss_supplementary

    Supplementary materials for McLevey 2021 Doing Computational Social Science (Sage, UK).

    Language:HTML195149
  • ritaranx/BMRetriever

    [EMNLP 2024] This is the code for our paper "BMRetriever: Tuning Large Language Models as Better Biomedical Text Retrievers".

    Language:Python17332