embedding

There are 529 repositories under embedding topic.

  • chatchat-space/Langchain-Chatchat

    Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and Llama) RAG and Agent app with langchain

    Language:TypeScript32.6k2874k5.6k
  • PaddleNLP

    PaddlePaddle/PaddleNLP

    👑 Easy-to-use and powerful NLP and LLM library with 🤗 Awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications, including 🗂Text Classification, 🔍 Neural Search, ❓ Question Answering, ℹ️ Information Extraction, 📄 Document Intelligence, 💌 Sentiment Analysis etc.

    Language:Python12.2k1053.6k3k
  • Embedding/Chinese-Word-Vectors

    100+ Chinese Word Vectors 上百种预训练中文词向量

    Language:Python11.9k2851682.3k
  • myreader-io/myGPTReader

    A community-driven way to read and chat with AI bots - powered by chatGPT.

    Language:Python4.4k5334452
  • adambielski/siamese-triplet

    Siamese and triplet networks with online pair/triplet mining in PyTorch

    Language:Python3.1k5069634
  • infiniflow/infinity

    The AI-native database built for LLM applications, providing incredibly fast hybrid search of dense vector, sparse vector, tensor (multi-vector), and full-text

    Language:C++2.8k32461279
  • awesome-community-detection

    benedekrozemberczki/awesome-community-detection

    A curated list of community detection research papers with implementations.

    Language:Python2.3k1108362
  • run-llama/LlamaIndexTS

    Data framework for your LLM applications. Focus on server side solution

    Language:TypeScript2k17317375
  • devflowinc/trieve

    All-in-one infrastructure for search, recommendations, RAG, and analytics offered via API

    Language:Rust1.7k141k147
  • pavlin-policar/openTSNE

    Extensible, parallel implementations of t-SNE

    Language:Python1.5k22140166
  • vercel/modelfusion

    The TypeScript library for building AI applications.

    Language:TypeScript1.2k136684
  • node-llama-cpp

    withcatai/node-llama-cpp

    Run AI models locally on your machine with node.js bindings for llama.cpp. Enforce a JSON schema on the model output on the generation level

    Language:TypeScript1.1k149999
  • SkywalkerDarren/chatWeb

    ChatWeb can crawl web pages, read PDF, DOCX, TXT, and extract the main content, then answer your questions based on the content, or summarize the key points.

    Language:Python8912015136
  • myscale/MyScaleDB

    A @ClickHouse fork that supports high-performance vector search and full-text search.

    Language:C++889121646
  • zhezhaoa/ngram2vec

    Four word embedding models implemented in Python. Supporting arbitrary context features

    Language:Python8486323174
  • xing61/zzz-api

    优质稳定的OpenAI的API接口-For企业和开发者。OpenAI的api proxy,支持ChatGPT的API调用,支持openai的API接口,支持:gpt-4,gpt-3.5。不需要openai Key, 不需要买openai的账号,不需要美元的银行卡,通通不用的,直接调用就行,稳定好用!!智增增

    Language:PHP67962856
  • ContextualAI/gritlm

    Generative Representational Instruction Tuning

    Language:Jupyter Notebook57695341
  • shawroad/NLP_pytorch_project

    Embedding, NMT, Text_Classification, Text_Generation, NER etc.

    Language:Python558817120
  • cvxgrp/pymde

    Minimum-distortion embedding with PyTorch

    Language:Python540104727
  • OysterQAQ/ACG2vec

    ACG2vec (Anime Comics Games to vector) are committed to creating a playground that combines ACG and Deep learning.(文本语义检索、以图搜图、语义搜图、图片超分辨率、推荐系统)

  • cvqluu/Angular-Penalty-Softmax-Losses-Pytorch

    Angular penalty loss functions in Pytorch (ArcFace, SphereFace, Additive Margin, CosFace)

    Language:Python486111891
  • marl/openl3

    OpenL3: Open-source deep audio and image embeddings

    Language:Jupyter Notebook477116658
  • aquila

    Aquila-Network/aquila

    An easy to use Neural Search Engine. Index latent vectors along with JSON metadata and do efficient k-NN search.

    Language:HTML377214225
  • guangzhengli/vectorhub

    Quickly and easily build AI website or application by using embeddings!

    Language:TypeScript3724742
  • luyug/GradCache

    Run Effective Large Batch Contrastive Learning Beyond GPU/TPU Memory Constraint

    Language:Python36892924
  • PaddlePaddle/ERNIE-SDK

    ERNIE Bot Agent is a Large Language Model (LLM) Agent Framework, powered by the advanced capabilities of ERNIE Bot and the platform resources of Baidu AI Studio.

    Language:Jupyter Notebook352105352
  • llm-tools/embedJs

    A NodeJS RAG framework to easily work with LLMs and embeddings

    Language:TypeScript35069440
  • askaitools/askaitools-community-edition

    A cutting-edge search engine project tailored specifically for the AI product

    Language:TypeScript3411229
  • yongzhuo/Macadam

    Macadam是一个以Tensorflow(Keras)和bert4keras为基础,专注于文本分类、序列标注和关系抽取的自然语言处理工具包。支持RANDOM、WORD2VEC、FASTTEXT、BERT、ALBERT、ROBERTA、NEZHA、XLNET、ELECTRA、GPT-2等EMBEDDING嵌入; 支持FineTune、FastText、TextCNN、CharCNN、BiRNN、RCNN、DCNN、CRNN、DeepMoji、SelfAttention、HAN、Capsule等文本分类算法; 支持CRF、Bi-LSTM-CRF、CNN-LSTM、DGCNN、Bi-LSTM-LAN、Lattice-LSTM-Batch、MRC等序列标注算法。

    Language:Python3248338
  • marcominerva/ChatGptNet

    A ChatGPT integration library for .NET, supporting both OpenAI and Azure OpenAI Service

    Language:C#309175837
  • snap-stanford/KGReasoning

    Multi-Hop Logical Reasoning in Knowledge Graphs

    Language:Python289111957
  • geeks-of-data/knowledge-gpt

    Extract knowledge from all information sources using gpt and other language models. Index and make Q&A session with information sources.

    Language:Python28461153
  • GEMSEC

    benedekrozemberczki/GEMSEC

    The TensorFlow reference implementation of 'GEMSEC: Graph Embedding with Self Clustering' (ASONAM 2019).

    Language:Python254151450
  • redis/redis-vl-python

    Redis Vector Library (RedisVL) interfaces with Redis' vector database for realtime semantic search, RAG, and recommendation systems.

    Language:Python241115741
  • microsoft/rag-experiment-accelerator

    The RAG Experiment Accelerator is a versatile tool designed to expedite and facilitate the process of conducting experiments and evaluations using Azure Cognitive Search and RAG pattern.

    Language:Python2122424574
  • shahsohil/DCC

    This repository contains the source code and data for reproducing results of Deep Continuous Clustering paper

    Language:Python20993253