embedding

There are 458 repositories under the embedding topic.

  • chatchat-space/Langchain-Chatchat

    Langchain-Chatchat (formerly Langchain-ChatGLM): a local knowledge-base question-answering app built on Langchain and language models such as ChatGLM.

    Language:Python28.5k2643.2k5k
  • Embedding/Chinese-Word-Vectors

    100+ pre-trained Chinese word vectors

    Language:Python11.6k2861662.3k
  • PaddleNLP

    PaddlePaddle/PaddleNLP

    👑 Easy-to-use and powerful NLP and LLM library with 🤗 Awesome model zoo, supporting a wide range of NLP tasks from research to industrial applications, including 🗂 Text Classification, 🔍 Neural Search, ❓ Question Answering, ℹ️ Information Extraction, 📄 Document Intelligence, 💌 Sentiment Analysis, etc.

    Language:Python11.6k1033.4k2.8k
  • madawei2699/myGPTReader

    A community-driven way to read and chat with AI bots, powered by ChatGPT.

    Language:Python4.4k5134449
  • adambielski/siamese-triplet

    Siamese and triplet networks with online pair/triplet mining in PyTorch

    Language:Python3.1k5069631
  • awesome-community-detection

    benedekrozemberczki/awesome-community-detection

    A curated list of community detection research papers with implementations.

    Language:Python2.3k1098361
  • infiniflow/infinity

    The AI-native database built for LLM applications, providing incredibly fast full-text and vector search

    Language:C++1.9k22260151
  • msgi/nlp-journey

    Documents, papers, and code related to Natural Language Processing, including Topic Model, Word Embedding, Named Entity Recognition, Text Classification, Text Generation, Text Similarity, Machine Translation, etc. All code is implemented in TensorFlow 2.0.

    Language:Python1.6k625380
  • run-llama/LlamaIndexTS

    LlamaIndex is a data framework for your LLM applications

    Language:TypeScript1.4k17184281
  • pavlin-policar/openTSNE

    Extensible, parallel implementations of t-SNE

    Language:Python1.4k22132157
  • devflowinc/trieve

    All-in-one infrastructure for building search, recommendations, and RAG. Trieve combines search language models with tools for tuning ranking and relevance.

    Language:Rust1.1k753896
  • vercel/modelfusion

    The TypeScript library for building AI applications.

    Language:TypeScript977156373
  • SkywalkerDarren/chatWeb

    ChatWeb can crawl web pages, read PDF, DOCX, TXT, and extract the main content, then answer your questions based on the content, or summarize the key points.

    Language:Python8632015136
  • zhezhaoa/ngram2vec

    Four word embedding models implemented in Python, supporting arbitrary context features.

    Language:Python8366323174
  • myscale/MyScaleDB

    An open-source, high-performance SQL vector database built on ClickHouse.

    Language:C++699121031
  • shawroad/NLP_pytorch_project

    Embedding, NMT, Text_Classification, Text_Generation, NER etc.

    Language:Python551817116
  • cvxgrp/pymde

    Minimum-distortion embedding with PyTorch

    Language:Python52194527
  • OysterQAQ/ACG2vec

    ACG2vec (Anime Comics Games to vector) is committed to creating a playground that combines ACG and deep learning (text semantic retrieval, image-to-image search, semantic image search, image super-resolution, recommender systems).

  • xing61/xiaoyi-robot

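    A high-quality, stable OpenAI API interface for enterprises and developers. An OpenAI API proxy that supports ChatGPT API calls and the OpenAI API, including gpt-4 and gpt-3.5. No OpenAI key, no purchased OpenAI account, and no US-dollar bank card required; just call it directly. Stable and easy to use. 智增增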

    Language:PHP48051936
  • cvqluu/Angular-Penalty-Softmax-Losses-Pytorch

    Angular penalty loss functions in PyTorch (ArcFace, SphereFace, Additive Margin, CosFace)

    Language:Python476111891
  • marl/openl3

    OpenL3: Open-source deep audio and image embeddings

    Language:Jupyter Notebook424116557
  • ContextualAI/gritlm

    Generative Representational Instruction Tuning

    Language:Jupyter Notebook42283228
  • aquila

    Aquila-Network/aquila

    An easy-to-use neural search engine. Index latent vectors along with JSON metadata and do efficient k-NN search.

    Language:HTML374214225
  • guangzhengli/vectorhub

    Quickly and easily build an AI website or application using embeddings!

    Language:TypeScript3384740
  • yongzhuo/Macadam

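    Macadam is a natural language processing toolkit built on Tensorflow (Keras) and bert4keras, focused on text classification, sequence labeling, and relation extraction. It supports embeddings such as RANDOM, WORD2VEC, FASTTEXT, BERT, ALBERT, ROBERTA, NEZHA, XLNET, ELECTRA, and GPT-2; text classification algorithms such as FineTune, FastText, TextCNN, CharCNN, BiRNN, RCNN, DCNN, CRNN, DeepMoji, SelfAttention, HAN, and Capsule; and sequence labeling algorithms such as CRF, Bi-LSTM-CRF, CNN-LSTM, DGCNN, Bi-LSTM-LAN, Lattice-LSTM-Batch, and MRC.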

    Language:Python3238338
  • luyug/GradCache

    Run Effective Large Batch Contrastive Learning Beyond GPU/TPU Memory Constraint

    Language:Python31692519
  • PaddlePaddle/ERNIE-SDK

    ERNIE Bot Agent is a Large Language Model (LLM) Agent Framework, powered by the advanced capabilities of ERNIE Bot and the platform resources of Baidu AI Studio.

    Language:Jupyter Notebook315104944
  • marcominerva/ChatGptNet

    A ChatGPT integration library for .NET, supporting both OpenAI and Azure OpenAI Service

    Language:C#285155535
  • geeks-of-data/knowledge-gpt

    Extract knowledge from all information sources using GPT and other language models. Index information sources and run Q&A sessions over them.

    Language:Python27251151
  • snap-stanford/KGReasoning

    Multi-Hop Logical Reasoning in Knowledge Graphs

    Language:Python262101754
  • GEMSEC

    benedekrozemberczki/GEMSEC

    The TensorFlow reference implementation of 'GEMSEC: Graph Embedding with Self Clustering' (ASONAM 2019).

    Language:Python251151449
  • shahsohil/DCC

    This repository contains the source code and data for reproducing the results of the Deep Continuous Clustering paper.

    Language:Python20793253
  • amansrivastava17/embedding-as-service

    One-stop solution for encoding sentences into fixed-length vectors using various embedding techniques

    Language:Python201102629
  • DANMF

    benedekrozemberczki/DANMF

    A sparsity aware implementation of "Deep Autoencoder-like Nonnegative Matrix Factorization for Community Detection" (CIKM 2018).

    Language:Python20110340
  • sajari/word2vec

    Go library for performing computations in word2vec binary models

    Language:Go1909735
  • GraphWaveMachine

    benedekrozemberczki/GraphWaveMachine

    A scalable implementation of "Learning Structural Node Embeddings Via Diffusion Wavelets (KDD 2018)".

    Language:Python1847735