text-similarity

There are 149 repositories under text-similarity topic.

  • Resume-Matcher

    srbhr/Resume-Matcher

    Resume Matcher is an open source, free tool to improve your resume. It works by using language models to compare and rank resumes with job descriptions.

    Language:Python5.5k30672.3k
  • text2vec

    shibing624/text2vec

    text2vec, text to vector. 文本向量表征工具,把文本转化为向量矩阵,实现了Word2Vec、RankBM25、Sentence-BERT、CoSENT等文本表征、文本相似度计算模型,开箱即用。

    Language:Python4.6k31150404
  • CLUEbenchmark/CLUEDatasetSearch

    搜索所有中文NLP数据集,附常用英文NLP数据集

    Language:Python4.2k6212614
  • NTMC-Community/awesome-neural-models-for-semantic-match

    A curated list of papers dedicated to neural text (semantic) matching.

    Language:HTML7755322122
  • murray-z/text_analysis_tools

    中文文本分析工具包(包括- 文本分类 - 文本聚类 - 文本相似性 - 关键词抽取 - 关键短语抽取 - 情感分析 - 文本纠错 - 文本摘要 - 主题关键词-同义词、近义词-事件三元组抽取)

    Language:Python69385125
  • SeanLee97/AnglE

    Train and Infer Powerful Sentence Embeddings with AnglE | 🔥 SOTA on STS and MTEB Leaderboard

    Language:Python502105034
  • fanghon/antiplag

    作业查重软件,它实现了程序代码、文档文本、图片之间的相似度检查。a code-similarity, text-similarity and image-similarity computation software for the codes, documents and images of assignment.

    Language:Java37981361
  • nlpodyssey/cybertron

    Cybertron: the home planet of the Transformers in Go

    Language:Go292133726
  • cjymz886/sentence-similarity

    对四种句子/文本相似度计算方法进行实验与比较

    Language:Python2908259
  • amansrivastava17/lstm-siamese-text-similarity

    ⚛️ It is keras based implementation of siamese architecture using lstm encoders to compute text similarity

    Language:Python28391088
  • dolos

    dodona-edu/dolos

    :detective: Source code plagiarism detection

    Language:TypeScript274727035
  • padeoe/cail2019

    法研杯2019相似案例匹配第二名解决方案(附数据集和文档),CAIL2020/2021司法考试赛道冠军队伍

    Language:Python24681239
  • tlatkowski/multihead-siamese-nets

    Implementation of Siamese Neural Networks built upon multihead attention mechanism for text semantic similarity task.

    Language:Jupyter Notebook18391643
  • awslabs/aws-ai-solution-kit

    Machine Learning APIs for common use cases, include: General OCR (Simplified/Traditional Chinese), Custom OCR, Image Similarity, Object Recognition, Face Detection, Face Comparison, Human Image Segmentation, Human Attribute Recognition, Pornography Detection, Image Super Resolution, Text Similarity, Car License Plate, etc.

    Language:Python166211225
  • lonePatient/TorchBlocks

    A PyTorch-based toolkit for natural language processing

    Language:Python1538526
  • yaoxiaoyuan/mimix

    Mimix: A Text Generation Tool and Pretrained Chinese Models

    Language:Python15332017
  • nityansuman/marvin

    Web app to automatically generate subjective or an objective test and evaluate user responses without any human intervention in an efficient and automatic manner using machine learning and natural language processing.

    Language:CSS1096133
  • IDEA-CCNL/GTS-Engine

    GTS Engine: A powerful NLU Training System。GTS引擎(GTS-Engine)是一款开箱即用且性能强大的自然语言理解引擎,聚焦于小样本任务,能够仅用小样本就能自动化生产NLP模型。

    Language:Python911169
  • ddangelov/RESTful-Top2Vec

    Expose a Top2Vec model with a REST API.

    Language:Python884520
  • adhaamehab/textblob-ar

    Arabic support for textblob

    Language:Python8581325
  • zake7749/CIKM-AnalytiCup-2018

    [ACM-CIKM] 2nd place solution at CIKM AnalytiCup 2018, a task for determining short text similarities.

    Language:Python766115
  • hellonlp/sentence-similarity

    文本相似度,语义向量,文本向量,text-similarity,similarity, sentence-similarity,BERT,SimCSE,BERT-Whitening,Sentence-BERT, PromCSE, SBERT

    Language:Python693111
  • Auto-Research

    sidphbot/Auto-Research

    Generate custom detailed survey paper with topic clustered sections and proper citations, from just a single query in just under 30 mins !!

    Language:Python57127
  • Lipairui/textgo

    Text preprocessing, representation, similarity calculation, text search and classification. Let's go and play with text!

    Language:Python43122
  • xiaorancs/text-similarity

    使用不同的方法计算相似度

    Language:Python42229
  • giacbrd/python-dandelion-eu

    A python client for connecting to all the services provided by https://dandelion.eu

    Language:Python3611915
  • simphile-text-similarity-nlp

    brianrisk/simphile-text-similarity-nlp

    Python Text Similarity NLP Libray

    Language:Python32423
  • siddgood/podcast-recommendation-engine

    :microphone: Building a content-based podcast recommender system using NLP

    Language:Jupyter Notebook30204
  • amansrivastava17/bns-short-text-similarity

    📖 Use Bi-normal Separation to find document vectors which is used to compute similarity for shorter sentences.

    Language:Python26303
  • ZhengZixiang/chip2019_task2_question_pairs_matching

    CHIP 2019平安医疗科技疾病问答迁移学习比赛baseline,rank7

    Language:Python26217
  • KeremZaman/semantic-sh

    semantic-sh is a SimHash implementation to detect and group similar texts by taking power of word vectors and transformer-based language models (BERT).

    Language:Python25353
  • Shivamrai15/Text-Similarity

    Two-part information retrieval system: 1) Pre-process text files, generate TF-IDF matrix and inverted index. 2) Retrieve relevant documents ranked by cosine similarity for given queries.

    Language:Python24100
  • aldebran97/AC

    AC自动机 文本相似检索 词库匹配 分词器

    Language:Java18100
  • sljavi/text-sound-similarity

    JavaScript library useful to find degrees of similarity between text's phonetics

    Language:JavaScript18201
  • themaximalist/vectordb.js

    Simple in-memory vector database for text similarity in Node.js

    Language:HTML18133
  • chiragjn/short-text-similarity

    Short Text Similarity as described in https://dl.acm.org/citation.cfm?id=2806475

    Language:Python16401