similarity

There are 371 repositories under the similarity topic.

  • microsoft/computervision-recipes

    Best Practices, code samples, and documentation for Computer Vision.

    Language:Jupyter Notebook9.3k2862591.2k
  • shibing624/text2vec

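    text2vec, text to vector. A text vector representation tool that converts text into vector matrices. It implements text representation and text similarity models including Word2Vec, RankBM25, Sentence-BERT, and CoSENT, and works out of the box.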

    Language:Python4.2k29146379
  • msgi/nlp-journey

    Documents, papers, and code related to Natural Language Processing, including Topic Model, Word Embedding, Named Entity Recognition, Text Classification, Text Generation, Text Similarity, Machine Translation, etc. All code is implemented in TensorFlow 2.0.

    Language:Python1.6k625380
  • shibing624/similarity

    similarity: a text similarity calculation toolkit written in Java. It can be used for text similarity calculation, sentiment analysis, and related tasks, and works out of the box.

    Language:Java1.4k4037322
  • kornelski/dssim

    Image similarity comparison simulating human perception (multiscale SSIM in Rust)

    Language:Rust1k227273
  • luozhouyang/python-string-similarity

    A library implementing different string similarity and distance measures using Python.

    Language:Python9732521126
  • J535D165/recordlinkage

    A powerful and modular toolkit for record linkage and duplicate detection in Python

    Language:Python91732135150
  • yaleimeng/Final_word_Similarity

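    Combines the word similarity calculation methods of the extended Tongyici Cilin synonym thesaurus and HowNet, giving broader vocabulary coverage and more accurate results.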

    Language:Python7101510153
  • shibing624/similarities

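    Similarities: a toolkit for similarity calculation and semantic search. It supports text-to-text, text-to-image, and image-to-image search over datasets on the scale of hundreds of millions of records, is developed in Python 3, and works out of the box.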

    Language:Python64783264
  • yongzhuo/Macropodus

    Macropodus, a natural language processing toolkit based on an Albert+BiLSTM+CRF deep learning architecture. Features: Chinese word segmentation (CWS), part-of-speech (POS) tagging, named entity recognition (NER), new word discovery, keyword extraction, text summarization, text similarity, a scientific calculator, conversion between Chinese numerals and Arabic (or Roman) numerals, Traditional/Simplified Chinese conversion, and pinyin conversion.

    Language:Python647141494
  • aetilius/pHash

    pHash - the open source perceptual hash library

    Language:C++529172480
  • Brokenwind/BertSimilarity

    Computing the similarity of two sentences with Google's BERT algorithm. Semantic similarity and text similarity calculation.

    Language:Python47471570
  • CogComp/cogcomp-nlp

    CogComp's Natural Language Processing Libraries and Demos: Modules include lemmatizer, ner, pos, prep-srl, quantifier, question type, relation-extraction, similarity, temporal normalizer, tokenizer, transliteration, verb-sense, and more.

    Language:Java46963385144
  • elisemercury/Duplicate-Image-Finder

    difPy - Python package for finding duplicate or similar images within folders

    Language:Python40766064
  • eulerto/pg_similarity

    Set of functions and operators for executing similarity queries

    Language:C355153237
  • BiLiangLtd/WordSimilarity

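    A word similarity calculation method based on the extended Tongyici Cilin synonym thesaurus from Harbin Institute of Technology (HIT)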

    Language:Python3537772
  • dingkeyan93/DISTS

    IQA: Deep Image Structure and Texture Similarity Metric

    Language:Python35041241
  • Dicklesworthstone/fast_vector_similarity

    The Fast Vector Similarity Library is designed to provide efficient computation of various similarity measures between vectors.

    Language:Rust3255111
  • WenRichard/Customer-Chatbot

    Chinese intelligent customer service chatbot demo, including a chitchat part and a professional Q&A (FAQ) part, with support for custom components.

    Language:Python30788112
  • life4/textdistance.rs

    🦀📏 Rust library to compare strings (or any sequences). 25+ algorithms, pure Rust, common interface, Unicode support.

    Language:Rust255309
  • bitlap/geocoding

    Geocoding technology providing address standardization and similarity calculation.

    Language:Kotlin231104385
  • uhh-lt/sensegram

    Making sense embeddings out of word embeddings using graph-based word sense induction

    Language:Python212313250
  • matiskay/html-similarity

    Compare HTML similarity using structural and style metrics

    Language:Python2065423
  • NLP-kr/tensorflow-ml-nlp

    Natural language processing starting with TensorFlow and machine learning (from logistic regression to a Transformer chatbot)

    Language:Jupyter Notebook1992232106
  • Miraclemarvel55/ChatGLM-RLHF

    Apply RLHF directly to ChatGLM to raise or lower the probability of target outputs (modify ChatGLM output with only RLHF)

    Language:Python1861726
  • brentlintner/synt

    Find similar functions and classes in your JavaScript/TypeScript code

    Language:TypeScript1825837
  • DengBoCong/text-similarity

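    Text similarity (matching) calculation, providing baseline, training, inference, metric analysis, and more. The code includes both TensorFlow and PyTorch versions.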

    Language:Python1682630
  • andreekeberg/ml-classify-text-js

    Machine learning based text classification in JavaScript using n-grams and cosine similarity

    Language:JavaScript11971112
  • xiaobeicn/text-similarity-php

    Text similarity calculation using the cosine theorem (cosine similarity) plus word segmentation, PHP version

    Language:PHP10810255
  • usc-isi-i2/rltk

    Record Linkage ToolKit (Find and link entities)

    Language:Python103121523
  • bnosac/ruimtehol

    R package to Embed All the Things! using StarSpace

    Language:C++100114013
  • kiwirafe/xiangshi

    Chinese text similarity calculator

    Language:Python974620
  • google/unisim

    UniSim is a package for efficient similarity computation, fuzzy matching, and clustering of data.

    Language:Python84593
  • andre-st/goodreads-toolbox

    9 tools for Goodreads.com, for finding people based on the books they’ve read, finding books popular among the people you follow, following new book reviews, etc

    Language:Perl807407
  • wooorm/levenshtein.c

    Levenshtein algorithm in C

    Language:C808014