string-similarity
There are 102 repositories under the string-similarity topic.
rapidfuzz/RapidFuzz
Rapid fuzzy string matching in Python using various string metrics
aceakash/string-similarity
Finds the degree of similarity between two strings based on Dice's coefficient, which is mostly better than Levenshtein distance.
adrg/strutil
Go metrics for calculating string similarity and other string utility functions
rapidfuzz/Levenshtein
The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity
rapidfuzz/rapidfuzz-cpp
Rapid fuzzy string matching in C++ using the Levenshtein Distance
rieck/harry
A Tool for Measuring String Similarity
usc-isi-i2/rltk
Record Linkage ToolKit (Find and link entities)
Daniel-Liu-c0deb0t/triple_accel
Rust edit distance routines accelerated using SIMD. Supports fast Hamming, Levenshtein, restricted Damerau-Levenshtein, etc. distance calculations and string search.
rapidfuzz/python-Levenshtein
The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity
stephenjjbrown/string-similarity-js
Lightweight string similarity function for JavaScript
agext/levenshtein
Levenshtein distance and similarity metrics with customizable edit costs and Winkler-like bonus for common prefix.
searchhub/preDict
Lightning fast spell correction / fuzzy search library based on SymSpell by Commerce-Experts
vickumar1981/stringdistance
A fuzzy matching string distance library for Scala and Java that includes Levenshtein distance, Jaro distance, Jaro-Winkler distance, Dice coefficient, N-Gram similarity, Cosine similarity, Jaccard similarity, Longest common subsequence, Hamming distance, and more.
Daniel-Liu-c0deb0t/UMICollapse
Accelerating the deduplication and collapsing process for reads with Unique Molecular Identifiers (UMI). Heavily optimized for scalability and orders of magnitude faster than a previous tool.
rapidfuzz/JaroWinkler
Python library for fast approximate string matching using Jaro and Jaro-Winkler similarity
hyperjumptech/beda
Beda is a Golang library for detecting how similar two strings are
hbakhtiyor/strsim
String similarity based on Dice's coefficient in Go
umbertogriffo/Trie
A Mixed Trie and Levenshtein distance implementation in Java for extremely fast prefix string searching and string similarity.
iesl/learned-string-alignments
Learning String Alignments for Entity Aliases
iesl/stance
Learned string similarity for entity names using optimal transport.
lewinfox/levitate
Fuzzy string matching in R. Inspired by Python's thefuzz (but without the Python).
frizensami/plagiarism-basic
Offline quick-and-dirty text plagiarism checker written in Rust
ecomp-shONgit/string-distance
A set of (string) distance functions written in JavaScript / Python / PHP.
rapidfuzz/CyDifflib
CyDifflib is a fast implementation of difflib's algorithms, which can be used as a drop-in replacement.
ngmarchant/comparator
Similarity and distance measures for clustering and record linkage applications in R
mehrandvd/Simila
A project for string similarities.
NikkelM/Steam-App-ID-Finder
Automatically get the Steam App IDs for games you own on Steam, Epic Games or GOG. Or, simply provide a list of game names and the best matches will be found for you.
selmi-karim/dice-similarity-coeff
Finds the similarity between two strings based on the Dice similarity coefficient (DSC)
dgraham/scores
String similarity ranking for Vim's CtrlP fuzzy file finder.
bitfoundation/Simila
A project for string similarities.
jarvis0/image-search
🌄 Search images through text by writing a caption or a description. You will be intelligently assisted while typing.
ywu94/python-text-distance
A Python implementation of a variety of text/string distance and similarity metrics. No GPL!
alyssonamaral/QLev
String distance metrics based on Levenshtein and Qwerty Matrix Distance
MrShoenel/git-density
A repository to hold the application git-density that is used to assess the source code density of git repositories.
felipelealdefaria/javascript-clone-detection
Academic study project on JavaScript code duplication using AST parsing and string similarity.
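
Many of the repositories above are built around Dice's coefficient or Levenshtein distance. As a rough illustration of what those two metrics compute, here is a minimal Python sketch. It assumes character bigrams for Dice's coefficient and unit edit costs for Levenshtein distance; the function names are hypothetical and are not taken from any listed repository, whose implementations may differ (for example in tokenization, normalization, or optimization).

```python
from collections import Counter


def bigrams(s):
    """Count the overlapping character bigrams of s (assumed tokenization)."""
    return Counter(s[i:i + 2] for i in range(len(s) - 1))


def dice_coefficient(a, b):
    """Dice's coefficient on character bigrams: 2 * |A & B| / (|A| + |B|)."""
    a_grams, b_grams = bigrams(a), bigrams(b)
    total = sum(a_grams.values()) + sum(b_grams.values())
    if total == 0:
        # Neither string has a bigram (length < 2); fall back to equality.
        return 1.0 if a == b else 0.0
    overlap = sum((a_grams & b_grams).values())  # multiset intersection
    return 2 * overlap / total


def levenshtein(a, b):
    """Levenshtein distance with unit costs, keeping one row at a time."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(
                prev[j] + 1,               # deletion
                cur[j - 1] + 1,            # insertion
                prev[j - 1] + (ca != cb),  # substitution (free on a match)
            ))
        prev = cur
    return prev[-1]


print(dice_coefficient("night", "nacht"))  # one shared bigram ("ht") out of 4 + 4
print(levenshtein("kitten", "sitting"))
```

Dice's coefficient yields a similarity in the range 0 to 1, while Levenshtein distance yields an edit count, so the two are not directly comparable without normalizing the distance by string length.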