string-distance
There are 71 repositories under the string-distance topic.
tdebatty/java-string-similarity
Implementation of various string similarity and distance algorithms: Levenshtein, Jaro-Winkler, n-Gram, Q-Gram, Jaccard index, Longest Common Subsequence edit distance, cosine similarity ...
J535D165/recordlinkage
A powerful and modular toolkit for record linkage and duplicate detection in Python
xdrop/fuzzywuzzy
Java implementation of the well-known Python fuzzywuzzy fuzzy string matching algorithm. Fuzzy search for Java.
hbollon/go-edlib
📚 String comparison and edit distance algorithms library, featuring: Levenshtein, LCS, Hamming, Damerau-Levenshtein (OSA and adjacent transpositions algorithms), Jaro-Winkler, Cosine, etc.
feature23/StringSimilarity.NET
A .NET port of java-string-similarity
adrg/strutil
Go metrics for calculating string similarity and other string utility functions
Turnerj/Quickenshtein
Making the quickest and most memory efficient implementation of Levenshtein Distance with SIMD and Threading support
cadmiumcr/cadmium
Natural Language Processing (NLP) library for Crystal
matthieugomez/StringDistances.jl
String Distances in Julia
fasiha/mudderjs
Lexicographically-subdivide the “space” between strings, by defining an alternate non-base-ten number system using a pre-defined dictionary of symbol↔︎number mappings. Handy for ordering NoSQL keys.
Daniel-Liu-c0deb0t/triple_accel
Rust edit distance routines accelerated using SIMD. Supports fast Hamming, Levenshtein, restricted Damerau-Levenshtein, etc. distance calculations and string search.
agext/levenshtein
Levenshtein distance and similarity metrics with customizable edit costs and Winkler-like bonus for common prefix.
anirbanmu/str_metrics
Ruby gem (native extension in Rust) providing implementations of various string metrics
wyndow/fuzzywuzzy
Fuzzy string matching for PHP
technikhil314/offline-diff-viewer
A privacy-focused, easily shareable, open-source diff viewer with anonymous tracking.
dexyk/stringosim
String similarity functions and string distances: Jaccard, Levenshtein, Hamming, Jaro-Winkler, Q-grams, N-grams, LCS (Longest Common Subsequence), Cosine similarity...
dedupeio/affinegap
📐 A Cython implementation of the affine gap string distance
hyperjumptech/beda
Beda is a Go library for detecting how similar two strings are
iesl/stance
Learned string similarity for entity names using optimal transport.
dedupeio/pyhacrf
📐 Hidden alignment conditional random field for classifying string pairs.
lorenzocestaro/seqalign
Collection of sequence alignment algorithms.
lovit/levenshtein_finder
Similar string search in Levenshtein distance
Dynom/TySug
A project aimed at preventing typos. TySug (Typo Suggestions) suggests alternative words with respect to keyboard layouts.
ecomp-shONgit/string-distance
A set of (string) distance functions written in JavaScript / Python / PHP.
cicirello/JavaPermutationTools
A Java library for computation on permutations and sequences
obulkin/string-dist
A Python library for calculating string distances using C extensions (with a pure Python fallback)
clownpriest/strings
strings for zig
sumn2u/string-comparisons
A collection of string comparison algorithms
mehrandvd/Simila
A project for string similarities.
nkkarpov/editdistancek
LMS algorithm for computing edit distance with SIMD optimizations
bitfoundation/Simila
A project for string similarities.
ywu94/python-text-distance
A Python implementation of a variety of text/string distance and similarity metrics. No GPL!
jhermsmeier/node-sift-distance
SIFT distance algorithm
OlivierBinette/groupbyrule
Deduplicate data using fuzzy and deterministic matching rules.
sp1ff/damerau-levenshtein
Comparison of a few algorithms for computing Damerau–Levenshtein distance
TeodorDyakov/wildcard-trie
String trie that supports wildcard search