jaccard

There are 43 repositories under jaccard topic.

  • rockymadden/stringmetric

    :dart: String metrics and phonetic algorithms for Scala (e.g. Dice/Sorensen, Hamming, Jaccard, Jaro, Jaro-Winkler, Levenshtein, Metaphone, N-Gram, NYSIIS, Overlap, Ratcliff/Obershelp, Refined NYSIIS, Refined Soundex, Soundex, Weighted Levenshtein).

    Language:Scala486412980
  • TiagoCortinhal/SalsaNext

    Uncertainty-aware Semantic Segmentation of LiDAR Point Clouds for Autonomous Driving

    Language:Python4111968103
  • eulerto/pg_similarity

    set of functions and operators for executing similarity queries

    Language:C363163238
  • adrg/strutil

    Go metrics for calculating string similarity and other string utility functions

    Language:Go3074421
  • AllenInstitute/scrattch.hicat

    Hierarchical, iterative clustering for analysis of transcriptomics data in R

    Language:HTML106153131
  • massanishi/document_similarity_algorithms_experiments

    Document similarity algorithms experiment - Jaccard, TF-IDF, Doc2vec, USE, and BERT.

    Language:Python833229
  • vickumar1981/stringdistance

    A fuzzy matching string distance library for Scala and Java that includes Levenshtein distance, Jaro distance, Jaro-Winkler distance, Dice coefficient, N-Gram similarity, Cosine similarity, Jaccard similarity, Longest common subsequence, Hamming distance, and more..

    Language:Scala7764415
  • dexyk/stringosim

    String similarity functions, String distance's, Jaccard, Levenshtein, Hamming, Jaro-Winkler, Q-grams, N-grams, LCS - Longest Common Subsequence, Cosine similarity...

    Language:Go60508
  • dynatrace-research/set-sketch-paper

    SetSketch: Filling the Gap between MinHash and HyperLogLog

    Language:C++49245
  • gvegayon/twitterreport

    Out-of-the-box analysis and reporting tools for twitter

    Language:R39574
  • catalyst-team/segmentation

    Catalyst.Segmentation

    Language:Python286610
  • kseo/edit_distance

    Implementation of string distance algorithms in Dart

    Language:Dart263112
  • winkjs/wink-distance

    Distance/Similarity functions for Bag of Words, Strings, Vectors and more.

    Language:JavaScript23665
  • fitushar/Skin-lesion-Segmentation-using-grabcut

    Skin lesion segmentation is one of the first steps towards automatic Computer-Aided Diagnosis of skin cancer. Vast variety in the appearance of the skin lesion makes this task very challenging. The contribution of this paper is to apply a power foreground extraction technique called GrabCut for automatic skin lesion segmentation in HSV color space with minimal human interaction. Preprocessing was performed for removing the outer black border. Jaccard Index was measured to evaluate the performance of the segmentation method. On average, 0.71 Jaccard Index was achieved on 1000 images from ISIC challenge 2017 Training Dataset.

    Language:Python18111
  • oertl/treeminhash

    TreeMinHash: Fast Sketching for Weighted Jaccard Similarity Estimation

    Language:C++14404
  • anshul1004/TweetsClustering

    Clustering similar tweets using K-means clustering algorithm and Jaccard distance metric

    Language:Python7104
  • kangqiwang/Information_Retrieval

    iamge Retrieva

    Language:Python7102
  • usc-isi-i2/ppjoin

    PPJoin and P4Join Python 3 implementation

    Language:Python690
  • ncchung/jaccard

    Statistical test of similarity between binary data using the Jaccard/Tanimoto coefficients

    Language:R5110
  • khaosdoctor/sound-recommender

    Simple API to recommend songs

    Language:TypeScript4201
  • tommenx/jaccard-similarity

    calculate jaccard similarity using mapreduce framework

    Language:Java4100
  • EgorBu/set_sim_search

    Highly optimized search for similar multisets

    Language:Python3100
  • Jimut123/bmsan

    Machine Learning 2 Course Project at RKMVERI, 2021. Published at The Imaging Science Journal (2023), Paper: https://www.tandfonline.com/doi/full/10.1080/13682199.2023.2174657

    Language:Jupyter Notebook3100
  • mhaseebtariq/doppel-speller

    An ML+NLP solution for linking misspelled titles with the true titles

    Language:Python3300
  • stefantaubert/quora-competition

    Code for Quora Competition on Kaggle

    Language:Python3102
  • LuoZijun/rust-jieba

    Rust jieba

    Language:Rust220
  • pharo-ai/metrics

    Implementation of Machine Learning metrics for Pharo

    Language:Smalltalk2512
  • DISCOSUMO/evaluation

    Evaluation and agreement scripts for the DISCOSUMO project. Each evaluation script takes both manual annotations as automatic summarization output. The formatting of these files is highly project-specific. However, the evaluation functions for precision, recall, ROUGE, Jaccard, Cohen's kappa and Fleiss' kappa may be applicable to other domains too.

    Language:Python1400
  • fagnercarvalho/QuestionSimilarityTest

    Testing Jaccard similarity and Cosine similarity techniques to calculate the similarity between two questions.

    Language:C#140
  • medric49/w_distances

    Just some implementations of word distance functions.

    Language:Python1200
  • n-serrette/Cluster_Index

    Implementation of some intern and extern clustering indexes

    Language:Python1200
  • paul-sud/bigbed-jaccard

    A tool to approximate the Jaccard similarity of bigBed files from functional genomic datasets

    Language:Jupyter Notebook1100
  • sknepal/DocSim

    Calculating Jaccard & Cosine Similarity between texts.

    Language:Jupyter Notebook120
  • soenneker/soenneker.utils.string.jaccardsimilarity

    A utility library for comparing strings via the Jaccard similarity algorithm

    Language:C#120
  • Kaushal1011/CS441SimRankForGraphs

    This is the implementation of an algorithm that finds traceability links in two graphs such that the other graph is a perturbed version of the original graph.

    Language:Scala0200
  • guenthermi/fast_minh

    Python package for fast MinHash calculation and operations

    Language:C++10