jaccard-index

There are 31 repositories under the jaccard-index topic.

  • adrg/strutil

    Golang metrics for calculating string similarity and other string utility functions

    Language: Go
  • oertl/treeminhash

    TreeMinHash: Fast Sketching for Weighted Jaccard Similarity Estimation

    Language: C++
  • Padhma/Liver-Disease-Prediction

    This project aims to predict liver disease in Indian patients

    Language: Jupyter Notebook
  • aallam/string-similarity-kotlin

    Kotlin multiplatform library offering various algorithms to measure string similarity and distance

    Language: Kotlin
  • rymshasaeed/lesion-segmentation

    Breast ultrasound (BUS) image segmentation using a region-growing algorithm

    Language: MATLAB
  • micts/jss

    Fast Jaccard similarity search for abstract sets (documents, products, users, etc.) using MinHashing and Locality-Sensitive Hashing

    Language: Python
  • caterinado/Machine-Learning-with-Python-The-Best-Classifier

    Built a classifier to predict whether a loan case will be paid off or not. Used classification algorithms (k-Nearest Neighbour, Decision Tree, Support Vector Machine, Logistic Regression). Each result is reported with the accuracy of each classifier (Jaccard index, F1-score, LogLoss)

    Language: Jupyter Notebook
  • duttaprasanta/clustering

    Different clustering algorithms and clustering metrics are implemented in this repository

    Language: Python
  • esalini22/gene-hll

    HyperLogLog in C++ and OpenMP for computing genome similarity using the Jaccard index

    Language: C++
  • Agisthemantobeat/Loan-Repay

    We load a historical dataset from previous loan applications, clean the data, and apply different classification algorithms on the data.

    Language: Jupyter Notebook
  • Amirreza-Mousavi/Aspartate_Racemase_Ligands_Simlarity_Score

    An R script that uses the MACCS166 chemical fingerprint and calculates the Jaccard Index/Tanimoto Coefficient for a list of Aspartate Racemase ligands

    Language: R
  • cissagatto/jaccard

    This code generates partitions for a multilabel dataset using the Jaccard Index similarity measure. We use HCLUST with 6 linkage metrics to generate several partitions. You may build the partition with the highest coefficient. This code also provides an analysis of the partitioning.

    Language: R
  • fagnercarvalho/QuestionSimilarityTest

    Testing Jaccard similarity and Cosine similarity techniques to calculate the similarity between two questions.

    Language: C#
  • faisal-irzal/CC_Fraud_Detector

    Implementation of various machine learning techniques to detect credit card fraud based on a given dataset. This repo will guide you through the data analysis, visualization, and building of predictive models

    Language: Jupyter Notebook
  • Hassaniftikhar24/LoanData-ML

    This project contains the KNN, SVM, Logistic Regression and Decision Tree algorithms applied to a loan data set. Model evaluation is also presented at the end.

    Language: Jupyter Notebook
  • Jonas1312/dice-coefficient-scale-sensitivity-pitfall

    The Dice Coefficient Is Scale Sensitive, Mathematical Proof.

  • m-imani/Classification-methods

    Load a dataset using Pandas and apply the following classification methods (KNN, Decision Tree, SVM, and Logistic Regression) to find the best one for this specific dataset, using accuracy evaluation methods (Jaccard, F1-score, LogLoss).

    Language: Jupyter Notebook
  • MovieTone/JaccardDocumentComparison

    Document Comparison web application based on Jaccard Similarity Index. The uploaded file is compared to all previously uploaded ones. Built with Java/JSP

    Language: CSS
  • nikoshet/pyspark-movie-similarities

    Using Spark In Python For Movie Similarities With Jaccard Index

    Language: Python
  • rekha-kandukuri/CentralisedInformationSystem

    A platform for both students and instructors to browse courses in the MOOC world easily. The platform features a recommender system that predicts courses a user is likely to prefer based on past courses, a student-instructor course enrollment system, and a real-time discussion forum.

    Language: Jupyter Notebook
  • xp-song/photo-classify

    Classifying images into discrete categories based on keywords generated from the Google Cloud Vision API

  • youssefelmougy/jaccard-selector

    Asynchronous Distributed Actor-based Approach to Jaccard Similarity for Genome Comparisons

    Language: Fortran
  • devxdebanjan/AutoCorrect

    Implemented various spellcheck techniques like cosine similarity, Jaccard similarity and Levenshtein distance. Open to any further contributions.

    Language: Jupyter Notebook
  • Divya-Bhargavi/Kaggle_HomeDepot

    Predict search relevance given a product name and its text attributes

    Language: Jupyter Notebook
  • elizabethshen/Machine-Learning-Project

    Machine Learning with Python

    Language: Jupyter Notebook
  • FilobateerEssam/IBM-Machine-Learning-Project

    Build a classifier to predict whether a loan case will be paid off or not. Clean the loan application data and apply different classification algorithms to it, using k-Nearest Neighbour, Decision Tree, Support Vector Machine, and Logistic Regression to build the models. The results are reported as the accuracy of each classifier, using the following metrics where applicable: Jaccard index, F1-score, LogLoss.

    Language: Jupyter Notebook
  • hbenbel/Thematisation

    Pipeline that learns and recognizes themes

    Language: Python
  • Mohith-Kota/Clustering-Techniques-on-Synthetic-Real-World-Data

    The objective is to apply different clustering methods to synthetic and real-world data and validate them using external and internal validation techniques

  • cissagatto/Generate-Partitions-Jaccard

    This code is part of my doctoral research. The aim is to generate partitions from the Jaccard index for multilabel classification.

    Language: R
  • jas-haria/News-Recommendation-Reliability-Indicator-System

    A Google Chrome Extension that estimates the Reliability, Polarity and Subjectivity of any news article on the web. It allows you to like/dislike any article and recommends you articles based on your choices.

    Language: Jupyter Notebook