clustering-analysis

There are 159 repositories under the clustering-analysis topic.

  • milaan9/Clustering_Algorithms_from_Scratch

    Implementing Clustering Algorithms from scratch in MATLAB and Python

    Language:Jupyter Notebook20230179
  • rezacsedu/Deep-Learning-for-Clustering-in-Bioinformatics

    Deep Learning-based Clustering Approaches for Bioinformatics

    Language:Jupyter Notebook1414234
  • rohanmohapatra/hdbscan-cpp

    Fast and Efficient Implementation of HDBSCAN in C++ using STL

    Language:C++721918
  • monty-se/PINstimation

    A comprehensive bundle of utilities for the estimation of probability of informed trading models: original PIN in Easley and O'Hara (1992) and Easley et al. (1996); Multilayer PIN (MPIN) in Ersan (2016); Adjusted PIN (AdjPIN) in Duarte and Young (2009); and volume-synchronized PIN (VPIN) in Easley et al. (2011, 2012). Implementations of various estimation methods suggested in the literature are included. Additional features include posterior probabilities, an implementation of an expectation-maximization (EM) algorithm, and PIN decomposition into layers and into bad/good components. Versatile data simulation tools and trade classification algorithms are among the supplementary utilities. The package provides fast, compact, and precise utilities to tackle the sophisticated, error-prone, and time-consuming estimation procedure of informed trading, using only the raw trade-level data.

    Language:R40147
  • bessagroup/CRATE

    CRATE: Accurate and efficient clustering-based nonlinear analysis of heterogeneous materials through computational homogenization

    Language:Python39127
  • Simon-Bertrand/Clusters-Features

    The Clusters-Features package allows data science users to compute high-level linear algebra operations on any type of data set. It computes approximately 40 internal evaluation scores such as the Davies-Bouldin Index, the C Index, the Dunn Index and its generalized variants, and many more. Other features are also available to evaluate the clustering quality.

    Language:Python33108
  • DOH-JDJ0303/bigbacter-nf

    Bacterial surveillance pipeline.

    Language:Nextflow26144
  • julherest/drought_clusters

    Code used to identify and analyze drought clusters from gridded data.

    Language:Python263012
  • DRLib/CDR

    Implementation of CDR - Interactive Visual Cluster Analysis by Contrastive Dimensionality Reduction

    Language:JavaScript22101
  • EtzionR/Clustering-by-Silhouette

    Optimize clustering labels using Silhouette Score.

    Language:Python15102
  • sharmaroshan/MNIST-Using-K-means

    Detecting the MNIST digits is one of the easiest problems in data science. Here I use a CSV file containing the pixels of the digits 0 to 9, which have to be classified accordingly. I used the K-Means algorithm.

    Language:HTML15007
  • marthadais/AISclassification

    A geometric-driven semi-supervised approach for fishing activity detection from AIS data.

    Language:Jupyter Notebook13101
  • salar96/MEP-Orthogonal-NMF

    Clustering and resource allocation using the Deterministic Annealing approach and Orthogonal Non-negative Matrix Factorization (O-NMF)

    Language:Jupyter Notebook11103
  • at-tan/Hierarchical_Clustering_of_Currencies

    A clustering exercise of global currencies on three common financial market features using data from 2017 through 2019, as published in Towards Data Science on Medium.com

    Language:Jupyter Notebook9203
  • dilettagoglia/DataMining

    🔎 Data Understanding, Visualization, Preparation & Cleaning - Clustering algorithms (unsupervised learning) - Classification algorithms (supervised learning) - Sequential Pattern Mining

    Language:Jupyter Notebook9208
  • ShuyueG/CVI_using_DSI

    Cluster Validity Index Using a Distance-based Separability Measure

    Language:Python9115
  • pajaskowiak/clusterConfusion

    Clustering validation with ROC Curves

    Language:R7221
  • zcebeci/fcvalid

    Internal Validity Indexes for Fuzzy and Possibilistic Clustering

    Language:R7103
  • BayoAdejare/lightning-containers

    Docker powered starter for geospatial analysis of lightning atmospheric data.

    Language:Python6202
  • KaikeWesleyReis/kaggle

    Solutions for different datasets in Kaggle Website

    Language:Jupyter Notebook6200
  • danustc/Image_toolbox

    This is my toolbox for image processing and downstream analysis of calcium imaging data.

    Language:Jupyter Notebook5303
  • EtzionR/generate-Convex-Hull-SHP-from-HDBSCAN-clustering-probabilities

    Defines a boundary around cluster centers in a given point-layer shapefile.

    Language:Python5103
  • ArtemKovera/clust

    A few different clustering algorithms with Python libraries for data science

    Language:Jupyter Notebook4105
  • caesarmario/Mall-Customers-Clustering-Analysis-using-SAS-Enterprise-Miner

    This repository contains mall customers clustering analysis. This repository also uses SAS Enterprise Miner to perform clustering and identify each cluster's characteristics. Full explanations about this repository can be seen on: https://medium.com/@caesarmario/mall-customers-clustering-analysis-da594bd2718b

  • MarinaMoreno/Client-Segmentation-Clustering

    This repository contains an ML project that was approached with a business mindset from the beginning to the end. It addresses the problem of clustering.

    Language:Jupyter Notebook4100
  • mustafahakkoz/Classification_Clustering_Freq_Pattern_Mining

    3 notebooks covering Classification, Clustering Analysis and Frequent Pattern Mining in the scope of Data Mining lectures in Marmara University.

    Language:Jupyter Notebook4100
  • AnFrBo/internet_censorship

    Analysis of the State of Internet Censorship in the United Kingdom Using Data Provided by OONI and Blocked Project as well as Scraped URL Meta Data

    Language:R3100
  • AYSE-DUMAN/Clustering-by-Business-Income-and-Expenses

    Load and visualize data and clusters with scatter plots; prepare data for cluster analysis; perform centroid clustering with k-means; interpret clustering results and determine the optimal number of clusters for a given dataset.

    Language:Jupyter Notebook3200
  • Devanshi-Bavaria/Predictive-Modeling-for-Stock-Market-Trends

    📈 Comprehensive stock price analysis, including preprocessing, clustering, correlation, and predictive modeling, to enhance investment insights and accuracy. 💡

    Language:Jupyter Notebook3101
  • liruijia2017/Local-gap-density-for-clustering-high-dimensional-data-with-varying-densities

    A new clustering algorithm using local gap density

    Language:MATLAB3112
  • olivierzach/random-neighbors

    Random Neighbors: Random Forest style clustering for high-dimensional data

    Language:Python3101
  • paocarvajal1912/Crypto_Clustering

    Uses K-Means unsupervised machine learning algorithm and Principal Component Analysis to cluster cryptocurrencies based on performance in selected periods.

    Language:Jupyter Notebook3300
  • parthnan/IowaGamblingTask-Clustering

    Clustering analysis of all available research data on the Iowa Gambling Task (list of sources in the readme) using R. The scripts produce the output for the most common archetypes among the dataset of one researcher using PCA.

    Language:R3300
  • barbarametzler/clusteringsatelliteimages

    code for PhD thesis

    Language:Python2100
  • Janice-Afi/Market-Segmentation

    This is a Clustering analysis on mall customers

    Language:Jupyter Notebook2100
  • KaranJoseph/DemandForecasting_SCA

    Demand Forecasting using time-series and tree based models for a CPG company that serves US and Canada. Inventory Management using Mixed Integer Linear Programming on the best forecast model.

    Language:Jupyter Notebook2300
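
Several entries above mention k-means together with choosing the number of clusters, for instance by silhouette score. As a rough illustration only, and not taken from any of the listed repositories, the idea could be sketched in plain Python as below. The toy data, the function names (`farthest_point_init`, `kmeans`, `silhouette_score`), and the farthest-point initialization are all assumptions made for this example; the listed projects may work quite differently.

```python
import math


def farthest_point_init(points, k):
    """Deterministic init (an assumption of this sketch): start from the first
    point, then repeatedly add the point farthest from all chosen centers."""
    centers = [points[0]]
    while len(centers) < k:
        centers.append(
            max(points, key=lambda p: min(math.dist(p, c) for c in centers))
        )
    return centers


def kmeans(points, k, max_iter=100):
    """Plain Lloyd iterations; returns one cluster label per point."""
    centers = farthest_point_init(points, k)
    labels = []
    for _ in range(max_iter):
        # Assignment step: each point goes to its nearest center.
        labels = [
            min(range(k), key=lambda j: math.dist(p, centers[j])) for p in points
        ]
        # Update step: each center moves to the mean of its members.
        new_centers = []
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:
                new_centers.append(
                    tuple(sum(axis) / len(members) for axis in zip(*members))
                )
            else:
                new_centers.append(centers[j])  # keep an empty cluster's center
        if new_centers == centers:
            break
        centers = new_centers
    return labels


def silhouette_score(points, labels):
    """Mean silhouette over all points; singleton clusters contribute 0."""
    clusters = {}
    for p, lab in zip(points, labels):
        clusters.setdefault(lab, []).append(p)
    if len(clusters) < 2:
        return 0.0
    total = 0.0
    for p, lab in zip(points, labels):
        own = clusters[lab]
        if len(own) == 1:
            continue
        # a: mean distance to the other members of the point's own cluster.
        a = sum(math.dist(p, q) for q in own) / (len(own) - 1)
        # b: smallest mean distance to the members of any other cluster.
        b = min(
            sum(math.dist(p, q) for q in other) / len(other)
            for other_lab, other in clusters.items()
            if other_lab != lab
        )
        if max(a, b) > 0:
            total += (b - a) / max(a, b)
    return total / len(points)


# Toy data: three tight, well-separated groups of five points each.
offsets = [(0.0, 0.0), (-0.5, -0.5), (-0.5, 0.5), (0.5, -0.5), (0.5, 0.5)]
blob_centers = [(0.0, 0.0), (10.0, 0.0), (5.0, 9.0)]
points = [(cx + dx, cy + dy) for cx, cy in blob_centers for dx, dy in offsets]

# Score each candidate k and keep the one with the highest mean silhouette.
scores = {k: silhouette_score(points, kmeans(points, k)) for k in range(2, 5)}
best_k = max(scores, key=scores.get)
print(best_k, scores)
```

On real data one would typically use a library implementation and several random restarts; the deterministic initialization here only keeps the sketch reproducible.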