high-dimensional-data

There are 168 repositories under high-dimensional-data topic.

  • NVIDIA/MinkowskiEngine

    Minkowski Engine is an auto-diff neural network library for high-dimensional sparse tensors

    Language:Python2.8k44568425
  • ContextLab/hypertools

    A Python toolbox for gaining geometric insights into high-dimensional data

    Language:Python1.9k59197163
  • vald

    vdaas/vald

    Vald. A Highly Scalable Distributed Vector Search Engine

    Language:Go1.6k1810489
  • abess

    abess-team/abess

    Fast Best-Subset Selection Library

    Language:C++48776542
  • ramhiser/datamicroarray

    A collection of small-sample, high-dimensional microarray data sets to assess machine-learning algorithms and models.

    Language:R10592043
  • Tuyki/TT_RNN

    Language:Python10413637
  • daleroberts/hdmedians

    High-dimensional medians (medoid, geometric median, etc.). Fast implementations in Python.

    Language:Python7711614
  • gdkrmr/dimRed

    A Framework for Dimensionality Reduction in R

    Language:R7363516
  • sergiocorreia/ppmlhdfe

    Poisson pseudo-likelihood regression with multiple levels of fixed effects

    Language:HTML72102411
  • great-northern-diver/loon

    A Toolkit for Interactive Statistical Data Visualization

    Language:Tcl4942116
  • GuansongPang/deep-outlier-detection

    Deep distance-based outlier detection published in KDD18: Learning representations specifically for distance-based outlier detection. Few-shot outlier detection

    Language:Python484213
  • lightonai/newma

    Implementation of NEWMA: a new method for scalable model-free online change-point detection

    Language:Python45916
  • nanxstats/hdnom

    🔮 Benchmarking and visualization toolkit for penalized Cox models

    Language:R4471511
  • VarIr/scikit-hubness

    A Python package for hubness analysis and high-dimensional data mining

    Language:Python443409
  • epigen/unsupervised_analysis

    A general purpose Snakemake workflow and MrBiomics module to perform unsupervised analyses (dimensionality reduction & cluster analysis) and visualizations of high-dimensional data.

    Language:Python344614
  • ejohnson643/EMBEDR

    Statistical quality evaluation of dimensionality reduction algorithms

    Language:Jupyter Notebook29592
  • mariaderrico/DPA

    The DPA package is the scikit-learn compatible implementation of the Density Peaks Advanced clustering algorithm. The algorithm provides robust and visual information about the clusters, their statistical reliability and their hierarchical organization.

    Language:Jupyter Notebook28369
  • NLeSC/DiVE

    An interactive 3D web viewer of up to million points on one screen that represent data. Provides interaction for viewing high-dimensional data that has been previously embedded in 3D or 2D. Based on graphosaurus.js and three.js. For a Linux release of a complete embedding+visualization pipeline please visit https://github.com/sonjageorgievska/Embed-Dive.

    Language:HTML26496
  • JoshEngels/FLINNG

    A fast high dimensional near neighbor search algorithm based on group testing and locality sensitive hashing

    Language:C++23104
  • mackelab/CorBinian

    CorBinian: A toolbox for modelling and simulating high-dimensional binary and count-data with correlations

    Language:MATLAB19434
  • MNoorFawi/lshashing

    python library to perform Locality-Sensitive Hashing for faster nearest neighbors search in high dimensional data

    Language:Python19302
  • OFAI/hub-toolbox-python3

    Hubness analysis and removal functions

    Language:Python19374
  • angeloschatzimparmpas/t-viSNE

    t-viSNE: Interactive Assessment and Interpretation of t-SNE Projections

    Language:JavaScript18026
  • brian-lau/highdim

    Statistics for high-dimensional data (homogeneity, sphericity, independence, spherical uniformity)

    Language:MATLAB17893
  • SuperXiang/High-Dimensional-Feature-Selection-of-Medical-Data

    Feature Selection by Optimized LASSO algorithm

    Language:MATLAB16204
  • KChen-lab/SCMarker

    Marker gene selection from scRNA-seq data

    Language:HTML15452
  • huangdonghere/SRCFS

    MATLAB code for Unsupervised Feature Selection with Multi-Subspace Randomization and Collaboration (SRCFS) (KBS 2019)

    Language:MATLAB14101
  • ivan-pi/fortran-flann

    Fortran bindings to the FLANN library for performing fast approximate nearest neighbor searches in high dimensional spaces.

    Language:Fortran14231
  • pedbrgs/PyCCEA

    A Python package of cooperative co-evolutionary algorithms for feature selection in high-dimensional data.

    Language:Python14202
  • ramhiser/sparsediscrim

    Sparse and Regularized Discriminant Analysis in R

    Language:R143475
  • 0xshreyash/tsne-lib

    A simple library for t-SNE animation and a zoom-in feature to apply t-SNE in that region

    Language:Python13200
  • nanxstats/msaenet

    🧲 Multi-step adaptive estimation for reducing false positive selection in sparse regressions

    Language:R134177
  • acidjazz/json-browse

    jQuery plugin to easily browse and highlight your JSON

    Language:JavaScript12111
  • the-fang/Hybrid-K-means-Pso

    An advanced version of K-Means using Particle swarm optimization for clustering of high dimensional data sets, which converges faster to the optimal solution.

    Language:MATLAB12020
  • wangxb96/MEL

    Code for “MEL: Efficient Multi-Task Evolutionary Learning for High-Dimensional Feature Selection“--[IEEE Transactions on Knowledge and Data Engineering (TKDE 24)]

    Language:MATLAB12203
  • Califrais/flash

    A generalized joint model for high-dimensional multivariate longitudinal data and censored durations

    Language:Jupyter Notebook11504