high-dimensional-data

There are 165 repositories under high-dimensional-data topic.

  • NVIDIA/MinkowskiEngine

    Minkowski Engine is an auto-diff neural network library for high-dimensional sparse tensors

    Language:Python2.5k46550369
  • ContextLab/hypertools

    A Python toolbox for gaining geometric insights into high-dimensional data

    Language:Python1.8k60197161
  • vald

    vdaas/vald

    Vald. A Highly Scalable Distributed Vector Search Engine

    Language:Go1.6k199977
  • abess

    abess-team/abess

    Fast Best-Subset Selection Library

    Language:C++48086541
  • ramhiser/datamicroarray

    A collection of small-sample, high-dimensional microarray data sets to assess machine-learning algorithms and models.

    Language:R105102042
  • Tuyki/TT_RNN

    Language:Python10314637
  • gdkrmr/dimRed

    A Framework for Dimensionality Reduction in R

    Language:R7373315
  • daleroberts/hdmedians

    High-dimensional medians (medoid, geometric median, etc.). Fast implementations in Python.

    Language:Python7112615
  • sergiocorreia/ppmlhdfe

    Poisson pseudo-likelihood regression with multiple levels of fixed effects

    Language:HTML65102313
  • great-northern-diver/loon

    A Toolkit for Interactive Statistical Data Visualization

    Language:Tcl4852116
  • GuansongPang/deep-outlier-detection

    Deep distance-based outlier detection published in KDD18: Learning representations specifically for distance-based outlier detection. Few-shot outlier detection

    Language:Python484213
  • lightonai/newma

    Implementation of NEWMA: a new method for scalable model-free online change-point detection

    Language:Python461016
  • VarIr/scikit-hubness

    A Python package for hubness analysis and high-dimensional data mining

    Language:Python444409
  • nanxstats/hdnom

    🔮 Benchmarking and visualization toolkit for penalized Cox models

    Language:R4381511
  • ejohnson643/EMBEDR

    Statistical quality evaluation of dimensionality reduction algorithms

    Language:Jupyter Notebook29592
  • mariaderrico/DPA

    The DPA package is the scikit-learn compatible implementation of the Density Peaks Advanced clustering algorithm. The algorithm provides robust and visual information about the clusters, their statistical reliability and their hierarchical organization.

    Language:Jupyter Notebook27369
  • epigen/unsupervised_analysis

    A general purpose Snakemake workflow and MrBiomics module to perform unsupervised analyses (dimensionality reduction & cluster analysis) and visualizations of high-dimensional data.

    Language:Python265604
  • NLeSC/DiVE

    An interactive 3D web viewer of up to million points on one screen that represent data. Provides interaction for viewing high-dimensional data that has been previously embedded in 3D or 2D. Based on graphosaurus.js and three.js. For a Linux release of a complete embedding+visualization pipeline please visit https://github.com/sonjageorgievska/Embed-Dive.

    Language:HTML26596
  • JoshEngels/FLINNG

    A fast high dimensional near neighbor search algorithm based on group testing and locality sensitive hashing

    Language:C++19104
  • mackelab/CorBinian

    CorBinian: A toolbox for modelling and simulating high-dimensional binary and count-data with correlations

    Language:MATLAB19534
  • MNoorFawi/lshashing

    python library to perform Locality-Sensitive Hashing for faster nearest neighbors search in high dimensional data

    Language:Python19302
  • OFAI/hub-toolbox-python3

    Hubness analysis and removal functions

    Language:Python19474
  • angeloschatzimparmpas/t-viSNE

    t-viSNE: Interactive Assessment and Interpretation of t-SNE Projections

    Language:JavaScript17026
  • brian-lau/highdim

    Statistics for high-dimensional data (homogeneity, sphericity, independence, spherical uniformity)

    Language:Matlab17994
  • SuperXiang/High-Dimensional-Feature-Selection-of-Medical-Data

    Feature Selection by Optimized LASSO algorithm

    Language:MATLAB16204
  • KChen-lab/SCMarker

    Marker gene selection from scRNA-seq data

    Language:HTML15452
  • huangdonghere/SRCFS

    MATLAB code for Unsupervised Feature Selection with Multi-Subspace Randomization and Collaboration (SRCFS) (KBS 2019)

    Language:MATLAB14101
  • ivan-pi/fortran-flann

    Fortran bindings to the FLANN library for performing fast approximate nearest neighbor searches in high dimensional spaces.

    Language:Fortran14231
  • ramhiser/sparsediscrim

    Sparse and Regularized Discriminant Analysis in R

    Language:R144475
  • 0xshreyash/tsne-lib

    A simple library for t-SNE animation and a zoom-in feature to apply t-SNE in that region

    Language:Python13200
  • nanxstats/msaenet

    🧲 Multi-step adaptive estimation for reducing false positive selection in sparse regressions

    Language:R135177
  • acidjazz/json-browse

    jQuery plugin to easily browse and highlight your JSON

    Language:JavaScript11211
  • Califrais/flash

    A generalized joint model for high-dimensional multivariate longitudinal data and censored durations

    Language:Jupyter Notebook11503
  • shu-hai/D-CCA

    A Decomposition-based Canonical Correlation Analysis for High-dimensional Datasets (JASA-20 paper)

    Language:Python111010
  • astro-informatics/QuantifAI

    PyTorch-based radio-interferometric imaging reconstruction package with scalable Bayesian uncertainty quantification relying on data-driven (learned) priors

    Language:Jupyter Notebook10400
  • kravitsjacob/paxplot

    Paxplot is a Python visualization library for parallel coordinate plots based on matplotlib.

    Language:Python10191