scalable-data-analysis

There are 14 repositories under scalable-data-analysis topic.

  • parashardhapola/scarf

    Toolkit for highly memory efficient analysis of single-cell RNA-Seq, scATAC-Seq and CITE-Seq data. Analyze atlas scale datasets with millions of cells on laptop.

    Language:Python10256412
  • COM6012/ScalableML

    COM6012 Scalable Machine Learning - University of Sheffield. Enjoy our resources? ⭐ Star this repository to show your support and help others discover it!

    Language:HTML876084
  • Caleydo/lineupjs

    Fork and custom implementation of LineUp Library for Visual Analysis of Multi-Attribute

    Language:TypeScript70124184
  • emmalanguage/emma

    A quotation-based Scala DSL for scalable data analysis.

    Language:Scala631514019
  • YogiOnBioinformatics/Computational-Drug-Discovery-Internship-at-Merck

    Description of work done at Merck pharmaceutical company in the summer of 2018 as a Computational Drug Discovery Intern at West Point, PA. Information excludes all proprietary information belonging to Merck & Co.

    Language:Python4002
  • kaydotdev/stochastic-quantization

    Robust and Scalable Clustering with Stochastic Quasi-Gradient K-means

    Language:Jupyter Notebook2100
  • manuparra/knowledgegraphs

    Knowledge data processing

    Language:HTML1361
  • efeag/aga-MSDA

    This repository contain projects completed during my graduate study in Data Science & Analytics at the J. Mack Robinson College of Business, Georgia State University. I worked as part of a team of 4 or 6 members and we equally contributed in completing tasks and preparing final documentations (code file, report & PowerPoint presentation).

    Language:Jupyter Notebook0100
  • JayLohokare/sparkGIS

    Spark GIS (Docker + Flask Webserver + SparkGIS)

    Language:Java0200
  • lapets/course-data-mechanics

    Lecture notes and other materials for a one-semester course on data mechanics.

    Language:HTML0101
  • terilios/automated_data_scientist

    Automated Data Scientist: An intelligent, adaptive data analysis tool that leverages AI-driven automation to dynamically plan, execute, and refine data science workflows. Automatically handles data preparation, analysis planning, code generation, and result interpretation using advanced language models.

    Language:Python0101
  • Caleydo/taggle

    deprecated use lineup.js develop branch instead

    Language:TypeScript5113
  • mmaguero/cloud-based-tool-SA

    A cloud-based tool for sentiment analysis in reviews about restaurants on TripAdvisor

    Language:Python20