dask-distributed
There are 45 repositories under dask-distributed topic.
DataCanvasIO/HyperGBM
A full pipeline AutoML tool for tabular data
TimeEval/TimeEval
Evaluation Tool for Anomaly Detection Algorithms on Time Series
JSybrandt/agatha
AGATHA: Automatic Graph-mining And Transformer based Hypothesis generation Approach
modin-project/unidist
Unified Distributed Execution
shauryashaurya/learn-data-munging
Notes on Data Engineering with Pandas, PySpark, Dask, Ray, Arrow DataFusion, Polars etc.
pyiron/pylammpsmpi
Parallel Lammps Python interface - control a mpi4py parallel LAMMPS instance from a serial python process or a Jupyter notebook
aws-solutions-library-samples/distributed-compute-on-aws-with-cross-regional-dask
Perform I/O intensive workloads on high-volume data sparsely located across multiple AWS regions through the use of Dask.
elcorto/psweep
Loop like a pro, make parameter studies fun.
gdmarmerola/big-data-ml-training
Code for "Training models when data doesn't fit in memory" post
jameslamb/lightgbm-dask-testing
Test LightGBM's Dask integration on different cluster types
IncubatorShokuhou/dask-tutorial-chinese
Dask tutorial;Dask汉化教程
ScalableCytometryImageProcessing/SCIP
Scalable Cytometry Image Processing (SCIP) is an open-source tool that implements an image processing pipeline on top of Dask, a distributed computing framework written in Python. SCIP performs projection, illumination correction, image segmentation and masking, and feature extraction.
gandalf1819/NYCOpenData-Profiling-Analysis
Open Data Profiling, Quality and Analysis on NYC OpenData dataset with semantic profiling using fuzzy ratio, Levenshtein distance and regex
gjoseph92/sneks
Launch a Dask cluster from a Poetry environment
epiviz/epivizFileServer
Python library to query and transform genomic data from indexed files
octoenergy/dask-remote
Procurement: Dask Cluster as a Process.
pleiszenburg/scherbelberg
HPC cluster deployment and management for the Hetzner Cloud
eth-cscs/ipcluster_magic
Magic commands to support running MPI python code as well as multi-node Dask workloads on Jupyter notebooks.
leosmerling-hopeit/fraud-poc
Fraud detection ML pipeline and serving POC using Dask and hopeit.engine. Project created with nbdev: https://www.fast.ai/2019/12/02/nbdev/
LimnoTech/Xarray-DataAccessor
Efficiently read climate/meteorology data into Xarray using Dask for parallelization. Transform the data for your modelling needs.
VorGeo/earthengine-dask
Scale up concurrent requests to Earth Engine interactive endpoints with Dask
antarcticrainforest/esm_analysis
Python 3 tools for distributed analysis and visualisation of big climate data on HPC systems.
JulianWgs/dask-log-server
Preserve all necessary runtime data of a Dask client in order to "replay" and analyze the performance and behavior of the client after the fact
comp-dev-cms-ita/dask-remote-jobqueue
A custom dask remote jobqueue for HTCondor.
maawoo/stac-access-performance
Testing access performance of Sentinel-1 RTC metadata catalogs
sulis-hpc/sulis-hpc.github.io
User documentation website for the Sulis tier 2 HPC service. Built using Jekyll.
vlfom/nyc-taxi-data
Code for fetching, sampling, and analysis of NYC taxi data from TLC and Uber for 2009-2018
fabidick22/add-worker-DaskCluster
Script para configuración e installacion de requermientos de un worker de Dask Distributed
JBris/pycaret-fugue-dask-test
Testing PyCaret, Fugue, and Dask
KayDVC/semmed-neo4j
A project using the National Library of Medicine's Semantic Medline Database to create a graphical-relational database.
lebedov/dask-ml-on-azure-ml
Using Dask-ML on Azure ML
mabaszadeh/distributed-tsp
Distributed solution for Traveling Salesman Problem using Dask.distributed and OR-Tools
rolani/dask-ecs-lib
dask-ecs-lib is a Python library that effortlessly spins up a Dask cluster on AWS ECS using Fargate, allowing you to seamlessly execute and parallelize your functions.
Daniel-Elston/real-time-reddit-scalable-processing
Scaling NLP processing pipelines with Dask and PySpark, utilising Apache Kafka real-time data streaming, for optimal LLM training
OleksandrZadvornyi/dask-weather-analysis
Distributed processing and analysis of daily weather station summaries using Dask library
shiv3679/ClimEval
EvalMetrics: Precision in Prediction