dask
There are 480 repositories under dask topic.
dask/dask
Parallel computing with task scheduling
rapidsai/cudf
cuDF - GPU DataFrame Library
pydata/xarray
N-D labeled arrays and datasets in Python
stumpy-dev/stumpy
STUMPY is a powerful and scalable Python library for modern time series analysis
mars-project/mars
Mars is a tensor-based unified framework for large-scale data computation which scales numpy, pandas, scikit-learn and Python functions.
jmcarpenter2/swifter
A package which efficiently applies any function to a pandas dataframe or series in the fastest available manner
fugue-project/fugue
A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rewrites.
dask/distributed
A distributed task scheduler for Dask
hi-primus/optimus
:truck: Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark
narwhals-dev/narwhals
Lightweight and extensible compatibility layer between dataframe libraries!
itamarst/eliot
Eliot: the logging system that tells you *why* it happened
pytroll/satpy
Python package for earth-observing satellite data processing
Nixtla/mlforecast
Scalable machine 🤖 learning for time series forecasting.
capitalone/datacompy
Pandas, Polars, Spark, and Snowpark DataFrame comparison for humans and more!
ranaroussi/pystore
Fast data store for Pandas time-series data
polyaxon/traceml
Engine for ML/Data tracking, visualization, explainability, drift detection, and dashboards for Polyaxon.
dask-contrib/dask-sql
Distributed SQL Engine in Python using Dask
Ouranosinc/xclim
Library of derived climate variables, ie climate indicators, based on xarray.
pytroll/pyresample
Geospatial image resampling in Python
DataCanvasIO/HyperGBM
A full pipeline AutoML tool for tabular data
nebari-dev/nebari
🪴 Nebari - your open source data science platform
NVIDIA-Merlin/models
Merlin Models is a collection of deep learning recommender system model reference implementations
JiaweiZhuang/xESMF
Universal Regridder for Geospatial Data
aws-samples/amazon-sagemaker-local-mode
Amazon SageMaker Local Mode Examples
gjoseph92/stackstac
Turn a STAC catalog into a dask-based xarray
LDO-CERT/orochi
The Volatility Collaborative GUI
tkp-archive/paperboy
A web frontend for scheduling Jupyter notebook reports
pangeo-data/climpred
:earth_americas: Verification of weather and climate forecasts :earth_africa:
dask/dask-jobqueue
Deploy Dask on job schedulers like PBS, SLURM, and SGE
AllenCellModeling/aicsimageio
Image Reading, Metadata Conversion, and Image Writing for Microscopy Images in Python
ESDS-Leipzig/cubo
On-Demand Earth System Data Cubes (ESDCs) in Python
jgrss/geowombat
GeoWombat: Utilities for geospatial data
nci/scores
scores: Metrics for the verification, evaluation and optimisation of forecasts, predictions or models.
msoechting/lexcube
Lexcube: 3D Data Cube Visualization in Jupyter Notebooks
JDASoftwareGroup/kartothek
A consistent table management library in python
jcmgray/autoray
Abstract your array operations.