Pinned Repositories
1trc
etl-tpch
examples
Examples using Dask and Coiled
micro-dask-tutorial-widsPS22
Dask workshop for Women in Data Science Puget Sound 2022
community
For general discussion and community planning. Discussion issues welcome.
dask
Parallel computing with task scheduling
labour-migration
Part of ABM course for social scientists with University of Southampton
linkedin_recruiter
Code to handle LinkedIn data from the recruiter platform
scharlottej13's Repositories
scharlottej13/1brc
1️⃣🐝🏎️ The One Billion Row Challenge -- A fun exploration of how quickly 1B rows from a text file can be aggregated with Java
scharlottej13/coiled-from-prefect
How To Run A Coiled Workflow with Prefect
scharlottej13/coiled-runtime
scharlottej13/dask
Parallel computing with task scheduling
scharlottej13/dask-blog
Dask development blog
scharlottej13/dask-cloudprovider
Cloud provider cluster managers for Dask. Supports AWS, Google Cloud Azure and more...
scharlottej13/accelerated-computing-hub
NVIDIA curated collection of educational resources related to general purpose GPU programming.
scharlottej13/blog
Public repo for HF blog posts
scharlottej13/dask-expr
scharlottej13/dask-gateway
A multi-tenant server for securely deploying and managing Dask clusters.
scharlottej13/dask-jobqueue
Deploy Dask on job schedulers like PBS, SLURM, and SGE
scharlottej13/dask-kubernetes
Native Kubernetes integration for Dask
scharlottej13/dask-ml
Scalable Machine Learning with Dask
scharlottej13/dask-mpi
Deploy Dask using MPI4Py
scharlottej13/dask-sphinx-theme
Sphinx theme for Dask documentation
scharlottej13/dask-stories
scharlottej13/dask-tutorial
Dask tutorial
scharlottej13/dask-yarn
Deploy dask on YARN clusters
scharlottej13/etl-tpch
scharlottej13/geospatial-python
Introduction to Geospatial Raster and Vector Data with Python
scharlottej13/joblib
Computing with Python functions.
scharlottej13/mrocklin.github.io
Professional webpage
scharlottej13/noaa-nwm-dask
scharlottej13/open-data-registry
A registry of publicly available datasets on AWS
scharlottej13/prefect
The easiest way to coordinate your dataflow
scharlottej13/scikit-learn
scikit-learn: machine learning in Python
scharlottej13/sphinxcontrib.asciinema
Easily embed asciinema videos into Sphinx documentation
scharlottej13/talkpython-getting-started-with-dask
Material for Talk Python Training course on Getting Started with Dask.
scharlottej13/xarray
N-D labeled arrays and datasets in Python
scharlottej13/xgboost
Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow