Note: this is a proof-of-concept, and many things are incomplete or don't work.
Cubed is a distributed N-dimensional array library implemented in Python using bounded-memory serverless processing and Zarr for storage.
- Implements the Python Array API standard (see coverage status)
- Guaranteed maximum memory usage for standard array functions
- Follows Dask Array's chunked array API (
map_blocks
,rechunk
,apply_gufunc
, etc) - Zarr for persistent and intermediate storage
- Multiple serverless runtimes: Python (in-process), Lithops, Modal, Apache Beam
- Integration with Xarray via cubed-xarray