davewdb's Stars
databrickslabs/tempo
API for manipulating time series on top of Apache Spark: lagged time values, rolling statistics (mean, avg, sum, count, etc), AS OF joins, downsampling, and interpolation
databricks/koalas
Koalas: pandas API on Apache Spark
mlflow/mlflow
Open source platform for the machine learning lifecycle
delta-io/delta
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs