cfmcgrady
Apache Kyuubi PMC Member / Apache Celeborn PMC Member / Apache Spark Contributor / Delta Contributor
@apacheHangzhou, China
Pinned Repositories
akka-zk-cluster-seed
almond
A scala kernel for Jupyter
ammonite-spark
Run spark calculations from Ammonite
analytics-zoo
Analytics + AI Platform for Apache Spark and BigDL
calcite
Mirror of Apache Calcite
delta
An open-source storage layer that brings scalable, ACID transactions to Apache Spark™ and big data workloads.
kungfu-panda
Kungfu Panda is a library for register python pandas UDFs in Spark SQL.
spark-adaptive
spark-rest-source
A Rest Api Structured Streaming DataSource
SparkStreamingKafkaDemo
cfmcgrady's Repositories
cfmcgrady/spark-rest-source
A Rest Api Structured Streaming DataSource
cfmcgrady/delta
An open-source storage layer that brings scalable, ACID transactions to Apache Spark™ and big data workloads.
cfmcgrady/kungfu-panda
Kungfu Panda is a library for register python pandas UDFs in Spark SQL.
cfmcgrady/spark-adaptive
cfmcgrady/akka-zk-cluster-seed
cfmcgrady/almond
A scala kernel for Jupyter
cfmcgrady/ammonite-spark
Run spark calculations from Ammonite
cfmcgrady/calcite
Mirror of Apache Calcite
cfmcgrady/canal
阿里巴巴mysql数据库binlog的增量订阅&消费组件 。阿里云DRDS( https://www.aliyun.com/product/drds )、阿里巴巴TDDL 二级索引、小表复制powerd by canal. Aliyun Data Lake Analytics https://www.aliyun.com/product/datalakeanalytics powered by canal
cfmcgrady/davinci
Davinci is a DVaaS (Data Visualization as a Service) Platform
cfmcgrady/documents-zh
cfmcgrady/gluten
Gluten: Plugin to Double SparkSQL's Performance
cfmcgrady/incubator-celeborn
Apache Celeborn is an elastic and high-performance service for shuffle and spilled data.
cfmcgrady/incubator-celeborn-website
Apache Celeborn Site
cfmcgrady/incubator-hudi
Upserts And Incremental Processing on Big Data
cfmcgrady/incubator-kyuubi
Apache Kyuubi is a distributed multi-tenant JDBC server for large-scale data processing and analytics, built on top of Apache Spark
cfmcgrady/incubator-kyuubi-website
Apache Kyuubi Site
cfmcgrady/koalas
Koalas: Pandas API on Apache Spark
cfmcgrady/kyuubi-docker
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
cfmcgrady/mlflow
Open source platform for the machine learning lifecycle
cfmcgrady/mlflow-in-action
cfmcgrady/raydp
RayDP: Distributed data processing library that provides simple APIs for running Spark on Ray and integrating Spark with distributed deep learning and machine learning frameworks.
cfmcgrady/spark
Mirror of Apache Spark
cfmcgrady/spark-extensions
cfmcgrady/spark-sql-perf
cfmcgrady/SparkCube
SparkCube is an open-source project for extremely fast OLAP data analysis. SparkCube is an extension of Apache Spark.
cfmcgrady/sqlflow
Brings SQL and AI together.
cfmcgrady/streamingpro
Build Spark Streaming Application by SQL
cfmcgrady/unitycatalog
Open, Multi-modal Catalog for Data & AI
cfmcgrady/velox
A new C++ vectorized database acceleration library aimed to optimizing query engines and data processing systems.