Pinned Repositories
aws-sdk-java
The official AWS SDK for Java.
banzai-charts
Curated list of Banzai Cloud Helm charts used by the Pipeline Platform
bigdata-platform-on-k8s
deploy bigdata platform on kubernetes
cube-studio
Cloud native one-stop machine learning platform, Multi-user, Dataleap, Notebook, Drag-and-Drop pipeline, Multi-machine multi-gpu distributed training, Automl, Inference, Edge computing, Federation schedule, Real time training, large models, AIhub
datahub
The Metadata Platform for the Modern Data Stack
dinky
Dinky is an out of the box one-stop real-time computing platform dedicated to the construction and practice of Unified Streaming & Batch and Unified Data Lake & Data Warehouse. Based on Apache Flink, Dinky provides the ability to connect many big data frameworks including OLAP and Data Lake.
docusaurus
Easy to maintain open source documentation websites.
flink
Apache Flink
flink-connector-elasticsearch
Apache Flink connector for ElasticSearch
flink-docker
Docker packaging for Apache Flink
david-z-johnson's Repositories
david-z-johnson/aws-sdk-java
The official AWS SDK for Java.
david-z-johnson/banzai-charts
Curated list of Banzai Cloud Helm charts used by the Pipeline Platform
david-z-johnson/bigdata-platform-on-k8s
deploy bigdata platform on kubernetes
david-z-johnson/cube-studio
Cloud native one-stop machine learning platform, Multi-user, Dataleap, Notebook, Drag-and-Drop pipeline, Multi-machine multi-gpu distributed training, Automl, Inference, Edge computing, Federation schedule, Real time training, large models, AIhub
david-z-johnson/datahub
The Metadata Platform for the Modern Data Stack
david-z-johnson/dinky
Dinky is an out of the box one-stop real-time computing platform dedicated to the construction and practice of Unified Streaming & Batch and Unified Data Lake & Data Warehouse. Based on Apache Flink, Dinky provides the ability to connect many big data frameworks including OLAP and Data Lake.
david-z-johnson/docusaurus
Easy to maintain open source documentation websites.
david-z-johnson/flink
Apache Flink
david-z-johnson/flink-connector-elasticsearch
Apache Flink connector for ElasticSearch
david-z-johnson/flink-docker
Docker packaging for Apache Flink
david-z-johnson/flink-kubernetes-operator
Apache Flink Kubernetes Operator
david-z-johnson/hadoop
Apache Hadoop
david-z-johnson/iperf-jperf
Improvements to jperf, a Java interface to the iperf network throughput testing suite
david-z-johnson/presto-on-k8s
Deploying Presto on K8S as a cloud OLAP Service, dynamic scaling based on HPA
david-z-johnson/spark
Apache Spark - A unified analytics engine for large-scale data processing
david-z-johnson/spark-on-k8s-operator
Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.
david-z-johnson/superset-2.1.0rc1
Apache Superset is a Data Visualization and Data Exploration Platform
david-z-johnson/talk-demos
Code & docs for Pipekit's talks
david-z-johnson/transporter
Sync data between persistence engines, like ETL only not stodgy