Pinned Repositories
beam
Apache Beam is a unified programming model for Batch and Streaming data processing.
beam-test
elitzur
flyteplugins
Flyte Backend Plugins contributed by the Flyte community.
flytepropeller
FlytePropeller is a Kubernetes native operator, that executes Flyte Workflows and Tasks. It has its own kubectl-flyte CLI to interact and is extensible using the flyteplugins/pluginmachinery interface
hadoop
Apache Hadoop
luigi
Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.
missinglink
Build time tool for detecting link problems in java projects
nevillelyh.github.io
Repository for www.lyh.me
parquet-benchmarks
clairemcginty's Repositories
clairemcginty/beam-test
clairemcginty/beam
Apache Beam is a unified programming model for Batch and Streaming data processing.
clairemcginty/elitzur
clairemcginty/flyteplugins
Flyte Backend Plugins contributed by the Flyte community.
clairemcginty/flytepropeller
FlytePropeller is a Kubernetes native operator, that executes Flyte Workflows and Tasks. It has its own kubectl-flyte CLI to interact and is extensible using the flyteplugins/pluginmachinery interface
clairemcginty/hadoop
Apache Hadoop
clairemcginty/luigi
Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.
clairemcginty/missinglink
Build time tool for detecting link problems in java projects
clairemcginty/nevillelyh.github.io
Repository for www.lyh.me
clairemcginty/parquet-benchmarks
clairemcginty/parquet-mr
Apache Parquet
clairemcginty/sbt-missinglink
An sbt plugin for missinglink
clairemcginty/scio
A Scala API for Apache Beam and Google Cloud Dataflow.
clairemcginty/socco-ng
socco-ng is a fork from criteo/socco: A Scala compiler plugin to generate documentation from Scala source files.
clairemcginty/styx
"The path to execution", Styx is a service that schedules batch data processing jobs in Docker containers on Kubernetes.