/gh-archive-data-pipeline

A comprehensive ELT pipeline for GitHub Archive data, built with PySpark, Flink, Kafka, and Airflow, with monitoring via Prometheus/Grafana.

Primary Language: Python

This repository is not active