tobigiwa/pyspark-structured-streaming-file-source
This is a Spark streaming program that streams data from a file source(e.g CSV, JSON, Parquet, AVRO), applies aggregations with SparkSQL and sends the data and its aggregations to a sink (e.g console, file source).
PythonMIT
No issues in this repository yet.