/pyspark-structured-streaming-file-source

This is a Spark streaming program that streams data from a file source(e.g CSV, JSON, Parquet, AVRO), applies aggregations with SparkSQL and sends the data and its aggregations to a sink (e.g console, file source).

Primary LanguagePythonMIT LicenseMIT

No issues in this repository yet.