/pyspark-stateful-processing-with-twitter-kafka

This is a simple project consisting of a pipeline of streaming processing with Apache Kafka, PySpark and Twitter Streaming API. This project is meant to understand the concepts behind stateful processing and event time processing with Spark Streaming

Primary LanguageJupyter NotebookMIT LicenseMIT

Watchers