Building-a-real-time-data-pipeline

This project builds a real-time data pipeline with Apache Kafka and Apache Spark Streaming. The pipeline ingests data, processes it in real time, and writes the processed data to a data lake for storage and further analysis.
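The three stages described above (ingest, process, output) could be wired together roughly as follows. This is a minimal sketch, not the project's actual code: the broker address, topic name, event schema, and output paths are all hypothetical placeholders, and it uses Spark's Structured Streaming API, which may differ from the streaming API the project itself uses. Running `build_query` requires `pyspark` and the `spark-sql-kafka` connector package to be available to Spark.

```python
# Hypothetical settings: the project's real broker, topic, and paths are not stated.
KAFKA_BOOTSTRAP = "localhost:9092"
KAFKA_TOPIC = "events"
LAKE_PATH = "/tmp/datalake/events"
CHECKPOINT_PATH = "/tmp/datalake/_checkpoints/events"


def kafka_source_options(bootstrap=KAFKA_BOOTSTRAP, topic=KAFKA_TOPIC):
    """Return the options passed to Spark's built-in Kafka source."""
    return {
        "kafka.bootstrap.servers": bootstrap,
        "subscribe": topic,
        "startingOffsets": "latest",
    }


def build_query(spark):
    """Wire Kafka -> parse/filter -> Parquet files and start the stream.

    `spark` is an existing SparkSession. pyspark is imported lazily so the
    configuration helper above can be used without Spark installed.
    """
    from pyspark.sql.functions import col, from_json
    from pyspark.sql.types import DoubleType, StringType, StructField, StructType

    # Assumed event shape: {"id": "...", "value": 1.23}
    schema = StructType([
        StructField("id", StringType()),
        StructField("value", DoubleType()),
    ])

    # Ingest: read the Kafka topic as an unbounded streaming DataFrame.
    raw = spark.readStream.format("kafka").options(**kafka_source_options()).load()

    # Process: decode the message bytes as JSON and drop rows whose
    # "value" field is missing or could not be parsed.
    parsed = (
        raw.select(from_json(col("value").cast("string"), schema).alias("e"))
        .select("e.*")
        .where(col("value").isNotNull())
    )

    # Output: append Parquet files to the data lake location.
    return (
        parsed.writeStream.format("parquet")
        .option("path", LAKE_PATH)
        .option("checkpointLocation", CHECKPOINT_PATH)
        .outputMode("append")
        .start()
    )
```

With a running broker and a `SparkSession` named `spark`, calling `build_query(spark).awaitTermination()` would keep the stream running. The checkpoint location lets Spark resume from the last committed Kafka offsets after a restart.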

Primary Language: Python
