airscholar/e2e-data-engineering
An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Kafka, Apache Zookeeper, Apache Spark, and Cassandra. All components are containerized with Docker for easy deployment and scalability.
Python
Issues
- 1
- 2
ython dags/kafka_stream.py not working
#7 opened by oavioz - 4
Connect with Kafka not able to form
#4 opened by choonhongyeoh0241 - 1
- 1
Container start error
#3 opened by vraj-apto - 1
web server is not working,Which dependency.
#2 opened by Ram443