airscholar/e2e-data-engineering
An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Kafka, Apache Zookeeper, Apache Spark, and Cassandra. All components are containerized with Docker for easy deployment and scalability.
Python
Stargazers
- adi611India
- AITYOUB-Abdelmoughit
- anastasiarblv
- Antonet99UNIVPM
- Ataa55Egypt
- ayushdayani28
- azarshab-saeed
- bloopepper
- edikemput1001
- giufalcaoThoughtWorks
- hameddavodiMilan, Italy
- HiAmChaseDa Nang
- himasha0421Laivly
- HnshlrBiot, Provence-Alpes-Côte d'Azur, France
- ibroh24NetPlusDotCom
- karanmrnLondon,United Kingdom
- kimj98
- Lizosysrinakharinwirot university
- Longwinter93
- marcos-data-engineerStudent
- michaelearncodingUniversity of Waterloo
- Mohamed-fawzyyNew Cairo - Fifth settlement
- morshed-sarwerDhaka, Bangladesh
- Naidine13015
- ng-hiepFPT Software
- NitinDatta8
- Rahul-shakyaMr.
- rahulraogrrBarclays plc
- RATTLESNAKE-VIPERMumbai,Maharashtra,India
- ravinani02
- sfuller14Northwestern University
- TDL77
- trijuhari
- veerak12
- YeonjiKim0316Seoul
- yzptLille (France)