airscholar/RedditDataEngineering
This project provides a comprehensive data pipeline solution to extract, transform, and load (ETL) Reddit data into a Redshift data warehouse. The pipeline leverages a combination of tools and services including Apache Airflow, Celery, PostgreSQL, Amazon S3, AWS Glue, Amazon Athena, and Amazon Redshift.
Python
Stargazers
- AfzalAliSolangi
- AleksalencarSão Paulo
- Antonet99UNIVPM
- bhanmrinalIndia
- d4datgirl
- Darpen24
- el-tegyESIGELEC
- Golder12
- IDCloverRi
- JaleelJenkins
- jaswanth333George Mason University
- katrinajane
- kotapullarao
- lcshlr
- lfy79001Institute of Automation, Chinese Academy of Sciences
- makenaichu970413
- mediumhust
- mohiteyashprogrammerMaharashtra,Mumbai
- MoraQsNigeria
- nguyendoanbb
- ninksazeracKing Mongkut's Institute of Technology Ladkrabang
- OGsiji@meritcapital
- omarsinno54
- rakeshacharya-dUT Dallas
- rattamnoonOrigin Property Public Company Limited
- Rcortes13Nashville, TN
- RkvishnuRemote
- Salamaleko
- SyedAfjal
- teju163dst
- therohanchoudharyGurgaon
- TriRizki
- tuanpa2295Hanoi
- tuanpham12215
- useDeep
- wentaozheng7