This was a very simple project to learn airflow and storing files in a AWS S3 bucket. In this case the extracted data are movies from the iTunes search API, then we extract some fields and create a CSV then upload it an Amazon S3 bucket. Everything was run on a Ubuntu EC2.
-iTunes API
-Python
-EC2
-S3 bucket
ETL.py which contains the main function that does the extraction, transforming and upload of the CSV.
itunes_dag.py contains the orchestration code that airflow uses to call the ETL function.