This pipeline is built using Airflow and Python.
- Data is extracted using Twitter API.
- Then it is transformed using python and its libraries.
- Create an EC2 instance (Ubuntu) and output S3 bucket. (Also, setup the airflow in EC2)
- Deploy the code onto EC2 and airflow.
- Airflow schedules and runs the code which saves the result onto the S3 bucket.