This repository contains a simple Apache Airflow pipeline designed to process data, hyperparam tune a xgboost model, and test it using Airflow orchestration. The pipeline is Dockerized for easy deployment and reproducibility.
Before getting started, make sure you have Docker installed on your machine.
To run the machine learning pipeline, follow these steps:
-
Start the Airflow service:
docker compose up --build
-
Access the Airflow UI at http://localhost:8080/.
-
Log in using the following credentials:
- Username: airflow
- Password: airflow
-
Trigger your desired pipeline from the Airflow dashboard.
To stop the Airflow service and remove the containers, run:
docker compose down