Apache Airflow training
This project serves as a skeleton project on which Airflow training participants build exercises. There are two versions, one cloud based training and one on-premise based training.
On-Premise training
The on premise based training will run Apache Airflow based on a Docker stack. This includes docker-compose to hook up multiple containers to simulate a multi container setup. A guide on how to setup k8s locally: https://xebia.com/blog/running-kubernetes-locally-docker-mac-os-x/
git clone git@github.com:godatadriven/airflow-training-skeleton.git
cd airflow-training-skeleton/
kubectl apply -f k8s/
Hit up a browser and browse to http://localhost:30808/admin/ to see the admin screen.
Cloud based training
At the start of the training, you will receive a Google Cloud Platform project, with a pre-configured Cloud Composer environment (Google's managed Airflow).
Training participants generally like taking their code home after the training, so we provide them with a skeleton Airflow project on GitHub. In order to push and deploy, we kindly ask the participants to setup their own CI because we don't know the GitHub handles of participants beforehand. However the setup is just a few clicks, described below.
A CI pipeline is included with the project (cloudbuild.yml
), which defines steps to execute after each push. However, there are variables which must be entered before applying the CI pipeline. The CI pipeline tests and deploys your code to Google Cloud Composer.
- Fork the repository.
- In the GCP console, go to your GCP project, and browse to Cloud Build.
- Go to Build triggers.
- Click "Create trigger".
- Select source GitHub and click Continue.
- You have to allow Google Cloud Platform to access your repositories.
- In Cloud Build, you can now select your repositories. Select this forked repository and check the consent box.
- Under
Build configuration
, selectcloudbuild.yaml
and click Create trigger at the bottom.
Done. The just configured build trigger now watches changes on your repository.