Overture Point of Interest (POI) data for the United Kingdom. Automatically queries the latest Overture AWS data using Dagster for ETL orchestration. The project is containerised using Docker (or Podman) Compose for easy deployment and management. Uses DuckDB with the spatial plugin to query only the UK bounding box.
-
Clone this repository:
git clone git@github.com:cjber/ingestion-checks.git
-
Navigate to the project directory:
cd overture-uk
- Run Project
To run the project, execute the following command:
NOTE: all docker
commands can be substituted with podman
docker compose up
Add
-d
to this command if you would prefer to run it in the background.
This starts the Docker containers for the Dagster Web server, Dagster Daemon, and the code container. Navigate to localhost:3000
to manage the Dagster orchestration pipeline. To initiate the automation; 'Automation→Schedules' toggle on to find new releases daily, and 'Automation→Sensors' to process new releases when found.
If you want to stop all containers, execute:
docker compose down
To also remove the containers, execute:
docker compose down --rmi all # (does not work with podman)