DBpedia Databus Collection Downloader

This is a lightweight, dockerized data downloader for the DBpedia Databus. The container takes a collection URI and downloads the collection's files to a local directory. It does not support any format or compression conversion. If your application requires conversion, please check the Databus Client.

Run the container with the following two environment variables:

  • TARGET_DIR: The target directory for the downloaded files
  • COLLECTION_URI: A collection URI on the DBpedia Databus

To access the downloaded files on your local machine, mount a volume at the specified TARGET_DIR.
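As a minimal sketch of running the container without docker-compose, the two variables and the volume mount can be passed directly to docker run. The image name and the /root/data path are taken from the docker-compose example in this README. The start_download helper and the DOCKER override are hypothetical names introduced only for this illustration.

```shell
# Image name and container path as used in the docker-compose example of this README.
IMAGE="dbpedia/dbpedia-databus-collection-downloader:latest"
CONTAINER_DIR="/root/data"

# start_download is a hypothetical helper, not part of this project.
#   $1 = collection URI on the DBpedia Databus
#   $2 = host directory that should receive the downloaded files
start_download() {
  collection_uri="$1"
  mkdir -p "$2"
  # Resolve to an absolute path so docker treats it as a bind mount.
  host_dir="$(cd "$2" && pwd)"
  # DOCKER may be overridden (assumption of this sketch); it defaults to docker.
  "${DOCKER:-docker}" run --rm \
    -e COLLECTION_URI="$collection_uri" \
    -e TARGET_DIR="$CONTAINER_DIR" \
    -v "$host_dir:$CONTAINER_DIR" \
    "$IMAGE"
}
```

Calling start_download "https://databus.dbpedia.org/dbpedia/collections/latest-core" ./download should then place the files in ./download on the host.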

Docker Image

You can find the Docker image on Docker Hub (dbpedia/dbpedia-databus-collection-downloader) or build it yourself by running

docker build -t databus-download-min .

in the project's root directory.

Lock File

To make this container interoperable with others, the process creates a download.lck file in the TARGET_DIR directory (default: /root/data/) on startup and removes it once the download has finished. Other containers with access to the mounted folder can check whether this file exists and wait for the download process to finish.
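A consuming container could poll for the lock file roughly as follows. This is only a sketch: wait_for_download is a hypothetical helper, and the polling interval is an arbitrary choice. Note that the check also passes if the consumer starts before the downloader has created the lock file, so the start order of the containers still matters.

```shell
# wait_for_download is a hypothetical helper, not part of this project.
#   $1 = directory shared with the downloader (default: /root/data)
#   $2 = polling interval in seconds (default: 5)
wait_for_download() {
  data_dir="${1:-/root/data}"
  interval="${2:-5}"
  # Block for as long as the downloader's lock file is present.
  while [ -e "$data_dir/download.lck" ]; do
    sleep "$interval"
  done
}
```

A consumer would call wait_for_download /root/data before reading any files from the shared folder.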

Example

The docker-compose.yml in this repo shows an example configuration.

version: "3.0"
services:
  databus-download:
    image: dbpedia/dbpedia-databus-collection-downloader:latest
    environment:
      COLLECTION_URI: https://databus.dbpedia.org/dbpedia/collections/latest-core
      TARGET_DIR: /root/data
    volumes:
      - ./download:/root/data

This configuration downloads the DBpedia Latest Core release into a ./download folder next to your docker-compose.yml. To access the files on your local machine, mount a volume at the TARGET_DIR folder, as the volumes entry above does.

Start the download by running

docker-compose up