/tabula-docker

Tabula data table PDF extraction for Docker http://tabula.technology/

Primary LanguageDockerfile

Tabula on Docker

Tabula is a tool for liberating data tables trapped inside PDF files. This is a Docker image to run the web server inside a lightweight Linux container.

Run from Docker Hub

docker run -d -p 8080:8080 asnelling/tabula

Now open your browser to http://localhost:8080/

Run options

You may wish to tweak the docker run command above in order to...

  • specify a specific version of Tabula

    # Example: v0.9.3
    docker run -d -p 8080:8080 asnelling/tabula:0.9.3
    
  • listen on a different port

    # Example: port 8100
    docker run -d -p 8100:8080 asnelling/tabula
    

    Access the web interface at http://localhost:8100/

  • follow the logs while the container runs

    docker run -p 8080:8080 asnelling/tabula
    

You may find more options in the official documentation

Build your own image

Instead of pulling a pre-built container image from asnelling/tabula, you may build your own using the Dockerfile in this repo:

git clone git@github.com:asnelling/tabula-docker.git
docker build -t mytabula tabula-docker
docker run -d -p 8080:8080 mytabula