How I Use Docker for Data Science

Developing Models for Deployment

Prerequisites

Ensure that you have a working version of docker on your machine. Installation instructions for all major platforms can be found on the docker website.

You should be able to execute docker --version and see something like Docker version 1.10.3, build 20f81dd

If you are on linux, installation is very straight forward as docker takes advantage of the host's native kernel

If you are on Mac or Windows, you will need to use the docker-machine tools provided in the installation to run docker on a virtual machine. This is somewhat seamless, just follow the tutorials

create docker machine named dev:

docker-machine create -d virtualbox dev

start docker machine named dev:

docker-machine start dev

stop docker machine named dev:

docker-machine stop dev

Play with it

Take a look at the Hello World tutorial on docker.com

See what docker ps, docker logs, and docker stop do...

Development

You can view the presentation in Presentation.ipynb

We'll be using the repos:

Don't worry too much about what is in these repos just yet, but some familiarity will help.

Usage

If docker is properly installed, the workflow below will work

git clone https://github.com/mconley-kaizen/dockerapp.git
cd dockerapp
docker build -t mydockerapp .
cd ..
git clone https://github.com/mconley-kaizen/deploy.git
cd deploy
source launch.sh
deploy 5000 iris_prediction mydockerapp