Run DC/OS (including systemd) in Docker containers, one per node, using Docker-in-Docker for DC/OS services/jobs.
DC/OS Docker is designed to optimize developer cycle time. For a more production-like local experience, see DC/OS Vagrant which runs each node in its own VM.
- Because of Docker-in-Docker, DC/OS services (like Jenkins) that themselves use Docker-in-Docker may not work correctly.
- Because containerization does not affect resource detection tools, each DC/OS node will think it can allocate all of the host's resources, leading to over-subscription without protection. You are still bound by the disk and memory constraints of the host, even if DC/OS thinks you have N (number of agent nodes) times more. Running Docker in a VM can protect your host from this and allows you to designate how much disk/memory/cpu DC/OS gets in total. Running Docker directly on a Linux host gives DC/OS more resources to play with but may also freeze your machine if you run too many DC/OS services/jobs.
- DC/OS 1.8.2 introduced a bug that makes it incompatible with dcos-docker. Use DC/OS 1.8.1 or master until 1.8.3 is released.
- systemd
- make
- Docker 1.11
- A recent kernel that supports Overlay FS
- git
- VirtualBox 5.0.x
- Vagrant 1.8.4
- git
The following steps are REQUIRED on all hosts.
-
Clone this repo
git clone https://github.com/dcos/dcos-docker cd dcos-docker
-
Download DC/OS or Enterprise DC/OS
-
Move the installer to
dcos_generate_config.sh
in the root of this repo directory.
The following steps are REQUIRED on Macs and OPTIONAL on Linux.
-
Install VirtualBox
-
Install Vagrant
-
(Optional) Install vagrant-vbguest plugin (auto-updates vbox additions)
vagrant plugin install vagrant-vbguest
-
(Optional) Resize the vagrant disk
DC/OS should deploy with the default disk size of 10GB, but for larger deployments you may need to increase the size of the VM.
The first argument is the desired disk size in MB (ex: 102400 is 100GB).
vagrant/resize-disk.sh 102400
-
Bring up the virtual machine
vagrant up
-
SSH into the virtual machine
vagrant ssh
-
Deploy DC/OS in Docker
make
-
(Optional) Wait for DC/OS to come up
make postflight
For other make commands, see make help
.
To make the Docker containers in the VM reachable from the host, you can route Docker's IP subnet (172.17.0.0/16
) through the VM's IP (192.168.65.50
). This routing is not required if you deployed DC/OS to Docker on a native Linux host.
On Linux:
host$ sudo ip route replace 172.17.0.0/16 via 192.168.65.50
On Mac OS X:
host$ sudo route -nv add -net 172.17.0.0/16 192.168.65.50
Once routing is set up, you can access DC/OS directly from the host.
On Linux:
host$ sudo ip route del 172.17.0.0/16
On Mac OS X:
host$ sudo route delete 172.17.0.0/16
With network routing configured, you can SSH directly into DC/OS nodes from the host:
host$ ssh -i genconf/ssh_key root@172.17.0.2
Or you could use the DC/OS CLI:
dcos node ssh --leader --user=root --option IdentityFile=genconf/ssh_key
From the Linux host (or SSH'd into Vagrant) you can also use Docker exec to open a shell:
$ docker ps --format="table {{.ID}}\t{{.Names}}\t{{.Status}}"
CONTAINER ID NAMES STATUS
7498dcbe4e3e dcos-docker-pubagent1 Up About a minute
b66175f0a18a dcos-docker-agent1 Up About a minute
e80466ce71c9 dcos-docker-master1 Up About a minute
$ docker exec -it dcos-docker-master1 bash
There is no requirement on the hosts storage driver type, but the docker daemon
running inside docker container supports only aufs
and overlay
. The loopback
devicemapper may be problematic when it comes to loopback devices - they may not
be properly cleaned up and thus prevent docker daemon from starting. YMMV
though.
Unless user specifies graphdriver using DOCKER_GRAPHDRIVER
env variable,
the script tries to use the same one as the host uses. It detects it using
docker info
command. The resulting graphdriver must be among supported ones,
or the script will terminate.
This defaults to 1 master and 1 agent. You can change the number of masters by
setting the variable MASTERS
. You can change the number of agents by setting
the variable AGENTS
. For example:
$ make MASTERS=3 AGENTS=5
# start a cluster with 3 masters and 5 agents
NOTE: This feature should only be used for testing, it is unstable.
By default the cluster will be spun up using a centos base image but if you want to test something else you can run:
$ make DISTRO=fedora
Oh dear, you must be in an unfortunate position. You have a few options with regard to debugging your container cluster.
If the containers are currently running then the best option is to docker exec
into the master or agent and poke around. Here is an example of that:
$ docker exec -it dcos-docker-master1 bash
# list the systemd units
[root@dcos-docker-master1 /]# systemctl list-units
...
dbus.socket loaded active running D-Bus System Message Bus Socket
systemd-fail.service loaded failed exited Journal Audit Socket
systemd-journald-dev-log.socket loaded active running Journal Socket (/dev/log)
systemd-journald.socket loaded active running Journal Socket
basic.target loaded active active Basic System
dcos.target loaded active active dcos.target
local-fs.target loaded active active Local File Systems
...
# find the failed unit and get the status
[root@dcos-docker-master1 /]# systemctl status systemd-fail
# get the logs from journald
[root@dcos-docker-master1 /]# journalctl -xefu systemd-fail
For the dcos-spartan
service to start successfully, make sure that
you have dummy net driver support (CONFIG_DUMMY
) enabled in your kernel.
Most standard distribution kernels should have this by default. On some
older kernels you may need to manually install this module with
modprobe dummy
before starting the container cluster.
Various labels used on pull requests and what they mean
Work in progress
The code is a work in progress / not yet ready to be reviewed or acted upon by others. It can be handy to open up a PR in order to share work / ideas with others. Use this label to indicate the PR isn't intended to be reviewed or merged.Request for comment
The code is some idea which may or may not land, but there are questions if the approach is right. Review should focus on whether or not it is overall a good idea to do this and how to structure it.Ready for review
The author thinks the PR is ready to land, and is looking for a review in order to get it in. The PR may bounce back to "work in progress" or "request for comment" if it needs more work or discussion. Might also just do all the review and fixup with the label attached.