Learning ansible

This is a intermediate level course to learn ansible from scratch.

The course is made to work on Debian based systems (Ubuntu, Mint...) but should be easily transferable to other Linux distributions or Unices.

create a virtual machine

We set up a virtual machine that we're configuring via ansible.

first configure a local virtual machine to work with
create yourself a clean new user and change into it
```
$ sudo adduser learnansible
$ su - learnansible
```

copy your ssh key over to the VM

create a ssh key pair
```
learnansible:~$ ssh-keygen
```
for the sake of simplicity do not protect your key with a password.

copy your new ssh public key over to the vm

learnansible:~$ ssh-copy-id -p 5555 root@localhost

test that connecting to the VM works without a password prompt
```
learnansible:~$ ssh root@localhost -p 5555
```

configure ssh for easy ssh access to the vm

learnansible:~$ vim ~/.ssh/config
Host vm
HostName localhost
Port 5555
User root

test that you can now log into the vm quickly
```
learnansible:~$ ssh vm
```

preparations...

make yourself a working directory for ansible

learnansible:~$ mkdir ansible
learnansible:~$ cd ansible

from here on, we'll allways be working as the learnansible user inside the ~/ansible directory
```
learnansible:~/ansible$ pwd
/home/learnansible/ansible
```
install ansible
```
$ sudo apt-get install ansible
```

define the ansible inventory

create the inventory of the machines ansible will take care of
```
$ mkdir inventory
$ vim inventory/production
vm
```

tell ansible about where its resources are

$ vim ansible.cfg
[defaults]
inventory = /home/learnansible/ansible/inventory/production

ansible in ad-hoc mode

your hello world ansible

$ ansible vm -m ping
vm | SUCCESS => {
    "changed": false,
    "ping": "pong"
}

What happened here? We have called ansible in the ad-hoc mode. We have told ansible where to execute a command - namely on the vm machine. And we have told what to do on that machine -m ping, which means "execute the module ping".

What the ping module does, is that it gets executed on the remote machine, and emits a "pong"?

The result is green, so everything is OK, also ansible explicitly tells us "SUCCESS".

We also see that the ping module didn't "changed: false" anything on the remote machine (it only emitted a "pong").

Instead of telling ansible to execute the module on the vm machine, we could tell ansible to execute it on "all machines it knows about":

$ ansible all -m ping

Try that out.

How does ansible know how to connect to the vm machine? It uses ssh by default to connect to a machine, and we have configured ssh previously to know about Host vm.

There are other modules...

the command module

The command module is the default ansible module. So we can write:

$ ansible vm -m command -a "echo bla"

(Execute that!)

Or, because command is the default module, we can omit the -m command parameter:

$ ansible vm -a "echo bla"

What's -a "echo bla"? It's the parameter we give to the module. The command module expects to be given the command to execute, which is echo followed by whatever we would like to pass on to the echo command.

other modules

There are many more modules.

executing on many machines

Until now, it seems like ansible is just a pathetic wrapper around ssh. But we're just starting. We can define groups of machines (and groups of groups) for ansible to execute stuff on. So we can...

$ vim inventory/production
[webservers]
vm
othermachine

... define a group and tell ansible to execute on that group:

$ ansible webservers -a 'echo bla'
vm | CHANGED | rc=0 >>
bla

othermachine | CHANGED | rc=0 >>
bla

Unfortunately we do not have a othermachine but we can fake it, by defining it exactly as our first vm machine:

$ vim ~/.ssh/config
[...]
Host othermachine
HostName localhost
Port 5555
User root

That connects to our original VM, but ansible doesn't know or care, so...

(Try ansible webservers -a 'echo bla' now)

Ansible has syntax to express on which machines and groups you want to have stuff executed (or not).

inventories

The inventory above is just one form of inventory that ansible can use to find out about its managed machines. Ansible has inventory plugins that let you access read diverse inventory formats, also dynamic ones, which allow you to access inventories in live databases or other configuration management systems.

configuring machines

If we want to configure more that one thing on a (set of) machine(s) then we want to use playbooks.

A simple playbook that checks whether another machine is reachable via the same ping module as before looks like this:

$ vim available.yml
- hosts: vm
  tasks:
    - name: is the machine there?
      ping:

You'll notice the correspondence with the command:

$ ansible vm -m ping

Also, you'll maybe notice, that the format of the playbook is YAML.

You can run that playbook:

$ ansible-playbook available.yml 

PLAY [vm] ********************************************

TASK [Gathering Facts] *******************************
ok: [vm]

TASK [is the machine there?] *************************
ok: [vm]

PLAY RECAP *******************************************
vm         : ok=2    changed=0    unreachable=0    failed=0

Ansible executes the play, executes a "Gathering Facts" task and then our "is the machine there?" task.

In the end it shows us a summary of the results of the actions taken.

information gathering

What's that "Gathering Facts" task we've just seen? It's ansible finding out things about the target system.

We can run this fact gathering step standalone...

$ ansible vm -m setup
vm | SUCCESS => {
    "ansible_facts": {
        "ansible_all_ipv4_addresses": [
            "10.0.2.15"
        ],
        "ansible_all_ipv6_addresses": [
            "fec0::5054:ff:fe12:3456",
[...]

... and it outputs heaps of infos about the system, which could be reused to f.ex. feed some other database.

Within the context of an ansible play it can be used to make decisions:

conditionals, variables

Ansible's gather_facts aka setup collects information that we can reuse in plays:

$ vim is_buster.yml
- hosts: vm
  tasks:
    - name: tell user that we're on buster
      debug: msg="whoa, we're on Debian buster"
      when: ansible_distribution_release == 'buster'

$ ansible-playbook is_buster.yml
PLAY [vm] ********************************************

TASK [Gathering Facts] *******************************
ok: [vm]

TASK [tell user that we're on buster] ****************
ok: [vm] => {
    "msg": "whoa we're on Debian buster"
}

PLAY RECAP *******************************************
vm         : ok=2    changed=0    unreachable=0    failed=0

We are using the debug module here, that accepts a msg parameter, which it outputs.

Also the task is executed conditionally, depending on whether the when: condition is true or false.

In this case we're checking whether ansible's ansible_distribution_release variable is equal to the value buster.

The ansible_distribution_release variable gets set by ansible's gather_facts aka setup task.

Jinja2

What follows the when: ansible keyword is a "jinja2 expression". Jinja2 is a Python templating language. That means that all that jinja2 can do, we can use for conditionals.

But we can use jinja2 expressions more or less anywhere in a playbook (or in a task or in a role, more later):

- hosts: vm
  tasks:
    - name: tell user that we're on buster
      debug: msg="whoa, we're on Debian {{ ansible_distribution_release }}"

This will do what you think it will (try it out!). If ansible finds a double quoted string with a "{{ }}" element then everything inside the curly brackets will be interpreted as a jinja expression. So we might write...

[...]
debug: msg="Are we on buster - {{ ansible_distribution_release == 'buster' }}"

...and we'll get:

[...]
ok: [vm] => {
    "msg": "Are we on buster - True"
}

task formatting

Previously we've written:

- name: tell user that we're on buster
  debug: msg="whoa, we're on Debian {{ ansible_distribution_release }}"

Ansible understands that and it's ok for short one liners. A bit longer would, but semantically equivalent would be clean YAML:

- name: tell user that we're on buster
  debug:
    msg: "whoa, we're on Debian {{ ansible_distribution_release }}"

What do we have there syntactically? A - introduces an element in an array in yaml. So we're adding a task to the tasks list.

If a line is not preceded by a minus, then that's a key value pair. So name is the key of a dictionary entry and "tell user that we're on buster" is the value of that dictionary entry.

In fact we're defining the "name" of a task inside a tasks list.

Next debug is the name of a module. That module takes a parameter. So we have a dictionary entry with the name "debug" that consist of a dictionary or as value. That dictionary contains the parameters for that debug module, first of which msg which represents the message that debug should output.

In fact msg itself can be an array, in which case we'd write:

- name: tell user that we're on buster
  debug:
    msg:
      - "Whoa!"
      - "We're on Debian {{ ansible_distribution_release }}"

roles

Ansible has a concept of "roles". The metaphorical idea would be that you have machines, that you assign roles in a playbook. You have machine vm that acts like a mail server f.ex.

So you would create a role mail_server:

$ mkdir -p roles/mail_server

Our first task in that role would be to install the server. There's a specialized module for package installation: apt:

$ mkdir roles/mail_server/tasks
$ vim roles/mail_server/tasks/main.yml
- name: install exim mail server
  apt:
    pkg: exim4

Ansible is searching for tasks in a role by default under roles/role_name/tasks/main.yml.

Now let's assign that role to our vm:

$ vim mail_hub.yml
- hosts: vm
  roles:
    - mail_server

and execute it:

$ ansible-playbook mail_hub.yml 
PLAY [vm] ********************************************

TASK [Gathering Facts] *******************************
ok: [vm]

TASK [mail_server : install exim mail server] ********
changed: [vm]

PLAY RECAP *******************************************
vm         : ok=2    changed=1    unreachable=0    failed=0

idempotence

let's execute it again:

$ ansible-playbook mail_hub.yml 
PLAY [vm] ********************************************

TASK [Gathering Facts] *******************************
ok: [vm]

TASK [mail_server : install exim mail server] ********
ok: [vm]

PLAY RECAP *******************************************
vm         : ok=2    changed=0    unreachable=0    failed=0

Now ansible is telling us, that the result of our "install exim mail server" was "ok", because it didn't need to re-install the package.

If we run the playbook again and again, the result will be the same. If we deinstall the package then ansible will re-install it again.

So we want two things:

if nothing needs to be changed, then ansible should detect that and take no action
no matter what, the result of a play should be always the same. Play, roles, tasks should be idempotent.

copy module

Let's copy a TLS certificate over to the mail server, so that it can do SMTPS.

We use the copy module for that:

$ vim roles/mail_server/tasks/main.yml
[...]

- name: copy certificate over to mail server
  copy:
    src: certificate.key
    dest: /etc/exim4/certificate.key

If we run the playbook we note that ansible can't find certificate.key. Let's provide it to ansible:

$ mkdir roles/mail_server/files

That is the default place where roles look for files. Let's put the certificate.key there:

$ cp certificate_from_somewhere.key roles/mail_server/files/certificate.key

(You can use whatever you want for certificate_from_somewhere.key, it won't be used, this is just an example).

file encryption

If we keep our ansible infrastructure in git, then roles/mail_server/files/certificate.key would be in there in clear text.

Let's encrypt it then:

$ ansible-vault encrypt roles/mail_server/files/certificate.key

Ansible will ask for passwords. After this that file is encrypted. Now however you will need to tell the ansible to ask you for the password:

$ ansible-playbook mail_hub.yml --ask-vault-pass

Otherwise ansible will complain. Ansible will automatically and transparently decrypt the file as it encounters it.

iterating over items

The TLS certificate we want to copy over to the server consists of two parts the public certificate and the private key. We have only copied over the key.

If we want to copy over the public certificate as well then we could just add:

$ vim roles/mail_server/tasks/main.yml
[...]

- name: copy public certificate over to mail server
  copy:
    src: certificate.pub
    dest: /etc/exim4/certificate.pub

We duplicate the existing instructions. But we want to follow the DRY principle.

We can achieve that by iterating over the files we want to have copied over to the server. So instead we write:

$ vim roles/mail_server/tasks/main.yml
[...]

- name: copy certificate parts over to mail server
  copy:
    src: "{{ item }}"
    dest: "/etc/exim4/{{ item }}"
  with_items:
    - certificate.key
    - certificate.pub

Now run that playbook.

templates and lineinfile

Next we'd like configure the smarhost, where our new mailserver relays emails to.

For that we need to set dc_smarthost='mailrelay.example.org' in /etc/exim4/update-exim4.conf.conf.

There are various approaches to solving this problem. One is to replace just the on line in that file. We could use the lineinfile module for that. There are also a blockinfile and a replace that provide similar functionality.

Let's use a different approach and use templates instead:

$ vim roles/mail_server/tasks/main.yml
[...]

- name: set smart host
  template:
    src: update-exim4.conf.conf.j2
    dest: /etc/exim4/update-exim4.conf.conf

If we run the playbook, we'll notice that ansible can't find update-exim4.conf.conf.j2. Let's create it. First create the templates directory where a role by default expects templates to be.

$ mkdir roles/mail_server/templates/

Now copy the original config file from our mailserver to the templates directory:

$ scp vm:/etc/exim4/update-exim4.conf.conf roles/mail_server/templates/update-exim4.conf.conf.j2

You'll notice the .j2 extension, which implies that this is a Jinja2 template. Now we modify that original file:

$ vim roles/mail_server/templates/update-exim4.conf.conf.j2

We replace the line:

dc_smarthost=''

with

dc_smarthost='{{ mail_relay }}'

You notice the Jinja2 syntax.

We could have put:

dc_smarthost='mailrelay.example.org'

in there, but this time we want to make the task generic, so in the future we can use that role to configure other mail hubs. So we now need to set the variable mail_relay:

$ vim mail_hub.yml
- hosts: vm
  roles:
    - { role: mail_server, mail_relay: mailrelay.example.org }

role syntax in playbooks

alternatively we can write:

$ vim mail_hub.yml
- hosts: vm
  roles:
    - role: mail_server
      mail_relay: mailrelay.example.org

$ vim mail_hub.yml
- hosts: vm
  roles:
    - role: mail_server
      vars:
        mail_relay: mailrelay.example.org

it amounts to the same. You'll maybe want to use the first form for short one-liners and the second one otherwise.

more on Jinja2

Jinja2 is a powerful templating language. It supports conditionals, looping, filters and more. You'll want to browse its documentation if you need to construct more complex templates.

default values for variables

If we set up a few mail servers, we might notice that in most of the cases our mail relay is mailrelay.example.org, we only rarely are requred to use a different one.

So let's add a default for the mail_relay variable:

$ mkdir roles/mail_server/defaults/
$ vim roles/mail_server/defaults/main.yml
mail_relay: mailrelay.example.org

Now we do not need to pass the mail_relay variable to the mail_server role if it's mailrelay.example.org:

$ vim mail_hub.yml
- hosts: vm
  roles:
    - mail_server

handlers

Only configuring the mail server is not enough for the changes to take effect. Often we need to restart or reload the daemon.

However restarting a daemon on every change we apply to configuration is inefficient. What we want is to relaunch the daemon after all changes have been applied.

That's what handlers are for. We notify them of the need to perform an action and in the end, after all else is done, ansible will execute that action.

In our case here, we also need to run update-exim4.conf that will produce /var/lib/exim4/config.autogenerated from the /etc/exim4/update-exim4.conf.conf settings.

So first we notify the handler:

- name: set smart host
  template:
    src: update-exim4.conf.conf.j2
    dest: /etc/exim4/update-exim4.conf.conf
  notify: update exim4 config

And we add a handler that knows what to do:

$ mkdir roles/mail_server/handlers
$ vim roles/mail_server/handlers/main.yml
- name: update exim4 config
  command: update-exim4.conf

more important stuff

Only keywords here, since I'm running out of time:

writing modules
--check --diff
tags
action modifiers: become_user, change_mode, changed_when, ...
set_fact
include/include_role/import_task
blocks
per host, per group variables

clean up

Remove the stuff we've created.

deluser --remove-home learnansible

consulting and courses

If you want to get ansible consulting or hire me for an ansible course then please contact me at

Tomáš Pospíšek, tpo_hp at sourcepole.ch

credits

Thanks:

Mathias Walker

sourcepole/learning_ansible