/cloud-seed

Fetch user-data from a cloud provider, set the hostname, and write files. A very minimal alternative to cloud-init.

Primary LanguageRustApache License 2.0Apache-2.0

cloud-seed

crates.io

cloud-seed is a very minimal alternative to cloud-init. It can set the hostname and write files based on directives provided in user data.

Motivation

Launching a server should be deterministic: launching the same image with the same configuration should have the same result. To that end, server images should include all software and base configuration required for the function of the server.

Installing software and performing configuration via scripts on startup introduces opportunities for failure or non-determinism. To update software or fundamentally change the configuration of the server, a new image should be built. The image build process should be automated and frictionless to allow for rapid deployment when necessary.

cloud-seed exists because it is often necessary to provide some "seed" data to an otherwise-ready-to-run image: some values should not be baked into the image due to them varying from server to server.

Existing solutions like cloud-init go far beyond this basic requirement, with the ability to install packages, run scripts, modify network configuration, modify partitioning and more. All of these goals can be achieved in a safer and more repeatable way either during the image build process or through appropriate configuration contained within the image, in combination with values provided via cloud-seed where necessary.

Building from source

cargo build --release
cp target/release/cloud-seed $IMAGE_ROOT/usr/bin/
cp cloud-seed.service $IMAGE_ROOT/usr/lib/systemd/system/

Pull requests to add packaging scripts are welcomed.

User data format

The user data consists of #cloud-seed followed by a newline and then a JSON object. The entire user data including the #cloud-seed header can optionally be compressed with gzip, which is automatically detected.

Two keys are currently defined for the JSON object: files, which is an array of file objects, and hostname, which is a string. Both are optional. All other keys are ignored.

For example:

#cloud-seed
{
  "hostname": "myserver",
  "files": [
    {
      "path": "/etc/foobar/foobar.conf",
      "content": "Zm9vYmFyCg==",
      "encoding": "base64",
      "owner": "root:root",
      "permissions": "0644",
      "append": false
    }
  ]
}

If hostname is set, it will be passed to sethostname(2) to set the system hostname when cloud-seed runs.

If files is set, each object in the array describes a file that will be written by cloud-seed. The supported fields are:

Field Description
path Required. The path to the file to be written. Parent directories are created as needed. Relative paths are interpreted relative to cloud-seed's working directory.
content The content to be written, encoded according to the encoding key. Defaults to no content.
encoding The encoding of the content value. Can be plain (the default), base64 or base64gzip. Rather than using the base64gzip encoding, consider compressing the entire user data with gzip.
owner The user and group that should own the file, in the format user:group. Defaults to the user that is running cloud-seed and their primary group.
permissions The mode that files should be created with, specified as an octal string. Defaults to 0644. If the file already exists, cloud-seed will not change its mode.
append If true, the content will be appended to the file if it already exists. If false (the default), the file will be truncated before the content is written.

Note about sensitive data

The user data on cloud systems is not generally appropriate for the storage of sensitive data, and therefore tools like cloud-seed (or cloud-init) should not be used to seed credentials or other sensitive data onto servers. Use functionality such as instance roles and temporary credentials to grant permissions to servers.

Supported data sources

cloud-seed can currently fetch user data from the metadata servers of:

  • Alibaba Cloud
  • AWS
  • Exoscale
  • Google Cloud
  • OpenStack
  • Oracle Cloud
  • Vultr

Pull requests are welcome to add additional data sources.

DMI data is used to automatically detect which cloud cloud-seed is running in.

Compatibility with cloud-init

For compatibility with cloud-init, YAML is supported, the #cloud-config shebang is accepted, fqdn is accepted as an alias for hostname, and write_files is accepted as an alias for files. The encoding field for each file also supports cloud-init values b64, gz+base64, gzip+base64, gz+b64 and gzip+b64 as aliases for the corresponding cloud-seed values.

All other cloud-init directives are ignored. If cloud-init compatibility is not required, it is recommended to use cloud-seed's JSON format described above.

Platform support

Linux, Windows, macOS, FreeBSD, Android (21+). Other platforms may work.

License

MIT or Apache 2.0, at your option.