This is a simple example of how you can achieve:
- version control of your recipes
- versioning to include image hash and commit id
- build of associated container and
- push to a storage endpoint
for a reproducible build workflow.
Why should this be managed via Github?
Github, by way of easy integration with continuous integration, is an easy way to have a workflow set up where multiple people can collaborate on a container recipe, the recipe can be tested (with whatever testing you need), discussed in pull requests, and then finally pushed to your storage of choice or Singularity Registry. Importantly, you don't need to give your entire team manager permissions to the registry. An encrypted credential that only is accessible to administrators can do the push upon merge of a discussed change.
Why should I use this instead of a service?
You could use a remote builder, but if you do the build in a continuous integration service you get complete control over it. This means everything from the version of Singularity to use, to the tests that you run for your container. You have a lot more freedom in the rate of building, and organization of your repository, because it's you that writes the configuration. Although the default would work for most, you can edit the build, setup, and circle configuration file in the .circleci folder to fit your needs.
Add your Singularity recipes to this repository, and edit the build commands in the build.sh file. This is where you can specify endpoints (Singularity Registry, Dropbox, Google Storage, AWS) along with container names (the uri) and tag. You can build as many recipes as you like, just add another line!
# recipe relative to repository base
- /bin/bash .circleci/build.sh Singularity
- /bin/bash .circleci/build.sh --uri collection/container --tag tacos --cli google-storage Singularity
- /bin/bash .circleci/build.sh --uri collection/container --cli google-drive Singularity
- /bin/bash .circleci/build.sh --uri collection/container --cli globus Singularity
- /bin/bash .circleci/build.sh --uri collection/container --cli registry Singularity
For each client that you use, required environment variables (e.g., credentials to push, or interact with the API) must be defined in the (encrypted) Travis environment. To know what variables to define, along with usage for the various clients, see the client specific pages
You can clone and tweak, but it's easiest likely to get started with our example files and edit them as you need.
We will be working with Circle CI. You can see example builds for this repository here.
- Circle offers scheduled builds.
- CircleCI also offers GPU Builders if you want/need that sort of thing.
- If you don't want to use the sregistry to push to Google Storage, Drive, Globus, Dropbox, or your personal Singularity Registry, CircleCI will upload your artifacts directly to your deployment location of choice.
For the example here, we have a single recipe named "Singularity" that is provided as an input argument to the build script. You could add another recipe, and then of course call the build to happen more than once. The build script will name the image based on the recipe, and you of course can change this. Just write the path to it (relative to the repository base) in your .circleci/config.yml.
The recipe uses the Singularity Orb to install your chosen version of Singularity. If you want to change the version, just adjust the parameter here:
- build:
name: "Singularity 3.2.1 - Python 3"
singularity: 3.2.1
singularity-3: true
The basic steps to setup the build are the following:
We also install the sregistry client that allows you to issue a command like "sregistry push ..." to upload a finished image to one of your cloud / storage endpoints. In this basic example, the push won't happen, and you will just build an image using the CI.
The basic steps for the build are the following:
- Running build.sh with no inputs will default to a recipe called "Singularity" in the base of the repository. You can provide an argument to point to a different recipe path, always relative to the base of your repository.
- If you want to define a particular unique resource identifier for a finished container (to be uploaded to your storage endpoint) you can do that with
--uri collection/container
. If you don't define one, a robot name will be generated. - You can add
--uri
to specify a custom name, and this can include the tag, OR you can specify--tag
to go along with a name without one. It depends on which is easier for you. - If you add
--cli
then this is telling the build script that you have defined the needed environment variables for your client of choice and you want successful builds to be pushed to your storage endpoint. See here for a list of current client endpoints, or roll your own!
See the config.yml for examples of this build.sh command (commented out). If there is some cloud service that you'd like that is not provided, please open an issue.
If you go to your Circle Dashboard you can usually select a Github organization (or user) and then the repository, and then click the toggle button to activate it to build on commit --> push.
That's it for the basic setup! At this point, you will have a continuous integration service that will build your container from a recipe each time that you push. The next step is figuring out where you want to put the finished image(s), and we will walk through this in more detail.
Once the image is built, where can you put it? An easy answer is to use the Singularity Global Client and choose one of the many clients to add a final step to push the image. You then use the same client to pull the container from your host. Once you've decided which endpoints you want to push to, you will need to:
- Save the credentials / other environment variables that your client needs (see the client settings page linked in the sregistry docs above) to your repository settings where they will be encrypted and in the environment.
- Add a line to your .circleci/config.yml to do an sregistry push action to the endpoint(s) of choice. We have provided some (commented out) examples to get you started.
Remember that you can also take advantage of deployment options that CircleCI offers, or do any other action that you might want for the reproducibility or archive of metadata of your builds. We save the build folder as an artifact to the repository, but the containers might be too big to do this.
- This setup can work as an analysis node as well! Try setting up a scheduled build to build a container that processes some information feed, and you have a regularly scheduled task.
- run builds in parallel and test different building environments. You could try building the "same" container across different machine types and see if you really do get the same thing :)
- You can also do other sanity checks like testing if the container runs as you would expect, etc.