codepipeline-artifact-munge

CodePipeline uses a series of "stages", each consisting of one or more "actions", where the outputs of one action can be used as inputs for subsequent actions. These outputs are called "artifacts" and are stored in an S3 bucket as Zip files.

Unfortunately, some actions only ollow a single artifact to be used as input. For example, the CodePipeline Build action only allows one artifact to be used as input. This causes a problem if you want to build an application that combines source files from more than one location.

In other cases, only some of the files in an artifact should be passed to an action.

This project provides a general purpose Lambda that can be inserted into a AWS CodePipeline to allow artifacts to be merged or to allow files from an artifact to be extracted, in each case creating a new artifact.

A Typical Use Case

At ToolTwist we use CodePipeline to deploy into CI, test, staging and production environments running on Amazon ECS.

In earlier times we used the same Docker image in CI, test, staging, etc, and injected the environment-specific config files by copying them into the Docker hosts and mounting them into the application containers as Docker volumes. This was a multi-step process, prone to error, and introducing a larger security-sensitive surface area than we would like.

Using CodePipeline we now generate a new Docker image for each environment, with the environment-specific config files baked in.

Application code comes from Github.
Infrastructure config and scripts come from a carefully protected AWS CodeCommit repo.
A Merge stage in our CodePipeline combines source code and config files before the Build stage, so it can create Docker images with their configurations baked in.
An Extract stage extracts the deployment scripts from the CodeCommit artifact and passes it along to the Deployment stage.

This provides a simpler process and security benefits:

No sensitive information is located in Github, S3 buckets, or other locations that may be accidentally made public.
Docker images remain safely hidden away within the AWS environment.
'Baking-in' the configs removes the complexities of mounting volumes in an ECS environment.
In production, the Docker containers many levels deep, behind the subnets, security groups, and ECS hosts.

In summary, the deployment process is locked down from start to finish, with minimal passing around of credentials, and other sensitive information.

Preparing the Lambda Source

This Lambda is available in our public S3 bucket, but we recommend you build it from scratch yourself.

Clone this repository onto your machine
Run npm run build to create CodepipelineArtifactMunge.zip
Upload this Zip file into an S3 bucket that can be accessed from your CodePipelines.
Include the Lambda in your CodePipeline.

Feel free to give your S3 bucket public access if it's more convenient - this project is open source.

Using the Lambda via Cloudformation

If you are using Cloudformation to create you CodePipeline, the following may provide a guide.

First you need to define a role for the lambdas to use. This may include a few more permissions than required - feel free to let me know.

# From https://stelligent.com/2016/02/08/aws-lambda-functions-aws-codepipeline-cloudformation/
CodePipelineLambdaRole:
  Type: AWS::IAM::Role
  Properties:
    RoleName: !Sub "nbt-${EnvironmentName}-${ApplicationName}-CodePipelineLambdaRole"
    Path: /
    AssumeRolePolicyDocument: |
      {
        "Statement": [{
          "Effect": "Allow",
          "Principal": { "Service": [ "lambda.amazonaws.com" ]},
          "Action": [ "sts:AssumeRole" ]
        }]
      }
    Policies:
      - PolicyName: root
        PolicyDocument:
          Version: 2012-10-17
          Statement:
            - Resource:
                - !Sub arn:aws:s3:::${ArtifactBucket}/*
                - !Sub arn:aws:s3:::nbt-${EnvironmentName}-configs
                - !Sub arn:aws:s3:::nbt-${EnvironmentName}-configs/*
              Effect: Allow
              Action:
                - s3:PutObject
                - s3:GetObject
                - s3:GetObjectVersion
                - s3:GetBucketVersioning
            - Resource: "*"
              Effect: Allow
              Action:
                - lambda:*
                - codecommit:GetBranch
                - codecommit:GetCommit
                - codecommit:UploadArchive
                - codecommit:GetUploadArchiveStatus
                - codecommit:CancelUploadArchive
                - codebuild:StartBuild
                - codebuild:BatchGetBuilds
                - cloudformation:*
                - iam:PassRole
                - codepipeline:PutJobSuccessResult
                - codepipeline:PutJobFailureResult
                - lambda:Listfunctions
            - Resource: "*"
              Effect: Allow
              Action:
                - logs:CreateLogGroup
                - logs:CreateLogStream
                - logs:PutLogEvents

Next, define the MergeLambda and ExtractLambda functions. In this example the CodepipelineArtifactMunge.zip file has been uploaded to an S3 bucket named nbt-lambda. You should replace that with the name of your bucket.

# Lambda to merge artifacts
# Reference: https://docs.aws.amazon.com/AWSCloudFormation/latest/UserGuide/aws-resource-lambda-function.html
MergeLambda:
  Type: AWS::Lambda::Function
  Properties:
    FunctionName: !Sub "nbt-${EnvironmentName}-${ApplicationName}-merge"
    Code:
      S3Bucket: "nbt-lambdas"
      S3Key: "CodepipelineArtifactMunge.zip"
    Role: !GetAtt CodePipelineLambdaRole.Arn
    Description: "Lambda function to merge artifacts in a CodePipeline"
    Timeout: 30
    Handler:  "merge.handler"
    Runtime: "nodejs6.10"
    MemorySize: 128

# Lambda to extract artifacts
ExtractLambda:
  Type: AWS::Lambda::Function
  Properties:
    FunctionName: !Sub "nbt-${EnvironmentName}-${ApplicationName}-extract"
    Code:
      S3Bucket: "nbt-lambdas"
      S3Key: "CodepipelineArtifactMunge.zip"
    Role: !GetAtt CodePipelineLambdaRole.Arn
    Description: "Lambda function to extract from artifacts in a CodePipeline"
    Timeout: 30
    Handler:  "extract.handler"
    Runtime: "nodejs6.10"
    MemorySize: 128

Inputs to MergeLambda:

Artifact 1 - a zip file in the S3 bucket
Artifact 2 - also a zip file in the S3 bucket
UserParameters - a path where Artifact 2 files will be inserted

The output artifact will be Artifact 1, with the entire contents of Artifact 2 inserted at the specified location.

Inputs to ExtractLambda:

An Artifact - a zip file in the S3 bucket
UserParameters - a comma separated list of file names to extract from the artifact.

The output will be an artifact containing only the specified files.

Finally, to use these functions in your CodePipeline.

Pipeline:
  Type: AWS::CodePipeline::Pipeline
  Properties:
    ...
    Stages:
      ...

      # Merge stage
      # This merges the App and SecureConfig artifacts
      # See https://dzone.com/articles/running-aws-lambda-functions-in-aws-codepipeline-u
      - Name: Merge
        Actions:
          - Name: "app-and-config"
            ActionTypeId:
              Category: Invoke
              Owner: AWS
              Version: 1
              Provider: Lambda
            Configuration:
              FunctionName: !Ref MergeLambda
              UserParameters: "secure-config"
            InputArtifacts:
              - Name: App
              - Name: SecureConfig
            OutputArtifacts:
              - Name: AppAndConfig
            RunOrder: 1

      ...

      # Extract stage
      # This extracts the deployment template from the SecureConfig artifact
      # See https://dzone.com/articles/running-aws-lambda-functions-in-aws-codepipeline-u
      - Name: Extract
        Actions:
          - Name: "deploy-template"
            ActionTypeId:
              Category: Invoke
              Owner: AWS
              Version: 1
              Provider: Lambda
            Configuration:
              FunctionName: !Ref ExtractLambda
              UserParameters: "Deploy/service.cf"
            InputArtifacts:
              - Name: SecureConfig
            OutputArtifacts:
              - Name: DeployTemplate2
            RunOrder: 1

Deploying the Lambda by hand

The Lambda functions can be defined by hand by loading the Jar file directoly into the Lambda Management Console. Make sure you define the Runtime and Handler as shown below. For the Merge function use merge.handler.

Once you've uploaded the Zip file, don't forget the publish the lambda function (under the Actions menu).

Once the lamba is published it can be included into a CodePipeline by pressing Edit and adding an action.

Contributing