Serverless Analytics app for analyzing docker image layers

Primary Language: TypeScript

Serverless Docker Image Layers Analytics

This is a fun project that sets up a serverless analytics app on AWS to analyze the layers of Docker images.

The app is powered by Amazon S3, AWS Glue and Amazon Athena.

How to deploy

Prerequisites

  • Install Node.js LTS (such as 12.x)
  • Configure your AWS account credentials for the AWS CLI (awscli)

Deploy it

# install dependencies & init the CDK toolkit
# only needs to be run once
npm run init

# deploy
npm run deploy

Cleanup

npm run cleanup

How to analyze the data

  • Log in to the AWS console with your account and go to Athena
  • In the Query Editor, select the database docker_image_db, then choose Load Partitions from the context menu of the table layers
  • Click Saved Queries to find the built-in analysis queries, whose names start with Docker_Layers_Stats
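
If you prefer to script the partition-loading step instead of clicking through the console, the following is a minimal sketch of the request you might send to Athena. It assumes that the console's Load Partitions action is equivalent to running MSCK REPAIR TABLE, and it only builds the request object; sending it with an AWS SDK client is left out. The helper name buildLoadPartitionsRequest and the S3 results location are hypothetical, while the field names mirror Athena's StartQueryExecution input.

```typescript
// Local mirror of the fields used from Athena's StartQueryExecution input.
// Declared here so the sketch stays self-contained (no AWS SDK import).
interface AthenaQueryRequest {
  QueryString: string;
  QueryExecutionContext: { Database: string };
  ResultConfiguration: { OutputLocation: string };
}

// Hypothetical helper: builds the request that loads (repairs) the
// partitions of a table, assuming MSCK REPAIR TABLE is what is needed.
function buildLoadPartitionsRequest(
  database: string,
  table: string,
  outputLocation: string
): AthenaQueryRequest {
  // Only accept plain identifiers so nothing unexpected ends up in the SQL.
  const identifier = /^[A-Za-z_][A-Za-z0-9_]*$/;
  for (const name of [database, table]) {
    if (!identifier.test(name)) {
      throw new Error(`Unsupported identifier: ${name}`);
    }
  }
  return {
    QueryString: `MSCK REPAIR TABLE ${table}`,
    QueryExecutionContext: { Database: database },
    ResultConfiguration: { OutputLocation: outputLocation },
  };
}

// The database and table names come from this README;
// the S3 results location is a placeholder you must replace.
const request = buildLoadPartitionsRequest(
  "docker_image_db",
  "layers",
  "s3://example-athena-results/"
);
console.log(JSON.stringify(request, null, 2));
```

The resulting object could then be passed to an Athena client's start-query call; check the SDK version you use for the exact client API.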

Enjoy it!

Disclaimer -- about image layers data

This project provides a small set of pilot layer data for some official Docker images on Docker Hub; the data was fetched by a simple script. This project does NOT guarantee the integrity of the layer data, nor does it promise continuous maintenance.

You are free to use this project and the script, but make sure you do not violate the Docker Hub user agreements.