Approach

As you'll notice, I chose Kubernetes as the orchestration layer. I also made a few small changes to the apps:

Added health endpoints
Served the apps with gunicorn
Swapped SQLite for MongoDB
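
For illustration, a health endpoint of this kind can be sketched as a bare WSGI callable, which is the interface gunicorn serves. This is a minimal sketch and not the Flask code in the repo: the JSON payload and the `call` helper are assumptions for the example; only the /healthz path comes from this project.

```python
import json
from wsgiref.util import setup_testing_defaults


def app(environ, start_response):
    """Tiny WSGI app exposing /healthz for liveness/readiness probes."""
    if environ.get("PATH_INFO") == "/healthz":
        # Hypothetical payload; a probe only needs the 200 status.
        body = json.dumps({"status": "ok"}).encode("utf-8")
        status = "200 OK"
    else:
        body = json.dumps({"error": "not found"}).encode("utf-8")
        status = "404 Not Found"
    headers = [
        ("Content-Type", "application/json"),
        ("Content-Length", str(len(body))),
    ]
    start_response(status, headers)
    return [body]


def call(path):
    """Invoke the app in-process and return (status line, parsed JSON body)."""
    environ = {}
    setup_testing_defaults(environ)  # fill in the required WSGI keys
    environ["PATH_INFO"] = path
    captured = {}

    def start_response(status, headers):
        captured["status"] = status

    body = b"".join(app(environ, start_response))
    return captured["status"], json.loads(body)


if __name__ == "__main__":
    print(call("/healthz"))
```

Saved as a hypothetical health.py, this would be served with `gunicorn health:app`; a Flask application object is a WSGI callable too and is passed to gunicorn the same way.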

I could have put the SQLite database on a volume and mounted it into each instance, but I chose to swap it out for a distributed database. Admittedly, I deployed this project against a single-node MongoDB instance rather than a replica set. I chose MongoDB over MySQL/PostgreSQL because I already had a few instances deployed to this cluster.

Notes:

You will also see that I temporarily deployed an ingress mounting App B at /auth-svc on the App A ingress for initial deploy testing. I left that file in the repo, commented out rather than deleted, for easier inspection.
I chose not to squash any commits or use any branches, per the readme's note about progress. Many of these commits would have been appropriate for squashing or condensing through branches/PRs.

Proof

You can access the service at https://app-a.halo.sh. As there is no root view in the Flask app, I'd suggest hitting /hello or /healthz, or running your suggested test(s). This is deployed to a small Kubernetes cluster.

Autoscaling

I'm using the Kubernetes HorizontalPodAutoscaler resource to scale both apps. You'll notice App A and App B are configured slightly differently in this respect. App B uses a CPU target percentage, which is one of the two default pod resource metrics; this works in any recent Kubernetes cluster without a custom metrics API. App A uses the custom metrics API to look at requests per second across the ingress, as well as a target CPU usage.
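
To make the two-metric setup on App A concrete, here is a rough sketch of the scaling rule the autoscaler documents: each metric proposes ceil(currentReplicas * currentValue / targetValue), and the largest proposal wins. The target values, the replica bounds, and the function names below are assumptions for illustration, not the values in this repo's manifests.

```python
import math


def desired_replicas(current_replicas, current_value, target_value, tolerance=0.1):
    """Replica count a single metric asks for."""
    ratio = current_value / target_value
    # Close enough to the target: leave the replica count alone.
    if abs(ratio - 1.0) <= tolerance:
        return current_replicas
    return math.ceil(current_replicas * ratio)


def scale_decision(current_replicas, metrics, min_replicas=1, max_replicas=3):
    """Take the largest proposal across all metrics, then clamp to the bounds.

    `metrics` is a list of (current_value, target_value) pairs, for example
    CPU utilization percent and requests per second.
    """
    proposals = [
        desired_replicas(current_replicas, current, target)
        for current, target in metrics
    ]
    return max(min_replicas, min(max_replicas, max(proposals)))


if __name__ == "__main__":
    # Hypothetical load: CPU at 40% against a 50% target,
    # 25 req/s against a 10 req/s target, starting from one replica.
    print(scale_decision(1, [(40, 50), (25, 10)]))
```

With those made-up numbers the CPU metric alone would hold at one replica, while the request-rate metric asks for three, so the combined decision is three.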

Here you can see app-a scaling up during an ab run.

kubectl get deployment app-a --watch

NAME    READY   UP-TO-DATE   AVAILABLE   AGE
app-a   1/1     1            1           8h
app-a   1/3     1            1           8h
app-a   1/3     1            1           8h
app-a   1/3     1            1           8h
app-a   1/3     2            1           8h
app-a   1/3     3            1           8h
app-a   2/3     3            2           8h
app-a   3/3     3            3           8h

CI/CD

Given the allotted time, I did not set up CI/CD for the project, but it'd be straightforward to throw in your CI/CD tool(s) of choice. I also didn't version the apps or their Docker images, as I didn't want to juggle tags without a CI/CD pipeline; that's a recipe for frustration. Yep, for the sake of time I'm using 'latest' image tags and iterating rapidly on my 'production' environment. Don't use 'latest' in prod. Also, version your stuff.

To deploy a new version, your deploy tool would need to get the images to your Docker repo (which hopefully your build/test pipeline has already done), and then the manifest needs to have the tag bumped. Kubernetes will do a rolling deploy as soon as it receives a new desired state. You'll notice I added a health endpoint to both apps and use it to help facilitate zero-downtime state changes.
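
As a sketch of the "bump the tag" step, a deploy tool could rewrite the image line in the manifest text before applying it. This is one possible approach under stated assumptions: the image name, the tag, and the `bump_image_tag` helper are hypothetical, and a real pipeline might well use a templating tool instead.

```python
import re


def bump_image_tag(manifest_text, image, new_tag):
    """Return the manifest with every `image: <image>[:tag]` line pinned to new_tag."""
    pattern = re.compile(
        r"^(?P<prefix>[ \t]*(?:-[ \t]*)?image:[ \t]*)"
        + re.escape(image)
        + r"(?::[\w.-]+)?[ \t]*$",
        re.MULTILINE,
    )
    # Keep the original indentation and key, swap only the tag.
    return pattern.sub(
        lambda m: f"{m.group('prefix')}{image}:{new_tag}", manifest_text
    )


if __name__ == "__main__":
    # Hypothetical manifest fragment and registry name.
    manifest = (
        "containers:\n"
        "  - name: app-a\n"
        "    image: registry.example.com/app-a:latest\n"
    )
    print(bump_image_tag(manifest, "registry.example.com/app-a", "1.0.1"))
```

Applying the rewritten manifest changes the desired state, which is what triggers the rolling deploy described above.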

My manual pipeline consisted of running buildall.sh (included in the repo) which built each app's image and pushed them to my personal docker repo (which is also running in kubernetes, in the same cluster this project was deployed to).

Testing...

Single request

→ curl -X POST -H 'Authorization: mytoken' https://app-a.halo.sh/jobs
Jobs:
Title: Devops
Description: Awesome

Running apache bench against the service.

Ignore the high-ish latency; this cluster is on the other side of the country from me, and it's not the fastest hardware.