/apm-game

Let the APM games begin! Can your players find your nasty performance issues?

Primary LanguageJavaScript

APM Game

Let the APM games begin! Have you ever had the evenings, where you wanted to play with your friends and families some rounds of identifying performance issues with AppDynamics, but missed the game to do that? -- Probably not, but here is APM Games anyhow, with some further use cases:

  • Build interactive demos where your audience can interact with AppDynamics themselves following your guidance.
  • Heavily inspired by demosim, you can build custom demos, if DemoMonkey does not fit all your needs.
  • Create custom screenshots for your presentations, articles, mails, ...
  • Reconstruct issues to have a reproducible environment.
  • Test configurations before bringing them into your customer's environment.
  • Show customers after a TDD, how their future implementation of AppDynamics might look like.

Features

APM Game implements and uses the following features of AppDynamics:

  • APM
    • java
    • nodejs
    • php
    • .Net Core (dotnet)
  • User Experience
    • BRUM
  • Databases
    • mySQL
  • Analytics
    • Transaction
    • Log
    • Browser Records
    • Browser Sessions
  • Infrastructure
    • Server Visibility
    • Network Visibility

Installation

Clone this project:

git clone git@github.com:AppDynamics/apm-game.git

Install all prerequisites:

Agent prerequisites (from download.appdynamics.com or the "Getting Started Wizard" of your controller):

  • PHP Agent
  • Java Agent
  • DB Agent
  • .NET Agent for Linux

Copy agent files into the directories for the java and PHP node:

cp <PATH>/appdynamics-php-agent-x64-linux-<VERSION>.tar.bz2 nodes/php
mv <PATH>/appdynamics-php-agent-x64-linux-<VERSION>.tar.bz2 infrastructure/phpproxy
mv <PATH>/AppServerAgent-<VERSION>.zip nodes/java
mv <PATH>/db-agent-<VERSION>.zip infrastructure/dbmon
mv <PATH>/AppDynamics-DotNetCore-linux-x64-<VERSION>.zip nodes/dotnetcore

Note: The agent for nodejs is installed automatically.

The docker images for the machine and network visibility agent are downloaded from docker store. You need to login to download these:

docker login

Setup an AppDynamics Platform or use your AppDynamics SaaS controller.

Usage

  1. Configure your game using YAML. You can look into the file config.yml to get started. Read the Configuration section below to learn how you can describe your application environment.

  2. Execute the run.sh

  3. Wait for data in AppDynamics

Tutorial

Before you go into the details of configuring your own environment, you can walk through this tutorial to get a quick understanding. Before you continue, follow the steps above to setup APM Game and your AppDynamics Platform.

Step 1 - Two java tiers

Open the file tutorial/step-1.yml with your favourite editor and provide the credentials to your AppDynamics controller and events service. Next, execute ./run.sh tutorial/step-1.yml. This script will first of all build all the docker files you need to run APM Game, next it will spin up some docker containers, check docker ps to see them coming up:

CONTAINER ID        IMAGE                            COMMAND                  CREATED             STATUS              PORTS               NAMES
e961843209b3        apm-game/machine                 "/usr/local/bin/mach…"   3 minutes ago       Up 3 minutes                            step-1-machine-agent
f0fb544fbb10        apm-game/java                    "/app/java.sh"           3 minutes ago       Up 3 minutes        80/tcp              step-1-frontend
7cb66fd11dee        apm-game/puppeteer               "dumb-init -- node i…"   3 minutes ago       Up 3 minutes                            step-1-browser-0
ffa7b6f2cc7d        apm-game/netviz                  "./start.sh"             3 minutes ago       Up 3 minutes                            step-1-netviz-agent
40a3487c47e6        apm-game/java                    "/app/java.sh"           3 minutes ago       Up 3 minutes        80/tcp              step-1-backend

As you can see, there are two java nodes, but also the machine agent, the network visibility agent and one container running "puppeteer" which is a headless version of Google Chrome.

If you now look into the AppDynamics UI, you will see your Business Application with two nodes (frontend, backend) and one BT (/list). You should also see both tiers on the "Network Dashboard". And finally if you enable Analytics for your application, you should get transaction events.

Compare what you see with the step-1.yml file: There are two java services (frontend, backend). The frontend has a /list http endpoint, that calls the /list/items endpoint on the backend and also calls a cache for 128ms. The endpoint /list/items just idles 1024ms before it returns a response. Below you have a single loader using puppeteer that continously calls the /list endpoint on the frontend.

Next, add a few additional functionality to your configuration. First stop the run.sh by pressing Ctrl+C, next add the following changes to your file:

  • Add another endpoint /get to the frontend tier, that calls /get/item on the backend. Do not define /get/item on the backend tier!
  • Add port: 3000 to the frontend tier as option
  • Add the new endpoint http://frontend/get to the list of urls for the loader. Also increase the count of loaders to 3.

Now, execute ./run.sh tutorial/step-1.yml again. You should see the following results:

  • A new BT /get, that has a lot of error transactions
  • You can open http://localhost:3000/get in your browser and add manual load to your frontend tier.
  • The load should increase by a factor of 3

Step 2 - 3 Tiers and a backend

Open tutorial/step2.yml and compare the content with step1. As you can see the setup is similar, but a few things are new:

  • A third tier new-frontend that is almost identical to the frontend, except that it has a different type (nodejs)
  • A fourth service called "storage", that runs without an agent.

Spin up your environment by executing ./run.sh tutorial/step-2.yml. Wait a few minutes and your flowmap should be updated: As expected, there is a new tier and a http remote service.

Again, add some additional functionality to your configuration:

  • Add another tier legacy-frontend of type php that is identical to the other two frontends.
  • Add the option aliases: [backup, www.appdynamics.com] to the storage service. Next, add calls to http://backup/item and http://www.appdynamics.com/item to any of the endpoints.
  • Use DemoMonkey to change the type of the storage remote service to fileserver

After restarting your environment, you should see an additional tier of type PHP and two more external services. Also your storage is now a fileserver!

Step 3 - Errors & Randomness

Open tutorial/step3.yml and compare the content with step2. Again, there are a few differences:

  • The backend has an error call with a probability of 10%
  • There is a new-backend and the new-frontend randomly either calls this or the old backend.

When you spin up the environment you should see those two changes on your flowmap and also in the snapshots.

Now, you are able to create performance issues with APM Game. But, again, we can add a few more things:

  • Edit storage and add the following after the sleep,500:
- call: sleep,8000
  schedule: "* */2 * * * * *"
  • Edit backend and replace - http://storage/item with:
- call: http://storage/item
  remoteTimeout: 2000

After an environment restart you should see error transactions for the communication between backend and storage. The error description should be a timeout on the backend

Step 4 - EUM

Again, compare step-3.yml with step-4.yml. This time, there is only one difference: At the top is a section for EUM configuration. Add the missing EUM key and start the environment. After a few minutes you should see some data in EUM.

Since all your issues are in the backend, the data in EUM is not very interesting, yet. To change this, add the following service to step-4.yml:

cdn:
  type: nodejs
  agent: no  
  endpoints:
    http:
      /logo.png:
        - sleep,1000      
      /script.js:
        - sleep,2000

Also, add image,http://cdn/logo.png to the /list of frontend. When you now restart your environment, EUM will report issues with the picture. In the same way you can use script,http://cdn/script.js. Finally you can use ajax,http://backend/list/items to add an ajax call to the backend.

Step 5 - Analytics

This time we start with a completely new environment. Have a look into step-5.yml. As you can see, it is rather simple, but there are a few new things:

  • Custom data is added to snapshots & analytics using the data. This command comes with a lot of flexibility, so see how the different examples behave. Note, that chance can only be used with nodejs!
  • Besides data there are some log statements, that write some entries into a application log file. You can configure Log Analytics to pick up those log files from /logs/<nodeType>/<nodeName>/node.log.
  • The loaders section has multiple steps, that look like a journey. After running the environment for a view minutes, check the EUM sessions screen, you will see that every loop is recorded as a separate session, following those steps!

Play around with the data collectors. Here are a few things you can try it:

  • Go to the chance.js homepage and try out a few of the possibilities using the nodejs frontend.
  • Add some additional log statements using different severity levels (info, debug, error, warn, trace)
  • Add an additional http endpoint to the frontend (e.g. /orderConfirmation) and add it to the loaders' loop

With those capabilities you are already able to build some good environments with business data. Of course, you might want to have some relationship in your data, for example certain products have a certain price or a certain credit card type is processed very slow. To anticipate all this possibilities, you can load a custom script from scripts/ using the script command. Review the sample.js and try to come up with your own example.

Step 6 - Run with Dynamic Attach

With step-6.yml we go back to the simple environment we had at the beginning: a frontend and a backend java application. The difference is, that we start the environment without machine agent and the application agents are also disabled, because we want to use Dynamic Attach to instrument the applications after startup. Just run ./run.sh tutorial/step-6.yml and wait until the load is generated. Next, clone the Dynamic-Agent-MA repository, edit the controller.env to contain your credentials and add the following two lines to the end:

TIER_NAME_FROM=CONTAINER_LABEL
TIER_NAME_FROM_VALUE=service-name

Execute the run.sh and wait a few minutes, then check with your AppDynamics Controller, that the agents have been attached and that you can see your application being instrumented.

Step 7 - Extend & Contribute

After you finished step 5, you should be able to leverage all the capabilities provided by APM Game. At same point, you will find some bugs or miss some features. In this case, open an issue at https://github.com/Appdynamics/apm-game/issues. If you are able to fix the bug yourself or implement the feature, please do so and provide a pull request. To get you started, follow these instructions to add some simple hello world features:

  • Open the file nodes/nodejs/index.js: this is the implementation of the nodejs agent.
  • Look for a line, that says resolve(`${call} is not supported`): above you see all the commands, that are available for nodejs.
  • Add the following else if block above the last else:
...
else if(call.startsWith('hello')) {
    resolve('hello world')
}
...
  • Add a - hello command to one of your endpoints. Use a node that has a published port
  • Run your environment
  • Call the endpoint using your browser on the configured port. The output should contain a hello world

Since you don't want to spin up all the containers to test a feature, there is a run.sh for every node type. Review them to learn how you can setup your development environment.

Configuration

Configurations for APM Game are given in YAML files. By default the run.sh looks for a file called config.yml, but you can provide another file as parameter:

./run.sh configs/myconfig.yml

The configuration has 5 top-level sections: global, apm, services, loaders and chaos:

global:
  ...
apm:
  ...
services:
  ...
loaders:
  ...
chaos:
  ...

Global

The global section is optional. It can be used to enable or disable certain features of the simulation:

  • machine: Run with AppDynamics machine agent (default: true).
  • netviz: Run with AppDynamics network visibility agent (default: true).
  • dbmon: Run with AppDynamics database monitoring agent (default: 'maybe' -- it is only run if a database node is available).
  • loaders: Run with containers that generate load (default: true).
  • services: Run with containers that provide services (default: true).
  • chaos: Run with containers that generate chaos (default: true).

Most of the time you don't need to change any of these default settings. Use them, if you for example have an independent machine agent running or if you need to test certain functionalities without having to run all other components.

APM

In the apm section you can provide all properties required to configure the AppDynamics agents. For the APM agents you need to configure the following:

  • controller: The URL of your controller, e.g. https://controller.example.com:8090
  • accountName: Your short account name, e.g. customer1
  • accountAccessKey: The access key of your account, eg. ffffffff-ffff-ffff-ffff-fffffffffff
  • applicationName: The name of the business application used by all of your agents, e.g. apm_game

If you want to use analytics capabilities, set the following:

  • eventsService: The URL of your analytics endpoints, e.g. http://analytics.example.com:9080
  • globalAccountName: Your global/long account name, e.g. customer1_ffffffff-ffff-ffff-ffff-ffffffffffff

Finally, you also can setup End User Monitoring in an eum sub-section:

  • appKey: The key of your EUM app, e.g. AD-FFF-FFF-FFF
  • adrumExtUrlHttp: URL to load the adrum-ext.js from via http, e.g. http://cdn.appdynamics.com
  • adrumExtUrlHttps: URL to load the adrum-ext.js from via https, e.g. https://cdn.appdynamics.com
  • beaconUrlHttp: URL for the beacons via http, e.g. http://col.eum-appdynamics.com
  • beaconUrlHttps: URL for the beacons via https, e.g.https://col.eum-appdynamics.com

A final apm configuration looks like following:

apm:
  controller: https://controller.example.com:8090
  accountName: customer1
  accountAccessKey: ffffffff-ffff-ffff-ffff-fffffffffff
  applicationName: apm_game
  eventsService: http://analytics.example.com:9080
  globalAccountName: customer1_ffffffff-ffff-ffff-ffff-ffffffffffff
  eum:
    appKey: 'AD-FFF-FFF-FFF'
    adrumExtUrlHttp: 'http://cdn.appdynamics.com'
    adrumExtUrlHttps: 'https://cdn.appdynamics.com'
    beaconUrlHttp: 'http://col.eum-appdynamics.com'
    beaconUrlHttps: 'https://col.eum-appdynamics.com'

services

In this section you provide all the tiers/nodes and remote services that are contained in your business application. Each sub-section is the name of a service. This name will be used to name the docker image as well as the tier within AppDynamics. Since the services use these names also to communicate with each other the name should be a valid hostname, e.g. frontend, backend-v2 or payment-provider-1, ...

A service can have the following properties:

  • type (required): Define the type of this service. You can currently use the following: java, nodejs, php, dotnet mysql and custom. Hint: Prefer nodejs for agentless services and also if you want to build a big environment, since it comes with the lowest overhead.
  • agent: Set to no or yes to disable or enable the appdynamics agent.
  • count: Set the number of instances, that will be started for this service.
  • port: Set a port which will be exposed to your docker host. So if you run locally, you can access this service via http://localhost:<port>
  • endpoints (java, nodejs, php only): Define multiple endpoints for this service. Read below to learn how to define endpoints.
  • aliases: Provide a list of network name aliases. This is useful for agentless services, that serve as multiple remote services, e.g. multiple payment providers. Hint: You can use any name for an alias, even some existing domain names (e.g. www.appdynamics.com)!
  • labels: You can provide a list of docker labels, that will be visible in the "container" view.
  • options (nodejs only): For nodejs you can set options connectionDelay and lossRate. connectionDelay will force the webserver to wait the given number of milliseconds before it accepts a connection. lossRate is a number between 0 and 1 that defines the probability, if a request is terminated early.
  • disabled: Set this to yes to temporarily disable the service without removing it from the configuration file.
  • databases (mysql only): Define multiple databases, that are created on startup on this database service. Read below to learn how to define databases and tables.
  • image (custom only): If you set the type to custom, you can define any docker image to be used for this service.

Without endpoints and databases a configuration might look like the following:

services:
  frontend:
    type: nodejs
    labels:
      version: v1.0
      dc: FRA
    agent: yes
    port: 3000
    options:
      connectionDelay: 500
    endpoints:
      ...
  backend:
    type: java
    agent: yes
    endpoints:      
      ...
  ext-payment:
    type: nodejs
    agent: no
    aliases: [ext-payment-1, ext-payment-2]
    endpoints:
      ...
  backend-db:
    type: mysql
    databases:
      ...

Endpoints

Services with type nodejs, php or java can serve multiple endpoints via different protocol (right now only http... ). Below each protocol you can list the names of the endpoints with a sequence of calls:

...
  frontend:
    ...
    endpoints:
      http:
        /login:
          ...
        /addtocart:
          ...
        /checkout:
          ...        

The call sequences below each endpoint are the simulated logic of your business application. Since the order of elements matters, you provide them in YAML list notation:

...
        /checkout:
          - http://backend/cart/checkout
          - sleep,200
          - call: error,500,Aborted
            probability: 0.1
            schedule: "* */2 * * * * *"
            - call: data
              id: price
              type: int
              value: [32,16,8]
          - ...
...

The example above first executes a call to another service, called backend, then sleeps for 200 milliseconds and afterwards an error is thrown with a probability of 10%. Here is a list of supported commands and modifiers:

  • Commands are like lines of code, that are executed by a service. You can call as many of them as you like to define an endpoint.

    • http://<service>/<endpoint>: Call another service via http.
    • sql://<service>/<database>?query=<query> (php & java): Call a database service via SQL.
    • sleep,<timeout>: Stop processing for <timeout> milliseconds. Note, that the call graph will contain a language-specific sleep method, so use it especially with agent-less services and prefer slow for those having an agent.
    • slow,<timeout>: Slow down processing by around <timeout> milliseconds. The timeout is not accurate, so it will most of the time longer than the value given in <timeout>.
    • cache,<timeout>: Call a remote service of type cache. For Java this is ehcache2, for PHP and nodejs there is no real cache implementation, but they will tell you that a redis service was called.
    • error,<code>,<message>: Throw an error with HTTP code <code> and message <message>.
    • log,<level>,<message> (java & nodejs): Write a log message. The messages are stored on a shared volume, accessible at /logs/<nodeType>/<nodeName>/node.log
    • image,<URL>: Put an <img src=<URL>> on the result page. This can be used to slow down end user responses.
    • script,<URL>: Put an <script src=<URL>> on the result page. This can be used to delay the document building time.
    • ajax,<URL>: Put an ajax call to in the result page.
    • data (java & nodejs): This is a special command to add data to a snapshot/transaction analytics. It is only available in object notation and has the following attributes:
      • call: Always set to data
      • id: A unique identifier for this data point. This will be used by AppDynamics in the snapshot/analytics view.
      • type (java only): The type of this data point. Possible values are string, int and double
      • value: The value of the data point. Use a single value or an array to add some randomness.
      • chance (nodejs only): Random fake data using chance.js
    • code,<script> (nodejs): Execute a custom script from the scripts folder. See the example provided to learn more.
  • Modifiers change the behaviour of a call. To use them provide an object notation for your call. As you can see in the example above, you can combine modifiers as you like:

    • probability: <value>: Execute this line of code with the probability of <value>. Provide a float for <value> between 0 and 1, where 0 means 0% and 1 means 100%.
    • schedule: <cron> (only nodejs): Execute this line of code, only if the given <cron> expression is matched. If you provide an expression with five fields, it is assumed that you start with minutes. If you provide seven fields, the first is assumed to be seconds and the last is assumed to be years.
    • catchExceptions: <true|false>: Use this for http requests to throw an exception if the downstream call failed, instead of ignoring them (default: true)
    • remoteTimeout: <value>: Use this for http requests to define a timeout in milliseconds. After this time the connection will be terminated and an exception will be thrown.

Databases

Services with type mysql can be setup with multiple databases and tables. Provide a list of databases with a sub-list of tables and columns to have them generated at start:

...
  backend-db:
    type: mysql
    databases:
      shop:
        carts: [id, name, value]
        customers: [id, name, email]
      fulfilment:
        ...

For the example above two databases are generated, shop and fulfilment, and within the shop database there are two tables, carts and customers with the given columns. All columns are generated as VARCHAR(255), except id which is used as primary key.

Loaders

In this section you can provide multiple load giving services. The name you provide is used to name the docker container:

loaders:
  browser:
    type: puppeteer
    wait: 15
    count: 5
    urls:
      - http://frontend/addtocart
      - http://frontend/addtocart
      - http://frontend/checkout

Currently APM Game comes with three loaders: curl, puppeteer and phantomjs. Both, puppeteer and phantomjs are headless browser, where puppeteer should be preferred. phantomjs might be deprecated in a later release.

All loaders take a list of urls and call them in the sequence given endlessly. By providing a count you can increase the number of docker instances that send load against your services. The wait parameter is used to delay the start of the load, so all your other services can be setup properly. For loaders of type phantomjs you can additionally provide a parameter adrumTimeout, that terminates a request waiting for the adrum beacon after the given time.

Note, that for each sequence of requests a unique_session_id is generated and send to the services as GET/POST parameter. A node.js frontend is picking up this value automatically for snapshots and analytics data. For PHP and Java you need to configure the data collectors in the UI.

Chaos

In this section you can provide multiple chaos generators. The name you provide is used to name the docker container:

chaos:
  pause-frontend:
    type: pumba
    interval: 1m
    target: [frontend, backend]
    command: pause
    duration: 5s  

Those chaos generators inject issues into your network or your containers based on the provided parameters. Right now, there is a single type of generator, based on pumba. To configure a chaos generator, provide the following parameters:

  • type: pumba: Set the type of the chaos generator. Currently this is optional, since there is only a single chaos generator type.
  • interval: <time>: Set the time interval which is used to periodically generate chaos. Use with optional unit suffix: 'ms/s/m/h'
  • duration: <time>: Set the duration of the chaos generation. Use with optional unit suffix: 'ms/s/m/h'
  • target: <array|string>`: Set the target service or services by name.
  • command: <pause|netem-delay|netem-loss>: Provide one of the commands supported. Depending on the command you need to provide further arguments.
  • probability: <value> (netem-loss only): Set the probability for package loss.
  • time: <value> (netem-delay only): Set the delay time in milliseconds.

Develop

If you'd like to contribute to this project, feel free to provide issues or pull requests.

If you'd like to advance a node type, use the run.sh provided for each, to have a local version running. By default it will use the file frontend.json and appdynamics.json as configuration.

The easiest way to add functionality, is adding a new "command". For example, if you want to add a noop command for nodejs add the following code in the if-elseif-else-block:

...
} else if (call.startsWith('noop')) {  
  resolve('')
}
...

If you want to add a new node type, create a new folder with a Dockerfile and all the other things that you might require. Check the nodejs, php, java or mysql implementation to get some insights.