/jmxtrans

Visualizing JVM metrics with JMXTrans -> InfluxDB -> Grafana

Primary LanguageShell

This Repo Contains JMXTrans and InfluxDB related content

JMXTrans:

jmxtrans is a tool which allows you to connect to any number of Java Virtual Machines (JVMs) and query them for their attributes without writing a single line of Java code. The attributes are exported from the JVM via Java Management Extensions (JMX). Most Java applications have made their statistics available via this protocol and it is possible to add this to any codebase without a lot of effort. If you use the SpringFramework for your code, it can be as easy as just adding a couple of annotations to a Java class file.

High Level Architecture:



            Client                                       
          +---------+                    +--------------+              
          |  JVM    |                    |              |              +-------------+               +-------------+  
          |  Tomcat |                    |              |              |             |               |             |
          |  Solr   |    queries         |   jmxtrans   |              |             |               |             |  
          |  AMQ    +------------------->|   server     +------------->|  influxDB   |+------------->|   Grafana   |
          |  Camel  |                    |              |              |             |               |             |  
          |  Kafka  |                    |              |              |             |               |             |
          |  Sys    |                    |              |              +-------------+               +-------------+  
          +---------+                    +--------------+                                            

For more info : https://github.com/jmxtrans/jmxtrans/wiki

Influxdb:

InfluxDB is an open-source time series database developed by InfluxData. It is written in Go and optimized for fast, high-availability storage and retrieval of time series data in fields such as operations monitoring, application metrics, Internet of Things sensor data, and real-time analytics.

For more info : https://docs.influxdata.com/influxdb/v1.2/

Installation Steps, Manually:

influxDB : 1.2.2
jmxtrans : 264
Grafana  : 4.1.2

Check-out jmxtrans repo:

git clone git@github.com:shahsaifi/jmxtrans.git /opt/jmxtrans

cd /opt/jmxtrans

Repo Tree structure:

jmxtrans
├── bin
│   ├── jmxtrans.sh
│   └── yaml2json.py
├── conf.d
│   ├── config.yaml
│   ├── prod_host_set_1.json
│   └── prod_host_set_2.json
├── lib
│   ├── jmxtrans-264-all.jar
│   └── jmxtrans-all.jar -> jmxtrans-264-all.jar
├── logs
│   └── jmxtrans.log
└── README.md

influxdb:

  1. Download & Install influxDB:
wget https://dl.influxdata.com/influxdb/releases/influxdb-1.2.2.x86_64.rpm
sudo yum localinstall influxdb-1.2.2.x86_64.rpm
  1. Update /etc/influxdb/influxdb.conf with influxdb.conf in this repo:
sudo cp -avp /etc/influxdb/influxdb.conf /etc/influxdb/orig_influxdb.conf_$(date +'%Y%m%d%H%M')
cat influxdb.conf > /etc/influxdb/influxdb.conf
  1. Start & Enable influxDB:
systemctl enable influxdb
systemctl start influxdb
  1. Check influxDB status:
systemctl status influxdb
  1. Login to influxdb:
[root@jmxtrans01 ~]# influx
Connected to http://localhost:8086 version 1.2.2
InfluxDB shell version: 1.2.2
>
  1. Create jmx database with 52 weeks retention:
> create database jmx with duration 52w
  1. Check retention:
> show retention policies on jmx
  1. Create db user:
> CREATE USER jmx WITH PASSWORD 'jmx' WITH ALL PRIVILEGES ;
  1. Verify User:
> show users
  1. Grant full privileges on db:
GRANT ALL ON jmxDB to jmx
  1. Verify privileges:
> show grants for jmx

jmxtrans

Hosts are distinguished based on the metrics and YAML generates multiple giant JSON files so it is highly recommended to use config.yaml to update/modify object definitions. DO NOT FORGET TO CHECK-IN CHANGES TO REPO..

  1. Set JAVA_HOME environment variable after installing JDK.
cd ; wget --no-cookies --no-check-certificate --header "Cookie: gpw_e24=http%3A%2F%2Fwww.oracle.com%2F; oraclelicense=accept-securebackup-cookie" "http://download.oracle.com/otn-pub/java/jdk/8u60-b27/jdk-8u60-linux-x64.rpm"
sudo yum localinstall jdk-8u60-linux-x64.rpm
rm ~/jdk-8u60-linux-x64.rpm
echo "export JAVA_HOME=/usr/java/jdk1.8.0_121" >> /etc/environment
  1. change path to /opt/jmxtrans
cd /opt/jmxtrans
  1. Fetch jmxtrans jar:
wget http://central.maven.org/maven2/org/jmxtrans/jmxtrans/264/jmxtrans-264-all.jar -P lib
  1. Verify jar in lib dir:
[root@jmxtrans01 jmxtrans]# pwd
/opt/jmxtrans

[root@jmxtrans01 jmxtrans]# ls -l lib/
total 32600
-rw-r--r-- 1 root root 33380575 Mar 30 04:55 jmxtrans-264-all.jar
lrwxrwxrwx 1 root root       20 Apr 21 00:58 jmxtrans-all.jar -> jmxtrans-264-all.jar
  1. Update repo content:
git pull origin <branch_name>
  1. start jmxtrans:
/opt/jmxtrans/bin/jmxtrans.sh start
  1. tail logs to see progress:
tail -f /opt/jmxtrans/logs/jmxtrans.log
  1. Once you are confirmed, jmxtrans is running with current configs placed conf.d with logs, move further to set-up Grafana UI dashboard to add data-source.

Installation Steps, with install.sh:

  1. After check-out this repo in /opt
[root@jmxtrans01 jmxtrans]# pwd
/opt/jmxtrans
  1. Run install.sh:
[root@jmxtrans01 /opt/jmxtrans]# ./install.sh

Adding/updating jmx metrics :

  1. Refer jmxtrans/conf.d/config.yaml which contains thoroughly commented YAML definitions about metrics e.g.:
- name: classloading
  obj: "java.lang:type=ClassLoading"
  resultAlias: "classloading"
  typeNames:
      - type
  attr:
      - "TotalLoadedClassCount"
      - "LoadedClassCount"
      - "UnloadedClassCount"
  1. Read comments carefully to understand the parameters and connect to jmxconsole with host port to relate objects with attributes:
jconsole dc1-host01.shah.int:1099
  1. It is always better to validate YAML with linter like yamllint (Warnings can be ignored):
⇒ pip install yamllint

⇒  yamllint conf.d/config.yaml
conf.d/config.yaml
  85:30     warning  too few spaces before comment  (comments)
  85:31     warning  missing starting space in comment  (comments)
  86:33     warning  too few spaces before comment  (comments)
  1. After modifying config.yaml and linting, re-generate JSON files with:
[root@jmxtrans01 conf.d]# pwd
/opt/jmxtrans/conf.d

[root@jmxtrans01 conf.d]# ../bin/yaml2json.py --file config.yaml
  1. check-in updates to git:
git commit -a -m 'updating classloading object'
git push origin <branch_name>
  1. Restart jmxtrans on server:
/opt/jmxtrans/bin/jmxtrans.sh restart
  1. Tail log file:
tail -f /opt/jmxtrans/logs/jmxtrans.log
  1. Log-in to influxDB or use rest API to verify new metrics data:
# curl -v -G 'http://jmxtrans01.shah.com:8086/query?pretty=true' --data-urlencode "db=jmxDB" --data-urlencode "q=select * from "amq" where "hostname" = 'dc1-host01_shah_com' limit 1"
* About to connect() to jmxtrans01.shah.com port 8086 (#0)
*   Trying 10.20.23.74...
* Connected to jmxtrans01.shah.com (10.20.23.74) port 8086 (#0)
> GET /query?pretty=true&db=jmxDB&q=select%20%2A%20from%20amq%20where%20hostname%20%3D%20%27dc1-host01_shah_com%27%20limit%201 HTTP/1.1
> User-Agent: curl/7.29.0
> Host: jmxtrans01.shah.com:8086
> Accept: */*
>
< HTTP/1.1 200 OK
< Connection: close
< Content-Type: application/json
< Request-Id: f4507314-28fb-11e7-badb-000000000000
< X-Influxdb-Version: 1.2.2
< Date: Mon, 24 Apr 2017 14:40:25 GMT
< Transfer-Encoding: chunked
<
{
    "results": [
        {
            "statement_id": 0,
            "series": [
                {
                    "name": "amq",
                    "columns": [
                        "time",
                        "AverageEnqueueTime",
                        "ConsumerCount",
                        "DequeueCount",
                        "DispatchCount",
                        "EnqueueCount",
                        "MemoryPercentUsage",
                        "ProducerCount",
                        "QueueSize",
                        "_jmx_port",
                        "attributeName",
                        "className",
                        "hostname",
                        "objDomain",
                        "typeName"
                    ],
                    "values": [
                        [
                            "2017-04-21T14:21:11.091Z",
                            null,
                            null,
                            0,
                            null,
                            null,
                            null,
                            null,
                            null,
                            1099,
                            "DequeueCount",
                            "org.apache.activemq.broker.jmx.QueueView",
                            "dc1-host01_shah_com",
                            "org.apache.activemq",
                            "BrokerName=localhost,Type=Queue,Destination=com.apache.fake.queue"
                        ]
                    ]
                }
            ]
        }
    ]
}
* Closing connection 0

Kafka Metrics

After enabling JMX refer conf.d/kafka_config.yaml

InfluxDB : Backup and Restore

InfluxDB has the ability to snapshot an instance at a point-in-time and restore it. All backups are full backups. InfluxDB does not yet support incremental backups. There are two types of data to backup, the metastore and the metrics themselves. The metastore is backed up in its entirety. The metrics are backed up per-database in a separate operation from the metastore backup.

Each database must be backed up individually.

Note: Metastore backups are also included in per-database backups. Following commands were issued from dc1-fluxdb1.shah.com.

  1. Take a backup of the autogen retention policy for the jmx database by using the command:
influxd backup -host 192.168.18.83:8088 -database jmx -retention autogen /metrics/backups/jmxDB
  1. The backup can be restored in two steps. First, the metastore needs to be restored so that InfluxDB knows which databases exist:
influxd restore -metadir /metrics/influxdb/meta jmxDB
  1. Once the metastore has been restored, we can now recover the backed up data. We backed up the jmx database to /metrics/backups/jmxDB, so let’s restore that same dataset. To restore the jmx database:
influxd restore -database jmx -datadir /metrics/influxdb/data jmxDB
  1. Once the backed up data has been recovered, the permissions on the shards may no longer be accurate. To ensure the file permissions are correct, please run:
chown -R influxdb:influxdb /metrics/influxdb
  1. Once the data and metastore are recovered, it’s time to restart the database:
service influxdb restart