/awesome-monitoring

Awesome monitoring page

GNU General Public License v3.0GPL-3.0

Awesome Monitoring

A list of awesome tools, services and other shiny things for monitoring and analyze everything in your app.

Application Performance monitoring (APM)

Products

  • NewRelic - complex service for both application and infrastructure monitoring
  • DataDog - complex service for both application and infrastructure monitoring
  • OverOps - OverOps provides Automated Root Cause (ARC) analysis to reduce the time to identify and fix critical production application errors.
  • AppDynamics - Business and application performance monitoring.
  • Java Melody - Monitoring of JavaEE applications.
  • Kamon - Monitoring applications running on the JVM.
  • SPM - solutions for performance monitoring.
  • OverOps - Know when and why code breaks in production.
  • Instrumental - Real-time application and server monitoring
  • Zipkin - A distributed tracing system.
  • PinPoint - Open source APM tool for large-scale distributed systems written in java.

Combined Solutions

Modern Monitoring tools for devops, container, serverless

Collect

Collect and transport, metrics and events.

  • OpenCensus - OpenCensus is a set of libraries for various languages that allow you to collect application metrics and distributed traces, then transfer the data to a backend of your choice in real time.
  • Opentracing - Vendor-neutral APIs and instrumentation for distributed tracing
  • Opentelemetry - OpenTelemetry is made up of an integrated set of APIs and libraries as well as a collection mechanism via an agent and collector.
  • Micrometer - Micrometer provides a simple facade over the instrumentation clients for the most popular monitoring systems, allowing you to instrument your JVM-based application code without vendor lock-in. Think SLF4J, but for metrics.
  • inspectIT Ocelot - Java agent for collecting performance, tracing and business data.
  • telegraf github - TICK stack, The plugin-driven server agent for collecting & reporting metrics.
  • node-exporter github - Prometheus stack, Exporter for machine metrics.
  • beats github - Lightweight shippers for Elasticsearch & Logstash, Elastic stack
  • logstash github - Transport and process your logs, events, or other data, Elastic stack.
  • collectd - The system statistics collection daemon.
  • tcollector github - Data collection framework for OpenTSDB
  • falcon-plus github - An open-source and enterprise-level monitoring system.

Storage

  • influxdata - influxdb, written in go.
  • prometheus - The Prometheus monitoring system and time series database.
  • elasticsearch - Open Source, Distributed, RESTful Search Engine, written in java.
  • OpenTSDB - OpenTSDB, written in java.
  • kairosDB - KairosDB.
  • Graphite - More, than a time series database. And so awesome using with Grafana.

Graphing

Alerting

Tranditional Monitoring tools.

  • Nagios - Computer system, network and infrastructure monitoring software application.
  • Zabbix - real-time monitoring of millions of metrics collected from tens of thousands of servers, virtual machines and network devices
  • Ganglia - Ganglia is a scalable distributed monitoring system for high-performance computing systems such as clusters and Grids.
  • Zenoss - Zenoss works with the world's largest companies to ensure their IT services and applications are always on. As the global leader in hybrid IT monitoring and analytics software, Zenoss provides complete visibility for cloud, virtual and physical IT environments.
  • Opserver - Stack Exchange's monitoring system.
  • netdata - netdata is a system for distributed real-time performance and health monitoring. It provides unparalleled insights, in real-time, of everything happening on the system it runs (including applications such as web and database servers), using modern interactive web dashboards.
  • Icinga - Monitoring core with features and REST API. Fork of Nagios.
  • Naemon - Network monitoring tool based on the Nagios 4 core with performance enhancements and new features.
  • Shinken - Shinken is a monitoring framework. It's a Python Nagios® Core total rewrite enhancing flexibility and large environment management.
  • Centreon - IT infrastructure and application monitoring for service performance.
  • Opsview - Based on Nagios 4, Opsview Core is ideal for small IT and test environments.
  • Check_mk - Check_MK is comprehensive IT monitoring solution in the tradition of Nagios. Collection of extensions for Nagios.
  • Sensu - Open source monitoring framework.
  • cabot github - Self-hosted, easily-deployable monitoring and alerts service.
  • server density - Server monitoring.
  • pagerduty - Digital Operations Management.

Logging

  • Brubeck - Statsd-compatible stats aggregator written in C

Uptime

  • Monitive - free for 1 service, checked every 10 minutes with unlimited email & twitter alerts
  • UptimeRobot - free for 50 monitors, checked every 5 minutes

Web Analytics

  • Piwik - open source alternative to Google Analytics
  • Heap Analytics - easy event tracking without coding

Bug Tracking

  • Sentry - Application monitoring, event logging and aggregation.

Anomalies

  • Banshee - real-time anomalies(outliers) detection system for periodic metrics

Alerting

  • Moira - Most powerful alerting system, backed by Graphite.
  • Alerta - Distributed, scaleable and flexible monitoring system.
  • Flapjack - Monitoring notification routing & event processing system.
  • Seyren - An alerting dashboard for Graphite.

Database

Tools for databases

Network

  • Cacti - Web-based network monitoring and graphing tool.
  • Observium - SNMP monitoring for servers and networking devices. Runs on linux.
  • LibreNMS - fork of Observium.
  • Munin - Networked resource monitoring tool.

Operating System

  • Dash - A low-overhead monitoring web dashboard for a GNU/Linux machine.
  • Monit - Small Open Source utility for managing and monitoring Unix systems.

References