keikoproj/active-monitor

Addl metric indicating start and end times of latest run for each healthcheck

davemasselink opened this issue · 1 comment

Is your feature request related to a problem? Please describe.
First discussed in this Slack thread: https://intuit-teams.slack.com/archives/GBLA5J9DH/p1579889119031100

Describe the solution you'd like
The controller should expose the start and end times of the latest run of each healthcheck as a metric. This would help with debugging and diagnosing cluster issues, since it would be more obvious when work or traffic is caused by a healthcheck rather than by organic activity.

Currently this can be determined only by looking at the healthcheck status and, even then, only completion times are tracked.

Yet to discuss
Are timestamps the best piece of data to track? Alternatively, would an "ongoing" boolean and a "lastRunDuration" float be easier to make sense of?

Handled in #35. Thanks @RaviHari!