netdata/netdata-cloud

[Feat]: Flood protection alert notification should give more context

Closed this issue · 6 comments

Problem

When Netdata Cloud triggers a Flood Protection notification, it isn't possible to tell what the main cause might be.

image

Description

If some more context was added, similar to what is on the Home tab, users could understand if:

  • one machine is having a network issue or CPU consumption spikes that could be flip-flopping the alerts, or
  • a major incident is happening and multiple machines are affected

image

Importance

really want

Value proposition

  1. Users would immediately be able to assess the importance of what's happening

Proposed implementation

Add some stats like the Home tab for:

  • Nodes with most alerts in the last X hours/minutes
  • Top alerts in the last X hours/minutes
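
The two stats above could be computed from a list of recently sent notifications. The following is a minimal sketch, not Netdata Cloud's actual implementation: the `Notification` record and the `flood_stats` function are hypothetical names, and the fields are assumed from the "node + alarm name + chart" data mentioned in this thread.

```python
from collections import Counter
from dataclasses import dataclass
from datetime import datetime, timedelta


@dataclass(frozen=True)
class Notification:
    # Hypothetical record; fields assumed from "node + alarm name + chart".
    node: str
    alarm: str
    chart: str
    sent_at: datetime


def flood_stats(notifications, now, window, top_n=3):
    """Count notifications sent within `window` before `now`.

    Returns the total plus the `top_n` noisiest nodes and alerts.
    """
    cutoff = now - window
    recent = [n for n in notifications if cutoff <= n.sent_at <= now]
    nodes = Counter(n.node for n in recent)
    alerts = Counter((n.alarm, n.chart) for n in recent)
    return {
        "total": len(recent),
        "top_nodes": nodes.most_common(top_n),
        "top_alerts": alerts.most_common(top_n),
    }
```

If a single node dominates `top_nodes`, the flood is likely one flapping machine; if the counts are spread across many nodes, it points to a wider incident.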

@car12o : Do we have some statistical information available on the offender for notifications so we can add it to the flood protection template?

Yes, flood protection is applied per account/email, and we have the latest notifications (node + alarm name + chart) that we sent to the account.

@car12o : Do you mean we already have a new notification template that sends this information to the account? I still see this one in my mailbox.
image

No, I meant we have this data, but the email content is not enriched with it.
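
As a rough illustration of what enriching the email content could look like, the sketch below renders per-node and per-alert counts as a plain-text block. The function name `render_flood_summary`, its parameters, and the output layout are all assumptions made for illustration; the real flood protection template is not shown in this thread.

```python
def render_flood_summary(top_nodes, top_alerts, window_label):
    """Build a plain-text summary that could be appended to the email body.

    top_nodes:  list of (node, count) pairs
    top_alerts: list of ((alarm, chart), count) pairs
    """
    lines = [f"Nodes with most alerts in the last {window_label}:"]
    lines += [f"  - {node}: {count}" for node, count in top_nodes]
    lines.append(f"Top alerts in the last {window_label}:")
    lines += [
        f"  - {alarm} ({chart}): {count}"
        for (alarm, chart), count in top_alerts
    ]
    return "\n".join(lines)
```

For example, `render_flood_summary([("web-1", 2)], [(("cpu", "system.cpu"), 2)], "1 hour")` produces a four-line block listing one node and one alert.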

Thanks @car12o : I have added this to our initiative list, and we can pick it up once we are done with some of the higher-priority items.

car12o commented

Deployed on all environments.