Skip to content

Rename prometheus missing from cluster notifications to be more helpful

Summary

The alert that informs the EOC that there are no operating prometheus pods in a cluster could be re-titled to specify that pods are missing and not that the cluster may be missing.

Related Incident(s)

Originating issue(s)😞 production#5998

Desired Outcome/Acceptance criteria

Getting paged with an alert containing GKE gprd-us-east1-b has gone missing no longer strikes fear into the heart of the EOC.

Associated Services

Corrective Action Issue Checklist

  • link the incident(s) this corrective action arose out of
  • give context for what problem this corrective action is trying to prevent from re-occurring
  • assign a severity label (this is the highest sev of related incidents, defaults to 'severity::4')
  • assign a priority (this will default to 'priority::4')