Surface metrics charts on the alert detail page for alerts from Externally-managed Prometheus
Problem to solve
Charts help users visualize what went wrong while triaging alerts. If a particular threshold was exceeded, we can reduce time spent during investigation by automatically including the relevant metrics chart in the alert.
Intended users
User experience goal
Have metrics available as part of the alerts workflow, so users don't have to switch context and/or navigate elsewhere as part of the alert triage process.
Proposal
Allow users to view metrics as part of the alert triage workflow for alerts from externally managed Prometheus instances.
Design
We can introduce an additional tab to hold the metrics we receive (or are able to generate):
Loading state | Metric displayed | No metrics available (MVC) |
---|---|---|
![]() |
![]() |
![]() |
Further details
This work supports the Alert Management direction.
Permissions and Security
Documentation
Documentation required. Please add a new sub-section to this section.
Availability & Testing
What does success look like, and how can we measure that?
What is the type of buyer?
Is this a cross-stage feature?
Links / references
Edited by Sarah Waldner