Surface metrics charts on the alert detail page for alerts from GitLab-managed Prometheus
Problem to solve
Charts help users visualize what went wrong while triaging alerts. If a particular threshold was exceeded, we can reduce time spent during investigation by automatically including the relevant metrics chart in the alert.
Intended users
User experience goal
Have metrics available as part of the alerts workflow, so users don't have to switch context and/or navigate elsewhere as part of the alert triage process.
Proposal
Allow users to view metrics as part of the alert triage workflow for alerts from GitLab managed Prometheus instances.
To ensure the smallest possible iteration, we can use the same time-range behaviour as incident embeds - to center on the start-time and show 30 mins before and after: https://docs.gitlab.com/ee/user/project/integrations/prometheus.html#embedding-metrics-based-on-alerts-in-incident-issues
Design
We can introduce an additional tab to hold the metrics we receive (or are able to generate):
Loading state | Metric displayed | No metrics available (MVC) |
---|---|---|
Further details
Permissions and Security
Documentation
Documentation required. Please add a new sub-section to this section.