fix(alerts): ai-gateway igress error ratio
What
Reduce the error ratio from 0.999 to 0.98 for the runway_ingress
SLI
for the ai-gateway
service.
Before | After |
---|---|
![]() source |
![]() source |
Why
This alerts paged the on-call 25 times in the last 14 days (1-2 times every weekday).
This alert is not actionable at the moment until we fix https://gitlab.com/gitlab-com/gl-infra/production/-/issues/17366.
Reference: https://gitlab.com/gitlab-com/gl-infra/production/-/issues/17366
Merge request reports
Activity
added maintenancerefactor typemaintenance labels
assigned to @sxuereb
changed milestone to %16.9
added grouptenant scale label
@cfeick would you mind if you review this?
requested review from @cfeick
- Resolved by Steve Xuereb - Out of Office Back 2025-04-21
mentioned in commit 6b540764
A pipeline is running on a mirror related to this merge request.
Status: starting
https://ops.gitlab.net/gitlab-com/runbooks/-/pipelines/2779275
This MR is included in version 2.354.1The release is available on GitLab release.
Your semantic-release bot
mentioned in issue gitlab-com/gl-infra/on-call-handovers#4612 (closed)
mentioned in issue gitlab-com/gl-infra/reliability-reports#219 (closed)
mentioned in merge request !8538 (merged)