KR: Certify - Improve gathering of metrics and Service Dashboard
Goals:
- Work with the Scalability team to advise on how our service dashboard can be improved
- Gather error logging from Service Desk workers
- Related: Implement SLIs for Service Desk (gitlab-org/gitlab#298744 (closed))
| Dashboard | https://dashboards.gitlab.net/d/stage-groups-certify/stage-groups-group-dashboard-plan-certify |
| Engineering DRI | @jprovaznik |
| Scalability team issue | gitlab-com/gl-infra/scalability#897 (closed) |
| Spike | gitlab-org/gitlab#325863 (closed) |
Outcomes:
- The Scalability team has the information required to build a comprehensive Service Dashboard for Certify
- Better visibility into Service Desk email deliverability errors (we currently have none)
- SLIs and alerts for elevated error rates and downtime in Certify services; see gitlab-org/gitlab#298744 (closed)
Edited by John Hope