Implement SLIs and saturation points for Zoekt backend service
Background
The zoekt service does not currently have any SLIs defined. The readiness review refers to the Global Search stage group dashboard.
We do measure SLIs on the client side, and the Zoekt Info dashboard includes server-side metrics, albeit not in standard SLI form.
Problem
The SLI framework gives us various benefits that we are missing out on:
- Standard dashboards that have the same feel, making diagnosis easier.
- Alerting (measured on the server).
- Capacity planning based on saturation metrics.
Proposal
Bring the metrics from the Zoekt Info dashboard into SLIs (apdex, request rate, error rate) and saturation points (disk_maximum_capacity, cpu, etc.) on the zoekt service.
Remove the old dashboard once the move is complete.
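As a rough illustration of what the proposed SLIs and saturation points measure, the sketch below computes them from raw counts for one measurement window. It is not taken from the metrics catalog: the `WindowCounts` shape, the function names, and the sample numbers are all hypothetical, and the apdex here is a simplified ratio with a single latency threshold (the classic Apdex formula also gives half weight to "tolerating" requests).

```python
from dataclasses import dataclass


@dataclass
class WindowCounts:
    """Request counts for one measurement window (hypothetical shape)."""
    total: int         # all requests served in the window
    satisfactory: int  # requests that completed within the latency threshold
    errors: int        # requests that failed


def apdex(counts: WindowCounts) -> float:
    """Share of requests that met the latency threshold (simplified apdex)."""
    # Assumption: a window with no traffic is treated as fully satisfactory.
    return counts.satisfactory / counts.total if counts.total else 1.0


def error_ratio(counts: WindowCounts) -> float:
    """Share of requests that failed."""
    return counts.errors / counts.total if counts.total else 0.0


def request_rate(counts: WindowCounts, window_seconds: float) -> float:
    """Average requests per second over the window."""
    return counts.total / window_seconds


def saturation(used: float, capacity: float) -> float:
    """Fraction of a finite resource (disk, CPU, memory) currently in use."""
    return used / capacity


# Illustrative numbers only, not real Zoekt measurements.
sample = WindowCounts(total=1000, satisfactory=950, errors=5)
print(f"apdex={apdex(sample):.3f}")
print(f"error_ratio={error_ratio(sample):.3f}")
print(f"request_rate={request_rate(sample, window_seconds=60):.2f}/s")
print(f"disk_saturation={saturation(used=80, capacity=100):.2f}")
```

Expressing every resource as a used/capacity fraction is what lets different saturation points (disk, CPU, memory) share one threshold scale for alerting and capacity planning.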
(*) denotes a metric we may want to add
(existing) Zoekt info | (new) zoekt: Overview |
---|---|
Service Level Indicators (NEW) Global Search SLI apdex + error for | |
Saturation Details (NEW) Memory Usage | |