Research whether we can cancel a pipeline when an error rate exceeds a threshold.
As part of &3088, we want to see whether we can stop a deployment when a threshold is met.
We will start with the following metric:
| Metric | Query |
| --- | --- |
| HTTP Error Rate (%) | `sum(rate(nginx_ingress_controller_requests{status=~"5.*",namespace="%{kube_namespace}",ingress=~".*%{ci_environment_slug}.*"}[2m])) / sum(rate(nginx_ingress_controller_requests{namespace="%{kube_namespace}",ingress=~".*%{ci_environment_slug}.*"}[2m])) * 100` |
For the POC, we will use HTTP Error Rate (%).
- Using the existing Prometheus API, we will query the current error rate, compare it against the threshold, and see whether exceeding it can trigger an event to the pipeline.
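
A minimal sketch of what the check could look like, assuming the Prometheus HTTP API instant-query endpoint (`/api/v1/query`) is reachable from wherever the check runs. The function names, the Prometheus URL, and the 5% threshold are hypothetical and only for illustration; `query` stands for the HTTP Error Rate (%) expression with its `%{...}` placeholders already substituted.

```python
import json
import math
import urllib.parse
import urllib.request
from typing import Optional

# Hypothetical values for illustration; neither comes from this issue.
PROMETHEUS_URL = "http://prometheus.example.com"
ERROR_RATE_THRESHOLD = 5.0  # percent


def parse_error_rate(payload: dict) -> Optional[float]:
    """Extract the value from an instant-query response, or None if there is no data."""
    if payload.get("status") != "success":
        raise RuntimeError(f"Prometheus query failed: {payload}")
    result = payload["data"]["result"]
    if not result:
        return None  # no matching series, e.g. the environment has no traffic yet
    # Each sample is [unix_timestamp, "value as string"].
    value = float(result[0]["value"][1])
    # A 0 / 0 division in the query yields NaN; treat it as "no data".
    return None if math.isnan(value) else value


def fetch_error_rate(prometheus_url: str, query: str) -> Optional[float]:
    """Run the PromQL expression as an instant query and return the error rate in percent."""
    params = urllib.parse.urlencode({"query": query})
    url = f"{prometheus_url}/api/v1/query?{params}"
    with urllib.request.urlopen(url, timeout=10) as response:
        return parse_error_rate(json.load(response))


def should_cancel(error_rate: Optional[float], threshold: float) -> bool:
    """Cancel only when data is present and the rate is strictly above the threshold."""
    return error_rate is not None and error_rate > threshold
```

If `should_cancel(fetch_error_rate(PROMETHEUS_URL, query), ERROR_RATE_THRESHOLD)` returns true, one option to evaluate is calling the existing pipelines API (`POST /projects/:id/pipelines/:pipeline_id/cancel`). Whether that is the right event mechanism is part of what this research should answer.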
Edited by Shinya Maeda