Ownership of MTBF
Mean Time Between Failure was introduced as a trailing indicator to tell us which services were failing most frequently. This was rolled out for primary services only.
We introduced this in Scalability with the hope that it would help us direct our focus to areas of the system that needed our attention.
We've performed periodic reviews of the data, but the indicator hasn't been helpful in the way we had hoped and has not lead us to new and unknown problems. As such, we plan to stop using this as a team indicator.
The purpose of this issue is to decide if teamReliability would like to take ownership of this indicator, or if we should rather remove it.