Update PatroniXminAgeTooLargeError to ignore REINDEX INDEX CONCURRENTLY
Summary
The PatroniXminAgeTooLargeError alert pages the on-call whenever a large REINDEX INDEX CONCURRENTLY is running. This usually happens over the weekend, when we run background workers to change some indexes, such as gitlab-org/gitlab#391671 (closed).
The alert uses the pg_txid metrics, which come from a custom query run by postgres exporter.
Related Incident(s)
Originating issue(s): production#8417 (closed)
Desired Outcome/Acceptance Criteria
The metric should not take REINDEX INDEX CONCURRENTLY into account, as we already do for the PatroniLongRunningTransactionDetected alert.
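To illustrate the intended behaviour, here is a minimal Python sketch of the exclusion logic. It is not the actual pg_txid custom query, which lives in the postgres exporter configuration and is not shown in this issue. The function name `max_xmin_age`, the dict keys `query` and `xmin_age`, and the regular expression are all hypothetical, chosen only to show which backends would be skipped.

```python
import re

# Hypothetical pattern: matches a REINDEX statement that uses CONCURRENTLY,
# e.g. "REINDEX INDEX CONCURRENTLY idx_foo".
REINDEX_CONCURRENTLY = re.compile(
    r"^\s*REINDEX\b.*\bCONCURRENTLY\b", re.IGNORECASE | re.DOTALL
)


def max_xmin_age(backends):
    """Return the largest xmin age, skipping REINDEX ... CONCURRENTLY backends.

    `backends` is an iterable of dicts with the hypothetical keys
    "query" (statement text or None) and "xmin_age" (int or None).
    Returns 0 when no backend qualifies.
    """
    ages = [
        b["xmin_age"]
        for b in backends
        if b.get("xmin_age") is not None
        and not REINDEX_CONCURRENTLY.match(b.get("query") or "")
    ]
    return max(ages, default=0)
```

With this filter, a backend running `REINDEX INDEX CONCURRENTLY` no longer contributes to the reported age, while any other backend holding back xmin still does. In practice the same condition would presumably be expressed in the SQL of the custom query rather than in application code.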
Associated Services
Corrective Action Issue Checklist
- Link the incident(s) this corrective action arose out of
- Give context for what problem this corrective action is trying to prevent from re-occurring
- Assign a severity label (this is the highest sev of related incidents, defaults to 'severity::4')
- Assign a priority (this will default to 'Reliability::P4')