Alerts: Audit Log and notification channels
Description
As GitLab continues to add additional monitoring features and capabilities, a key foundation will be the ability to notify users and administrators of events that need attention. These could come from a variety of different sources:
APM
- Defined alerting thresholds for metrics (https://gitlab.com/gitlab-org/gitlab-ee/issues/4451)
- Automated anomaly detection (https://gitlab.com/gitlab-org/gitlab-ee/issues/3610)
Logging
- Auto Log Alerts (https://gitlab.com/gitlab-org/gitlab-ee/issues/3626)
GitLab
- GitLab Service Alerts
Rather than building all the necessary alerting and other functionality into each of these areas, we can instead build a centralized alerting functionality. This would reduce the amount of work, and offer a single UI to manage notifications across these types of events.
Proposal
High priority items
-
Alerts should support going out over configured Chat services, like Slack or Mattermost -
Alerts should be displayed in their own UI, with the ability to look at the history to review what was firing, when, and for how long.
Later items
-
Add support for notifications via SMS -
Add support for responding and acknowledging alerts via notification methods, log this in the audit log. -
Alerts should also feed into the Service Status Dashboard and Internal Ops Dashboard (https://gitlab.com/gitlab-org/gitlab-ee/issues/3541), if acknowledged as a problem
Edited by silv