Platform (UI): DBLab Instance status monitoring and Alerts
Part 1 (email): To ensure the optimal state of DBLab instances, we require a configurable monitoring and alerting system integrated into the platform UI. This system should deliver automated notifications for specific metrics and thresholds and allow configuration of email recipients for these alerts.
Alerts to be Implemented:
- DBLab Engine Status
- Monitor the instance’s operational state.
- Trigger an alert if the status is not OK.
- Snapshot Creation Time (Lag)
- Track the time since the last snapshot was created (newest data state time).
- Trigger an alert if the snapshot age exceeds a configurable threshold (e.g., 1 day).
- ZFS Pool Free Space
- Monitor the available space in the ZFS pool.
- Trigger an alert if utilization exceeds a set threshold (e.g., 80%).
Part 2 (banner): Additionally, a banner under the header would be useful for displaying relevant alerts or warnings specific to each instance.
Edited by Vitaliy Kukharik