Weekly Reliability (SRE) Team Newsletter – On-call Period:2020-12-08 - 2020-12-15
This issue has been moved and the description cleared of content to avoid polluting the search results of this tracker, see the moved issue link for the original newsletter
Designs
- Show closed items
Activity
-
Newest first Oldest first
-
Show all activity Show comments only Show history only
- ops-gitlab-net added Reliability-Team-Newsletter label
added Reliability-Team-Newsletter label
- AnthonySandoval marked this issue as related to #12041 (moved)
marked this issue as related to #12041 (moved)
- Andrew Newdigate changed the description
Compare with previous version changed the description
- AnthonySandoval changed the description
Compare with previous version changed the description
- AnthonySandoval unassigned @AnthonySandoval
unassigned @AnthonySandoval
- AnthonySandoval changed the description
Compare with previous version changed the description
- AnthonySandoval changed the description
Compare with previous version changed the description
- Owner
https://gitlab.com/gitlab-com/gl-infra/production/-/issues/3104 is being looked into. Needing to figure out how to make builds more reliable on handbook.
- Owner
Notes from 2020-12-14 22:30 handover:
production#3185 (closed) - silence re-entered.
1 Collapse replies - Owner
production#3174 (closed) is still ongoing and may alert fyi.
Wal-g backup alerts (last successful backup) had a silence and may go off again. Depends on the random node picked for doing the backup. Backups were running fine, but false alert due to timestamp on another node that was not cleared. Should not recur, but noting here.
Edited by Dave Smith https://ops.gitlab.net/gitlab-cookbooks/gitlab-walg/-/merge_requests/37 should fix the false alert for
walgBaseBackupDelayed
after a backup failure. 1 1
- ops-gitlab-net changed the description
Compare with previous version changed the description
- Andrew Newdigate changed the description
Compare with previous version changed the description
- Andrew Newdigate changed the description
Compare with previous version changed the description
- Jose Finotto changed the description
Compare with previous version changed the description
- Henri Philipps changed the description
Compare with previous version changed the description
- Alberto Ramos changed the description
Compare with previous version changed the description
- Alberto Ramos changed the description
Compare with previous version changed the description
- AnthonySandoval changed the description
Compare with previous version changed the description
- AnthonySandoval marked this issue as related to #12155 (moved)
marked this issue as related to #12155 (moved)
- AnthonySandoval closed
closed
- 🤖 GitLab Bot 🤖 added workflow-infraDone label
added workflow-infraDone label
moved to reliability-reports#113 (closed)
- John Jarvis changed the description
Compare with previous version changed the description