OnCall report for period: 2018-02-20 - 2018-02-27
Oncall during this period
Schedule | Username |
---|---|
AMA | Alejandro Rodriguez |
AMA | Ilya Frolov |
EU | Ahmad Sherif |
EU | Jason Tevnan |
EU | John Jarvis |
PagerDuty Incidents
- Number of incidents: 17
Created | Summary |
---|---|
2018-02-20T19:40:21Z | [#1294 (closed)] Pingdom check Version server is down |
2018-02-20T22:05:54Z | [#1295 (closed)] Firing 1 - prometheus is unreachable |
2018-02-21T06:30:59Z | [#1296 (closed)] Firing 1 - Postgres is generating XLOG too fast, expect this to cause replication lag |
2018-02-21T21:19:36Z | [#1297 (closed)] Firing 2 - 1% disk space left |
2018-02-22T02:57:37Z | [#1298 (closed)] Firing 1 - 1% disk space left |
2018-02-22T07:27:48Z | [#1299] Pingdom check Dev.gitlab.org issue is down |
2018-02-22T13:50:56Z | [#1300 (closed)] Pingdom check GitLab.com Pages is down |
2018-02-23T07:26:39Z | [#1301] Pingdom check Dev.gitlab.org issue is down |
2018-02-24T07:27:42Z | [#1303 (closed)] Pingdom check Dev.gitlab.org issue is down |
2018-02-24T22:25:53Z | [#1304] Firing 1 - 1% disk space left |
2018-02-25T13:47:16Z | [#1305 (closed)] Pingdom check GitLab.com master branch is down |
2018-02-25T13:47:28Z | [#1306 (closed)] Pingdom check GitLab.com new repo is down |
2018-02-25T13:47:31Z | [#1307 (closed)] Pingdom check GitLab.com issue is down |
2018-02-25T13:47:48Z | [#1308 (closed)] Pingdom check GitLab.com public check is down |
2018-02-26T01:39:29Z | [#1309 (closed)] Pingdom check Version server is down |
2018-02-26T15:46:35Z | [#1310 (closed)] Pingdom check Version server is down |
2018-02-27T14:47:02Z | [#1311 (closed)] Pingdom check GitLab.com Pages is down |
Issues
Stats for the last oncall period
- Total number of oncall issues opened in the last oncall shift: 14
  - Access Request: 1
  - Critical: 0
- Total number of oncall issues closed in this milestone: 0
  - Access Request: 0
  - Critical: 0
Open OnCall Issues
- Total number of open oncall issues: 20
  - Access Request: 2
  - Critical: 0
Created | Assignee | Summary |
---|---|---|
26 Feb 18 17:21 UTC | dimitrieh | Configure review apps and domain for design.GitLab.com repository |
21 Feb 18 17:39 UTC | unassigned | Add more Sidekiq capacity |
21 Feb 18 01:32 UTC | unassigned | fix azure snapshots |
20 Feb 18 21:57 UTC | unassigned | Delays in nfs-08, possibly due to user hammering a repository |
19 Feb 18 10:52 UTC | unassigned | Certificate expired for domain gitlab.io |
19 Feb 18 10:08 UTC | unassigned | Create Gitter VPN accounts for all production engineers |
19 Feb 18 10:06 UTC | unassigned | Move the Gitter alerts to the GitLab Pagerduty account |
19 Feb 18 10:02 UTC | unassigned | Use personal ssh accounts instead of the deployer one |
19 Feb 18 09:52 UTC | omame-gitlab | [META] Gitter infrastructure handover |
14 Feb 18 18:59 UTC | jarv | Request: staging database access |
14 Feb 18 13:02 UTC | unassigned | Regression in Deploy |
12 Feb 18 09:28 UTC | unassigned | offboarding Victor Lopez |
12 Feb 18 08:51 UTC | bjk-gitlab | Re-enable NFS metrics collection in node_exporter |
09 Feb 18 08:14 UTC | unassigned | Missing alert for inodes on Gitter nodes |
08 Feb 18 18:44 UTC | unassigned | Manage redis cache config via omnibus |
05 Feb 18 22:55 UTC | northrup | Disable custom domains in GitLab Pages |
05 Feb 18 09:25 UTC | jarv | offboarding pablo |
26 Jan 18 13:47 UTC | northrup | Name resolution errors on nfs-file-XX machines |
21 Jan 18 17:57 UTC | unassigned | XLOG generation peak |
18 Jan 18 11:49 UTC | unassigned | Failed ssh connection monitoring |
Weekly Ops
- Web/Git/API p95 latency
- Gitaly p95 latency
- Sidekiq CPU
- API CPU
- Git CPU
- Web CPU
- NFS timeouts
This issue was automatically generated using oncall-robot-assistant