GitLab.com outage 2017-10-23
Timeline of events (in UTC time):
-
13:16load average onapi-08goes up to19and seem to makenfs-file-12load to go way up -
13:17load average onnfs-file-12goes up to400+ -
13:18to13:20number of reqs per second goes down from~500to~70 -
13:20load average on the frontend fleet goes up to30in average -
13:25to13:27number of reqs per second goes down again to~60 -
13:25web-05andweb-10went MIA -
13:26load average onnfs-file-10goes up to300 -
14:05web-10came up after issuing reboot from the Azure panel -
14:16web-02came up after issuing reboot from the Azure panel -
14:24web-09came up after issuing reboot from the Azure panel
Edited by Victor Lopez