
Thanos and Prometheus not responding under load

Summary

Memory utilization on the primary gprd Prometheus server is causing it to crash.

Service(s) affected: Grafana, Thanos, Prometheus
Team attribution: Reliability Engineering / Observability

Minutes downtime or degradation:

Timeline

2019-08-14

14:14 UTC - Grafana graphs stopped responding as a result of Prometheus crashes
14:45 UTC - An MR was submitted to resize the instances and address the memory usage: https://ops.gitlab.net/gitlab-com/gitlab-com-infrastructure/merge_requests/923
...

Edited Aug 14, 2019 by AnthonySandoval