0 |
Investigate failing rule evaluations |
https://gitlab.com/gitlab-com/gl-infra/infrastructure/-/issues/10819 |
workflow-infraDone |
1 |
Re-enable Dead Man's Snitch for "gstg prometheus", if needed |
https://gitlab.com/gitlab-com/gl-infra/infrastructure/-/issues/10806 |
workflow-infraDone |
2 |
Gitaly: Log cumulative per-request rusage ("command stats") |
https://gitlab.com/gitlab-com/gl-infra/infrastructure/-/issues/10790 |
workflow-infraUnder Review |
3 |
Ruby CPU profiling (and flamegraphs) in production |
https://gitlab.com/gitlab-com/gl-infra/infrastructure/-/issues/10789 |
workflow-infraUnder Review |
4 |
Shard rules per Prometheus shard |
https://gitlab.com/gitlab-com/gl-infra/infrastructure/-/issues/10788 |
workflow-infraReady |
5 |
Route non-prod alerts to separate slack channel |
https://gitlab.com/gitlab-com/gl-infra/infrastructure/-/issues/10778 |
workflow-infraTriage |
6 |
Resize production logging cluster |
production#2385 (closed) |
workflow-infraReady |
7 |
Upgrade to td-agent 4 |
https://gitlab.com/gitlab-com/gl-infra/infrastructure/-/issues/10752 |
workflow-infraTriage |
8 |
Capture kibana request logs for analysis |
https://gitlab.com/gitlab-com/gl-infra/infrastructure/-/issues/10740 |
workflow-infraReady |
9 |
How did the v1beta1 BackendConfig CRD get installed on the ops cluster, and wher |
https://gitlab.com/gitlab-com/gl-infra/infrastructure/-/issues/10731 |
workflow-infraTriage |
10 |
Runbooks need to trigger alertmanager k8s updates |
https://gitlab.com/gitlab-com/gl-infra/infrastructure/-/issues/10725 |
workflow-infraTriage |
11 |
Enable Trickster as datasource for internal dashboards |
https://gitlab.com/gitlab-com/gl-infra/infrastructure/-/issues/10722 |
workflow-infraTriage |
12 |
customers.GitLab.com should have Prometheus monitoring |
https://gitlab.com/gitlab-com/gl-infra/infrastructure/-/issues/10717 |
workflow-infraTriage |
13 |
Install bpftrace |
https://gitlab.com/gitlab-com/gl-infra/infrastructure/-/issues/10716 |
workflow-infraReady |
14 |
Add a few more generic host-level ad hoc observability utilities to all chef-man |
https://gitlab.com/gitlab-com/gl-infra/infrastructure/-/issues/10714 |
workflow-infraReady |
15 |
reduce the size of the hot nodes fleet |
https://gitlab.com/gitlab-com/gl-infra/infrastructure/-/issues/10697 |
workflow-infraIn Progress |
16 |
Create convenience script to capture perf-record and make flamegraph output |
https://gitlab.com/gitlab-com/gl-infra/infrastructure/-/issues/10592 |
workflow-infraReady |
17 |
Igor Wiedler - On-call Onboarding |
https://gitlab.com/gitlab-com/gl-infra/infrastructure/-/issues/10579 |
workflow-infraIn Progress |
18 |
Install rbspy via chef |
https://gitlab.com/gitlab-com/gl-infra/infrastructure/-/issues/10554 |
workflow-infraReady |
19 |
Add Elasticsearch timing to "Rails Controller" dashboard |
https://gitlab.com/gitlab-com/gl-infra/infrastructure/-/issues/10509 |
workflow-infraReady |
20 |
Update SSL cert for user-content.gitlab-static.net |
production#2238 (closed) |
workflow-infraIn Progress |
21 |
Write tutorial - How to use flamegraphs for performance profiling |
https://gitlab.com/gitlab-com/gl-infra/infrastructure/-/issues/10399 |
workflow-infraReady |
22 |
Write tutorial - Life of a git request |
https://gitlab.com/gitlab-com/gl-infra/infrastructure/-/issues/10389 |
workflow-infraIn Progress |
23 |
Add Slack staging alert for customers.stg.gitlab.com |
https://gitlab.com/gitlab-com/gl-infra/infrastructure/-/issues/10327 |
workflow-infraReady |
24 |
CI Runner Duration Dashboard. |
https://gitlab.com/gitlab-com/gl-infra/infrastructure/-/issues/10313 |
workflow-infraTriage |
25 |
Update Cloudflare alerts |
https://gitlab.com/gitlab-com/gl-infra/infrastructure/-/issues/10294 |
workflow-infraReady |
26 |
Write a style guide and starter templates for infra tutorials |
https://gitlab.com/gitlab-com/gl-infra/infrastructure/-/issues/10291 |
workflow-infraUnder Review |
27 |
Create an infrastructure tutorials section in the runbooks repository |
https://gitlab.com/gitlab-com/gl-infra/infrastructure/-/issues/10290 |
workflow-infraUnder Review |
28 |
Adjust node disk IO quota metrics to be per node, not per device |
https://gitlab.com/gitlab-com/gl-infra/infrastructure/-/issues/10248 |
workflow-infraReady |
29 |
Alert when jobs are not being processed by sidekiq |
https://gitlab.com/gitlab-com/gl-infra/infrastructure/-/issues/10242 |
workflow-infraReady |
30 |
long-term plan for logging |
https://gitlab.com/gitlab-com/gl-infra/infrastructure/-/issues/10095 |
workflow-infraIn Progress |
31 |
ElasticCloud Watcher: gitaly_abuse_1 triggered with empty project names |
https://gitlab.com/gitlab-com/gl-infra/infrastructure/-/issues/10007 |
workflow-infraReady |
32 |
Fix assorted persistent prometheus scrape errors |
https://gitlab.com/gitlab-com/gl-infra/infrastructure/-/issues/9106 |
workflow-infraTriage |
33 |
move pubsubbeats to k8s |
https://gitlab.com/gitlab-com/gl-infra/infrastructure/-/issues/8962 |
workflow-infraIn Progress |
34 |
Add alert and monitoring to ops.gitlab.net |
https://gitlab.com/gitlab-com/gl-infra/infrastructure/-/issues/8488 |
workflow-infraReady |
35 |
Create new Grafana dashboard git backup |
https://gitlab.com/gitlab-com/gl-infra/infrastructure/-/issues/7788 |
workflow-infraIn Progress |
36 |
Create database for GKE Grafana service |
https://gitlab.com/gitlab-com/gl-infra/infrastructure/-/issues/7787 |
workflow-infraReady |
37 |
Create Elastic Search mappings for IP address log fields in ES Cloud for pubsub |
https://gitlab.com/gitlab-com/gl-infra/infrastructure/-/issues/4689 |
workflow-infraTriage |