2020-09-14 - Triage report for "group::health"

Hi, @sarahwaldner @crystalpoole @ClemMakesApps @ameliabauerly @kbychu

This is a group or stage level triage report that aims to summarize the feature proposals and bugs which have not been scheduled or triaged. For more information please refer to the handbook:

  • https://about.gitlab.com/handbook/engineering/quality/triage-operations/index.html#triage-reports

Scheduling the workload is a collaborative effort by the Product Managers and Engineering Managers for that group. Please work together to provide a best estimate on priority and milestone assignments. For each issue please:

  • Determine if the issue should be closed if it is no longer relevant or a duplicate.
  • If it is still relevant please assign either a best estimate versioned milestone, the %Backlog or the %Awaiting further demand milestone.
  • Specifically for ~bug, if there is no priority or clarity on a versioned milestone, please add a Priority label. Priority labels have an estimate SLO attached to them and help team members and the wider community understand roughly when it will be considered to be scheduled.
    • https://docs.gitlab.com/ee/development/contributing/issue_workflow.html#priority-labels
  • Once a milestone has been assigned please check off the box for that issue.
  • Please work with your team to complete the list by the due date set.

Feature Proposal Section

For the following feature proposals. Please either close or assign either a versioned milestone, the %Backlog or the %Awaiting further demand milestone.

Unscheduled feature with customer

  • #232746 Shareable configured Operations Dashboard customer, devopsmonitor, feature, ~"group::health", sectionops
  • #223689 Show Puma metrics in "GitLab Omnibus - Overview" dashboard instead of Unicorn GitLab Self-Monitoring, customer, devopsmonitor, feature, ~"group::health", sectionops

Unscheduled feature (non-customer)

  • #248950 (closed) Remove alert table specific css CSS cleanup, ~"Category:Alert Management", Category:Incident Management, devopsmonitor, feature, ~"feature::maintenance", frontend, ~"group::health", sectionops, ~"technical debt"
  • #245223 (closed) Follow-up from "Allow to update issuable severity via GraphQL" Category:Incident Management, backend, devopsmonitor, feature, ~"feature::maintenance", ~"group::health", sectionops
  • #244943 (closed) Allow dedicated incidents to show up in the Operations nav rather than the Issues nav Category:Incident Management, devopsmonitor, feature, ~"group::health", sectionops
  • #238225 (closed) Locale number format can differ between detail and axis tick values in Gauge type panels UX, devopsmonitor, feature, frontend, ~"group::health", sectionops
  • #238203 (closed) Investigate the required status of the metric ID field in dashboard YAML UX, devopsmonitor, feature, frontend, ~"group::health", sectionops, workflowvalidation backlog
  • #238087 (closed) Allow users to disable tracing devopsmonitor, feature, ~"group::health", sectionops
  • #237889 (closed) "Add panel" preview in Metrics should update automatically UX, devopsmonitor, feature, ~"group::health", sectionops, workflowdesign
  • #235464 (closed) Remove old metrics dashboard schema validation code backend, devopsmonitor, feature, ~"feature::maintenance", ~"group::health", sectionops, ~"technical debt"
  • #234045 (closed) Add BE validations specific to Gauge type panel YML backend, devopsmonitor, feature, ~"group::health", sectionops
  • #233034 (closed) Automatically add incidents as annotations to designated metrics dashboard chart devopsmonitor, feature, ~"group::health", sectionops
  • #232922 (closed) Bookmark or save drafts of the Panel Preview YAML auto updated, devopsmonitor, feature, ~"group::health", sectionops
  • #228649 (closed) Improvements to metrics dashboard environment selector UX, devopsmonitor, feature, featureaddition, ~"group::health", sectionops, workflowdesign
  • #227931 (closed) Cleanup old metrics dashboard code backend, devopsmonitor, feature, ~"feature::maintenance", frontend, ~"group::health", sectionops
  • #227930 (closed) Start using the new metrics and metrics_dashboard routes backend, devopsmonitor, feature, featureaddition, frontend, ~"group::health", sectionops
  • #227928 (closed) Fix the dashboard vue app to work with the new and old metrics dashboard routes devopsmonitor, feature, featureaddition, frontend, ~"group::health", sectionops
  • #227927 (closed) Begin using new Prometheus proxy API backend, devopsmonitor, feature, featureaddition, ~"group::health", sectionops
  • #227926 (closed) Create view and JS files for new metrics dashboard devopsmonitor, feature, featureaddition, frontend, ~"group::health", sectionops
  • #227925 (closed) Create new Prometheus proxy API backend, devopsmonitor, feature, featureaddition, ~"group::health", sectionops
  • #227924 (closed) Create new metrics dashboard routes and controller backend, devopsmonitor, feature, featureaddition, ~"group::health", sectionops
  • #227840 (closed) Refactor Prometheus::ProxyService to be more reusable backend, devopsmonitor, feature, featureaddition, ~"group::health", sectionops
  • #222451 (closed) Support multiple series for stacked column charts devopsmonitor, feature, ~"group::health", sectionops, workflowproblem validation
  • #219425 (closed) Add gitlab.com dashboards to self monitoring project devopsmonitor, feature, ~"group::health", sectionops
  • #214325 Figure out where Operations tools settings should live ~"Accepting merge requests", UX, devopsmonitor, feature, ~"group::health", missed:13.1, sectionops
  • #212420 (closed) Cluster Applications: Allow for the user to retry an update on update_errored devopsmonitor, feature, ~"group::health", sectionops
  • #211367 (closed) Add Product Instrumentation to GitLab - MVC ~"Accepting merge requests", devopsmonitor, feature, ~"group::health", sectionops
  • #208427 (closed) Disable unused features of GitLab self-monitoring project ~"Accepting merge requests", GitLab Self-Monitoring, devopsmonitor, feature, ~"group::health", sectionops, workflowplanning breakdown
  • #208284 (closed) Add dangerbot reminder to update metrics docs ~"Accepting merge requests", GitLab Self-Monitoring, devopsmonitor, feature, ~"group::health", sectionops
  • #207060 (closed) Add support for labels for folders/groups ~"Accepting merge requests", Dogfooding, DogfoodingBuild in GitLab, backend, devopsmonitor, documentation, feature, frontend, ~"group::health", sectionops
  • #207058 (closed) Add label support for dashboards Dogfooding, DogfoodingBuild in GitLab, backend, devopsmonitor, documentation, feature, frontend, ~"group::health", sectionops, workflowdesign
  • #198322 (closed) Support setting alerts for percentage and percentile thresholds ~"Accepting merge requests", Category:APM alerts, ~"Category:Alert Management", backend, devopsmonitor, feature, frontend, ~"group::health", sectionops

Bug Section

For the following bugs. Please either close or assign either a versioned milestone, the %Backlog or the %Awaiting further demand milestone and ensure that a priority label is set.

Heatmap for all bugs

Bugs for their priority and severity label are counted here. Every bug should have severity and priority labels applied. Please take a look at the bugs which fall into the columns indicating that the priority or severity labels are currently missing.

severity1 severity2 severity3 severity4 No severity
priority1 0 0 0 0 0
priority2 0 0 1 3 0
priority3 0 0 39 6 1
priority4 0 1 2 46 0
No priority 0 0 4 2 9

Unscheduled frontend ~bug (non-customer)

  • #238179 (closed) Deployment tooltips are hard to focus/hover on UX, ~"bug", devopsmonitor, frontend, ~"group::health", priority4, sectionops, severity4
  • #208933 (closed) Only last of duplicated metrics dashboards panel are feed with data ~"bug", devopsmonitor, frontend, ~"group::health", priority4, sectionops, severity4
  • #37460 (closed) Portions of the Metrics Dashboard "More Actions" menu is hidden by the left sidebar ~"bug", devopsmonitor, frontend, ~"group::health", sectionops
  • #26542 (closed) Follow-up from "Use SVG icon for deployment series" auto updated, ~"bug", devopsmonitor, frontend, ~"group::health", priority3, sectionops, severity4

Unscheduled ~bug with customer

  • #236017 (closed) Problem with alerts not working from metric page with gitlab installed prometheus on kubernetes 1.16 Category:APM alerts, ~"bug", customer, devopsmonitor, ~"group::health", sectionops
  • #214126 (closed) After a 'Can't reserve bytes for memory-mapped file', Unicorn starts to throw 'unmapped file' and 'deadlock; recursive locking' errors GitLab Self-Monitoring, ~"bug", customer, devopsmonitor, ~"group::health", sectionops
  • #26340 (closed) Cleanup stale Prometheus metrics ~"bug", customer, devopsmonitor, ~"group::health", priority4, sectionops, severity4

Unscheduled ~bug (non-customer)

  • #244870 (closed) Error displaying incidents ~"bug", devopsmonitor, ~"group::health", operations dashboard, sectionops, severity3
  • #243543 (closed) Docs feedback: Current health-check endpoint (users/sign_in) does not work in AWS Category:Error Tracking, ~"bug", devopsmonitor, ~"group::health", sectionops, severity3
  • #241304 (closed) Reset Webhook URL button is in the incorrect place ~"bug", devopsmonitor, ~"group::health", sectionops
  • #234052 (closed) Unable to remove empty/dead default metric groups ~"bug", devopsmonitor, ~"group::health", sectionops, severity3
  • #232908 (closed) Not able to see Cluster Health from Prometheus after installing it from pipeline ~"bug", devopsmonitor, ~"group::health", priority4, sectionops, severity4
  • #231462 (closed) Alert values in GitLab database can go out of sync with Prometheus Category:APM alerts, ~"Category:Alert Management", ~"bug", devopsmonitor, ~"group::health", priority3, sectionops, severity3
  • #215853 (closed) NoMethodError in Grafana validator: undefined method `include?' for nil:NilClass ~"bug", devopsmonitor, ~"group::health", sectionops
  • #208676 (closed) Can't create self monitoring project: Could not create instance administrators group. Errors: ["You don’t have permission to create groups."] APM, ~"Accepting merge requests", backend, ~"bug", devopsmonitor, ~"group::health", priority3, sectionops, severity3
  • #194475 (closed) Error in Operations Settings if Pipelines are disabled ~"bug", devopsmonitor, ~"group::health", operations, sectionops, settings
  • #32577 (confidential) ~"(confidential)"
  • #10317 (closed) Check prometheus_alerts license in NotifyService Enterprise Edition, ~"bug", devopsmonitor, ~"group::health", priority4, sectionops, severity4
  • #26369 (closed) Metrics dashboard renders incorrectly if there are too many labels ~"bug", devopsmonitor, ~"group::health", priority4, sectionops, severity4
  • #26341 (closed) Add metric to measure negative Rails queue durations ~"bug", devopsmonitor, ~"group::health", priority4, sectionops, severity4
  • #25887 (closed) Unable to connect to Prometheus server ~"bug", devopsmonitor, ~"group::health", priority3, sectionops, severity3
  • #25812 (closed) Order metrics on performance dashboard by weight within a metric group backend, ~"bug", devopsmonitor, ~"group::health", priority4, sectionops, severity4
  • #25463 Optimize Deployment#last_for_environment ~"bug", devopsmonitor, ~"group::health", priority4, sectionops, severity4
  • #24712 (closed) Provide a more helpful error message when blocked from local Prometheus ~"bug", devopsmonitor, ~"group::health", priority3, sectionops, severity4

Heatmap for ~missed-SLO bugs

severity1 severity2 severity3 severity4 No severity
priority1 0 0 0 0 0
priority2 0 0 0 2 0
priority3 0 0 2 0 0
priority4 0 0 0 0 0
No priority 0 0 0 0 0


This is a group level triage report that aims to collate the latest bug reports (for frontend and otherwise) and feature proposals. For more information please refer to the handbook:

  • https://about.gitlab.com/handbook/engineering/quality/triage-operations/index.html#triage-reports

If assignees or people mentioned in this individual triage report need to be amended, please edit group-definition.yml.

Edited Sep 14, 2020 by Sarah Waldner
Assignee Loading
Time tracking Loading