Database Group Triage for week ending 2025-08-01
About
This issue is used by groupdatabase frameworks to triage issues and make sure they get properly assigned and prioritized. Each week, a bot will look up the old issue, pick the next assignee in the list, and submit a new issue with a list of any issues that may need attention from the team.
Process
-
Review any issues with undefined types -
Post any questions or pressing issues to the database group meeting doc
Bugs needing Severity
For each issue below:
- For each typebug, spend up to 1 hour investigating or fixing the issue.
- Assign it one of severity1, severity2, severity3, or severity4
- Document any findings you make in a comment on the issue, and if the issue still needs additional work or refinement, consider looping in
@alexivesand@to help with scheduling and priority.
-
delete_batched_background_migration is causing long running statements that timeout on the database -
BackfillIssueAssigneesNamespaceId causes migration to fail -
Database migrations failed for upgrade from 18.1.2 to 18.2.0 -
ActiveRecord::StatementInvalid (PG::UndefinedColumn: ERROR: column application_settings.user_email_lookup_limit does not exist -
background migrations failed -
Database migration error from 20211103162025
Issues with Undefined Type
For each issue below:
- Assess if the issue is appropriately assigned to groupdatabase frameworks, if not add the correct group label.
- Add the proper work type label, or if the issue is a request for support, redirect the user to our support resources with the following message:
Hey @author. Based on the information given, this request for support is out of the scope of the issue tracker (which is for new bug reports and feature proposals). Unfortunately, I won't be able to help get it resolved. However, for support requests we have several resources that you can use to find help and support from the Community, including: * [Technical Support for Paid Tiers](https://about.gitlab.com/support/) * [Community Forum](https://forum.gitlab.com/) * [Reference Documents and Videos](https://about.gitlab.com/get-help/#references) Please refer to our [Support page](https://about.gitlab.com/support/) for more information. If you believe this was closed in error, please feel free to re-open the issue. /label ~"support request" /close - If the issue needs further investigation, add databasetriage and spend up to 1 hour of investigating the issue.
- Document any findings you make in a comment on the issue, and if the issue still needs additional work or refinement, consider looping in
@alexivesand@to help with scheduling and priority.
-
Swap table names between the partitioned table and vulnerability_occurrences -
Create new partitioned vulnerability_readstable -
How to force / finish BackgroundMigrationJob -
Add documentation for migrating data from Omnibus Install to External Services (RDS, Redis,Elasticsearch)
Recent issues labeled database
For each issue below:
- If the issue has no
grouplabel, consider if it should be addressed by groupdatabase frameworks and if so label it. - If the issue has a group, and you think they may need assistance from us:
- If the issue needs further investigation, add databasetriage and spend up to 1 hour of investigating the issue.
- Document any findings you make in a comment on the issue, and if the issue still needs additional work or refinement, consider looping in
@alexivesand/or@
-
grouprunner Cross-Database Modification Error Causing Pipeline Timeouts -
~"group::not_owned" Cleanup offer_email_reset associated database columns -
~"group::not_owned" Database query performance optimization -
~"group::not_owned" Database query performance optimization -
~"group::not_owned" Database query performance optimization -
~"group::not_owned" Database query performance optimization -
groupcomposition analysis Remove the ref-contextual columns from the sbom_occurrences table -
groupcomposition analysis Update the SBOM types and associated services to begin retrieving non-ref contextual data from the sbom_dependencies table -
groupsecurity infrastructure Update the sbom ingestion process to begin inserting non-ref specific data into the sbom_dependenciestable -
groupsecurity infrastructure Create the sbom_dependenciestable -
groupsecurity infrastructure Update Sbom Finders to retrieve their sbom records from the sbom_occurrence_readstable -
groupsecurity infrastructure Adjust SBOM ingestion process to fill in the security_project_ref_idfor new sbom occurrence records -
groupsecurity infrastructure Backfill sbom_occurrences.security_project_ref_idwith the default ref for each project -
groupsecurity infrastructure Add security_project_ref_idto thesbom_occurrencestable -
groupsecurity infrastructure Implement synchronization of sbom_occurrence data to the sbom_occurrence_reads table -
groupsecurity infrastructure Create sbom_occurrence_reads table -
~"group::not_owned" Drop the old vulnerabilities table -
~"group::not_owned" Swap table names between the existing vulnerabilities table and the partitioned one -
groupsecurity insights Backfill existing vulnerability data to the new partitioned vulnerabilities table -
groupsecurity insights Create new partitioned vulnerabilities table
Customer Issue Hand-offs
For each issue below:
- If the item already has a back and forth, check in with the @irina.bronipolsky to see if what needs to be handed off
- Consider if it's right for our team and if not, ask the support rep to follow up with the correct team.
- If the item is right for our team, spend some time accessing it and trying to assist.
- If you need help, and the request seems pressing, ask in the team channel if there's someone who can help dig into it.
-
Background migration job "BackfillPartitionedWebHookLogsDaily: web_hook_logs" stuck at finializing~"Help group::Database Frameworks", ~"RFH-Lifecycle::Last comment from Support", severity3
Review Top Queries for Changes
There were some new anomalous queries to review on the main database, broken down by metric type:
| Metric | # of queries |
|---|---|
| by_total_time | 0 |
| by_time_avg | 5 |
Please find the detailed query report here (Ops access required)
For each query listed:
- Spend up to 30 minutes trying to understand its source.
- If needed, determine the team that owns the query and file an issue with them. A good place to start would be to check the relevant table's owner under
db/docs.
For additional context around how these queries were determined to be anomalous as well as historical rankings for known queries, all of the data we have collected lives in a database dump stored in artifacts in the query-stats-reporting repo on ops. Specifically check query-stats.yml for an example of how you can use this dump to locally rehydrate this database on your own Postgres instance and analyze the collected statistics. For this iteration of this report, queries are grouped by fingerprint.
Legacy query analysis
It is not necessary to look at this unless we need to check the new anomalous query report against the top query report on the main database.
Click to expand
For each database:
- Are there new queries in the top queries (See: K003 Top-15 Queries by total_time) compared to a previous report?
- If there are new queries, Spend up to 30 minutes trying to understand their sources
- If there are new queries, file an issue and assign to the team that owns the query, or if unable to source, then the team that owns the table
- If there are no new queries, review the top 5 queries for each to see if there are already investigations, or file issues to investigate them
-
Review recent Primary Checkup for new Top Queries (K003) -
Review recent CI Checkup for new Top Queries (K003)
Review Dashboards on queries with large In-Lists
For both Sidekiq and Rails:
- If the report is empty, no action is needed
- For each item on the report:
- Determine what feature the query belongs to
- Create an issue for the team owning the feature category asking them to limit the maximum number of items in the in query
Int4 Saturation Review
For issue(s) linked below:
-
Ensure each table referenced in the report has an associated issue with ~"Gitlab.com Resource Saturation" and infradev attached
-
Make sure the issue is assigned to a team.
-
All tables approaching saturation in Capacity warning for patroni: pg_int4_id have an issue assigned to a team.
-
@alexives if @jon_jenkins isn't available this week, please reassign to @krasio
cc @
This is generated by this project.