L Partition taggings table

added grouppipeline execution maintenancescalability typemaintenance + 1 deleted label

added devopsverify sectionci + 1 deleted label

changed the description

marked this issue as related to #350053 (closed)

@drew @carolinesimpson here's the issue to improve taggings.

@mbobin @drew I pulled this into the Cells Support epic for now, but it could also go into a Partitioning epic. I think it is really needed for both efforts. WDYT?

@carolinesimpson @mbobin Agreed, given the size of the taggings table, should this be prioritized with the ongoing partitioning effort? Or scheduled as a follow-up once we partition the 6 CI tables?

@cheryl.li @carolinesimpson I can help as a dedicated Verify & Database reviewer to speed it up this effort, but I don't have any particular preference for which epic it should belong to. 😇

Thanks @mbobin. Based on this, we should probably schedule in the Next 4-6 releases (e.g. after the current CI Data Partitioning effort is complete) - WDYT @carolinesimpson @rutshah?

mentioned in issue gitlab-org/quality/triage-reports#16631 (closed)

added to epic &12323 (closed)

mentioned in issue #448635 (closed)

mentioned in issue gitlab-org/quality/triage-reports#16733 (closed)

mentioned in issue gitlab-org/quality/triage-reports#16816 (closed)

set weight to 5

changed title from Partition taggings table to L Partition taggings table

mentioned in issue gitlab-org/quality/triage-reports#16922 (closed)

added Category:Continuous Integration + 1 deleted label

changed milestone to %Backlog

mentioned in epic &12323 (closed)

@mbobin I haven't looked at the internals of AATO in many years, but do you think there's some amount of configuration we could do to split them up into separate tables?

It would be nice to avoid vendoring the gem

@drew looks like you could specify the taggings table name, but that's still a polymorphic table without FKs and a way to inject the partitioning constraint. While the table name is configurable, its structure is not and it still remains a global table.

Based on current progress, I don't think we'll get to partitioning taggings til we finish the current tables in flight, e.g. starting some time in FY25-Q3 (July). @mbobin @tianwenchen WDYT?

Any rough effort estimates for partitioning this table?

Sounds good to me. I guess my rough estimate is 3 milestones, one milestones for all the changes to prepare and update the gem and create our own tables, and one for executing the backfill considering the table size of taggings, and one for cleanup.

@tianwenchen I moved this to workflowready for development as you were able to weight it already. 🙂

removed Category:Continuous Integration + 1 deleted label

added groupci platform + 1 deleted label and removed grouppipeline execution + 1 deleted label

mentioned in issue gitlab-org/quality/triage-reports#17616 (closed)

mentioned in issue gitlab-org/quality/triage-reports#17718 (closed)

added Category:Continuous Integration (CI) Scaling + 1 deleted label

mentioned in issue #350053 (closed)

changed the description

added workflowplanning breakdown + 1 deleted label

changed milestone to %Next 4-6 releases

changed milestone to %Next 1-3 releases

added workflowready for development + 1 deleted label and removed workflowplanning breakdown + 1 deleted label

changed milestone to %17.2

@mbobin Can you help me break down this issue into several issues, as it'll take 3ish milestones to fully partition the table (hence the 5 weight)? Feel free to promote it into its own epic. As discussed, I'd like to get started on this in %17.2.

closed

promoted to epic &14167

mentioned in epic &14167

changed epic to &13678

changed epic to &12323 (closed)

mentioned in issue gitlab-org/database-team/team-tasks#447 (closed)

@mbobin grouptenant scale noticed that this issue is closed, but it is still mentioned as sharding_key_issue_url for 2 tables in their database dictionary file here and here.

Sharding key issue URLs are supposed to remain open until the work on sharding keys on that table is complete. If the work on this table is complete, sharding_key_issue_url can be removed from the yml.

Could you please make sure that either:

An open issue is present in the yml for tags and taggings table?
Or, if the work is complete, the sharding_key_issue_url is removed from the yml?

Thank you 🙂

@manojmj this was promoted to epic because it too much work to do in a single issue. Could we add an epic in sharding_key_issue_url?

@mbobin thanks for the quick response 🙂

Could we add an epic in sharding_key_issue_url?

Unfortunately not right now, as we are pattern matching to a issue URL's regex over here.

I would imagine that this has been done because different tables would warrant a different amount of work, and would finish at different points in time and it would be much easier to pull out each table into their own issue and track them seperately rather than one epic that encapsulates everything.

That said, I don't see a problem in tracking work of multiple tables in a single epic too (we already have multiple tables that point to the same issue URL, see the dashboard). It is just that you'd need to make changes in the specs too to accomodate an epic's regex as a valid sharding_key_issue_url.

@manojmj then I guess we could use the last issue in the epic as the sharding_key_issue_url: #467202

!163710 (merged)

Hello @mbobin Can you update the sharding_key_issue_url here with the opened issue? 🙇

@shubhamkrai !165781 (merged)

mentioned in epic gitlab-org#12323 (closed)

L Partition taggings table

Summary

Improvement steps

Risks

Involved components

Optional: Intended side effects

Optional: Missing test coverage

Designs

Child items 0

Activity

L Partition taggings table

Summary

Improvement steps

Risks

Involved components

Optional: Intended side effects

Optional: Missing test coverage

Relates to

Activity