During the initial stages of the work item types implementation, we had a discussion in !84836 (comment 910369861) around how to solve the problem of some data that the app requires being missing from the DB.
If you are getting this error on your new instance
In this issue we have discussed whether we should be able to rely on data that is seeded into the database as part of the DB setup steps. The conclusion is that we should be able to do so, as expecting that some instances might not load seeds would force us to write defensive code in many places of the application.
If you are getting this error on an existing instance
This should be rare if you have followed the steps described in https://gitlab-com.gitlab.io/support/toolbox/upgrade-path. But if you run into this, please post a comment in this issue and we will help you with the resolution. The steps required for the fix might depend on the state of your data.
Problem
For some work item features to work, we need some records to exist in the work_item_types table. At the moment we have two mechanisms for this: (1) production fixtures and (2) the Issue model code.
Production fixtures (1) would be ideal as a single place to create this data only once for new instances, but there are a few reasons why this is not 100% reliable as described in #353552 (comment 901589414).
Issue model code (2) solved this issue for the most part in legacy code, since upserting the type information would happen the first time an issue of a given type was created. From that moment forward, on existing instances we use migrations to add new types or widget definitions.
Another caveat comes up in migrations, as described in !128307 (comment 1527736002). We currently write very defensive migrations designed not to fail if, for whatever reason, the type data in the DB is not as expected. But this comes with its own problems: work item type data might end up inconsistent for some instances, leading to errors that are hard to debug. So we should probably fail in migrations if we cannot migrate as expected. Since we upsert this data, the fix could be as simple as running a single line of code in the Rails console.
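For illustration, that single line would be something like the call below, which is essentially what the production fixture itself runs (an assumption about the exact class name; verify it against your GitLab version before running):

```ruby
# Re-runs the work item type seeding. Safe to repeat because it upserts.
# Class/method name assumed from db/fixtures/production/003_create_base_work_item_types.rb.
Gitlab::DatabaseImporters::WorkItems::BaseTypeImporter.upsert_types
```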
From the base_importer we can see that we upsert 9 rows in the work_item_types table and 99 rows in the work_item_widget_definitions table. We don't expect this number to grow too fast, but it definitely can if we introduce new features based on the work_items architecture.
work_item_types table
Each row in this table is upserted with values in 5 columns
name: text 255 char max
icon_name: text 255 char max
created_at: timestamp with time zone
updated_at: timestamp with time zone
base_type: smallint
work_item_widget_definitions table
Each row in this table is upserted with values in 3 columns
work_item_type_id: bigint
name: text
widget_type: smallint
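For reference, here is a rough sketch of the columns listed above in Rails migration DSL. This is illustrative only and not the actual GitLab migration; in the real schema the 255-character limits are enforced with separate text limit constraints.

```ruby
# Illustrative sketch only, based on the column lists above.
class SketchWorkItemTypeTables < ActiveRecord::Migration[7.0]
  def change
    create_table :work_item_types do |t|
      t.text :name                     # limited to 255 characters in the real schema
      t.text :icon_name                # limited to 255 characters in the real schema
      t.timestamps                     # created_at / updated_at (timestamp with time zone in the real schema)
      t.integer :base_type, limit: 2   # smallint
    end

    create_table :work_item_widget_definitions do |t|
      t.references :work_item_type, null: false  # bigint foreign key
      t.text :name
      t.integer :widget_type, limit: 2            # smallint
    end
  end
end
```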
Possible solutions
Can there be an initialization step that checks for the presence of the required data? Rails initializers might not be possible as described in !84836 (comment 912330965)
Check for the types' presence where needed, as described in !84836 (comment 920396233). This check might have to run very often on instances like .com, for example every time a new issue is created (as we need to list the work item types before creation). It might not be a big deal if we use some sort of Redis caching mechanism?
Thought: would it be better if we could assume everywhere in the application that the types exist in the DB? Doing existence checks before building a feature might not be ideal, as it requires developers to know about this particularity.
@gitlab-org/plan-stage/backend-engineers I'd really appreciate your input here. I think this discussion is long overdue, and I hope we can come up with a final solution that is more appropriate than what we currently have. As new teams start creating new work item types, we should be more certain about the right way to do this.
cc @mayra-cabrera @mkaeppler @tkuah since you have been involved in some of this discussion, I'd really appreciate your input
In my opinion, we should set reasonable expectations for when any application, including GitLab, operates correctly. Database migrations are absolutely critical to lay out the structure (and perhaps, as in this case, data) for GitLab to work.
I think it is fair to say that other parts of GitLab would fall apart if a customer upgrades GitLab, but does not run migrations, so I am not sure the arguments in this comment mean we shouldn't use migrations for this.
A lazy-insertion approach leads to defensive programming where we constantly need to second-guess whether some data the application needs exists or not. It should be handled during bootstrapping instead.
That said: is there a way to tackle this from the opposite end, i.e. why are customers even able to disable something like migrations? Specifically:
Omnibus didn't seed the database since auto_migrate was disabled.
Why would we even allow this or what motivation would a customer have to not run migrations? Aren't we shooting ourselves in the foot with this switch? My expectation would be that GitLab won't function correctly unless migrations run so it's a little like giving someone the option to take the steering wheel off and wishing them a safe journey.
Someone forgot to run gitlab-rake db:migrate.
Related to 1: this should not even be necessary outside of some sort of recovery/incident scenario.
Omnibus failed during install, preventing the seeds from running.
Fair point -- sounds like an edge case though, and why would I assume anything should work if installation failed?
I see we have multiple upvotes on the comment above by @mkaeppler. I think making the assumption that this data exists in the DB will definitely make our code a lot cleaner, and we can definitely be less defensive in migrations that add new types or widget definitions. I think it would even make more sense for migrations to fail if we find data in an unexpected state, as this might otherwise cause errors that are hard to debug.
So, if you all agree on this, what should we do to move down this path? I guess we should improve documentation in places where we might see edge cases like the ones discussed above. Probably make it clear everywhere that seeding the DB is essential for the app to work properly. Maybe even add some kind of initialization step that checks all fixtures have been seeded? We can probably have some value in the DB to indicate that it has been seeded successfully?
Wasn't one of the problems with using migrations that we eventually compress migrations down, or set a new starting point? I think data migrations (such as adding a new type) don't get preserved in those cases. I think that's one reason we didn't do that originally. Or am I mis-remembering this?
Yes, @digitalmoksha, we definitely have to do both. Migrations are necessary for instances that have already upserted the required types and definitions, and we always need to write a migration if the structure of the required data changes.
But for new instances, migrations are not run. In that case we probably do (and someone can confirm, as I'm not sure how Omnibus or other types of installs work) rake db:create db:schema:load and then seed the DB. And of course, as you mentioned, migrations get squashed (not sure how frequently), so in this scenario we also rely on the data we seed (currently through the seed-fu gem).
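To make that fresh-install order concrete, here is a minimal sketch of the flow as I understand it (an assumption, not the exact Omnibus code path):

```ruby
# CLI equivalent: bundle exec rake db:create db:schema:load db:seed_fu
Rails.application.load_tasks

Rake::Task['db:create'].invoke       # create the database
Rake::Task['db:schema:load'].invoke  # load db/structure.sql (SQL schema format) and mark all migrations as "up"
Rake::Task['db:seed_fu'].invoke      # run db/fixtures/<env>/*, including the work item type fixture
```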
It is not clear whether running that seeding step is optional, or what the impact is of not running the step completely.
So I agree we need to
Rely on seeding, and migrations.
Clearly document what seeding does, what gets created, and what errors will arise if seeding is not run
Fail early when any expected base item is not present. I am not sure if we can do this immediately when new items are added to production seeds; we may need to gradually introduce this at major milestones instead.
About migration squashing/cleanup: don't we have checkpoint releases to deal with just that? So that if I want to upgrade from 14 to 16 say, then I can't just do that. I have to go through N intermediate versions first so I actually go through all of the migrations.
About migration squashing/cleanup: don't we have checkpoint releases to deal with just that? So that if I want to upgrade from 14 to 16 say, then I can't just do that. I have to go through N intermediate versions first so I actually go through all of the migrations.
That's a great point.
So we could still rely on migrations with DML for seeding the database, as long as we ensure that we force a checkpoint release before we squash, and ensure we don't add any more DML migrations between the checkpoint release and the squash.
The safest way to do this would probably be to immediately squash after each checkpoint release as a matter of process.
Which is probably good anyway, to keep down the number of migrations, since the more migrations we have, the slower each fresh DB setup is (including all CI jobs).
I'm not sure if this setting will actually prevent seeding the DB for new installs. From what I have read so far, it seems that it only prevents migrations from running when upgrading GitLab versions?
So I don't think there is a way for a new GitLab install to avoid seeding, if you set up the DB structure properly?
About migration squashing/cleanup: don't we have checkpoint releases to deal with just that? So that if I want to upgrade from 14 to 16 say, then I can't just do that. I have to go through N intermediate versions first so I actually go through all of the migrations.
Right, @mkaeppler, that is an important point, but as you said I guess we are safe if we don't allow skipping major version upgrades.
So we could still rely on migrations with DML for seeding the database, as long as we ensure that we force a checkpoint release before we squash, and ensure we don't add any more DML migrations between the checkpoint release and the squash.
@cwoolley-gitlab I think we can, for any instance that is not brand new. For these, we always have to write migrations since the seeds will not be run in every upgrade, will they? But for new installs, migrations will not run as I said above: we will simply load the structure of the DB and then seed it, marking all migrations as run without actually running them.
So, unless I'm missing something, I'm not sure how #353552 (comment 901589414) happened. Probably just a failure during the setup steps that didn't seed the DB successfully?
I think we can, for any instance that is not brand new. For these, we always have to write migrations since the seeds will not be run in every upgrade
Ah, that's right. So for new instances, we can't rely on migrations; we still have to get the DML run to add all the necessary seed data.
So apologies for my unfamiliarity and jumping in with questions, but I'm trying to understand how this works in GitLab.
My question is: is seed_fu the right answer here for all DML seeding in all possible combinations of scenarios:
new instances
migrations of existing instances
in all environments, dev/test/prod
I think that may just be restating the intent of this issue, but I don't know what the answer is based on the comments above, and can't tell by looking at the various db:seed_fu hits when searching the codebase.
My question is: is seed_fu the right answer here for all DML seeding in all possible combinations of scenarios:
new instances
migrations of existing instances
in all environments, dev/test/prod
@cwoolley-gitlab That's a good way to summarize it, and I think the answer is a bit different for each of the items you listed:
new instances: These will create the DB by loading the structure in structure.sql files and then use seed_fu to seed the data for the first time. Migrations won't be run for new instances.
migrations of existing instances: These won't seed the DB, so we rely on the current state of data in the DB and migrations that we have written since the time the instance was set up.
in all environments, dev/test/prod: dev/prod environments will use a combination of the items described above (seeds and migrations for first-time installs and subsequent upgrades). For specs, we seed the DB before the suite in https://gitlab.com/gitlab-org/gitlab/-/blob/3652f8cfc2629a76380d1231f8255054971e5e9f/spec/spec_helper.rb#L253 and also prevent migration/deletion specs from dropping those tables. But there are some pipeline jobs that might need updating so the initial state of the DB is similar to that of a prod or dev app, as discussed in !128307 (comment 1525679811)
new instances: These will create the DB by loading the structure in structure.sql files and then use seed_fu to seed the data for the first time. Migrations won't be run for new instances.
migrations of existing instances: These won't seed the DB, so we rely on the current state of data in the DB and migrations that we have written since the time the instance was set up.
So, based on the two above points, it seems like we need to have a clearly-defined process for handling DML/seed data, and specifically how to handle it when we squash migrations:
All DML seed data must start out in a migration (so it gets applied to existing instances)
As part of the process to squash migrations, all DML representing seed data in the squashed migrations must be moved to seed_fu (so it will be applied to future new instances)
Does that sound correct?
If so, do we currently have the details of this process documented?
Or do we have any docs around the migration squashing process at all?
I did a quick search and this was the only reference I found, but it's kind of a hard search to perform because of false hits on squash as related to git.
@cwoolley-gitlab I think we should always create migrations for existing installs, but I don't think we should wait for the migrations to be squashed before we add the required data to the fresh install seeds. I think the seeds should always be in sync with the state of the data we expect to have from the migrations we write, given that new installs won't run the migrations but simply load the structure and then run the seeds.
If so, do we currently have the details of this process documented?
From what I have found, I don't think this is properly documented; it's only in this issue that many have come to an agreement that we can/should rely on seed data, as expressed in #423483 (comment 1533122991). So I think updating the docs to reflect what has been discussed here is precisely what the outcome of this issue should be.
Or do we have any docs around the migration squashing process at all?
I haven't found anything else myself, so this is something that should also be mentioned when we update the docs.
One more thing we need to keep in mind/document is how we should handle this seed data, or how to provide users a quick way to fix data in the DB by following some steps. Specifically talking about work_item_types in the DB, we always upsert the types, so it's safe to seed many times and the end result will always be the same. But, I'm not sure if this is true for all of the seed data. Anyway, perhaps this is not a problem since we expect seeds to be run only once for fresh install. For example, a way to fix inconsistent work item types would be as simple as running this same line in a console: https://gitlab.com/gitlab-org/gitlab/-/blob/999757b61c3861420b7c7174112e94a75f749027/db/fixtures/production/003_create_base_work_item_types.rb#L4
So it looks like lib/tasks/gitlab/db/migration_squash.rake is the only canonical source of truth on how we squash migrations. I could not find any references in code/docs/handbook either.
So yes, the squash process could definitely use some docs.
@cwoolley-gitlab I think we should always create migrations for existing installs, but I don't think we should wait for the migrations to be squashed before we add the required data to the fresh install seeds. I think the seeds should always be in sync with the state of the data we expect to have from the migrations we write, given that new installs won't run the migrations but simply load the structure and then run the seeds.
Yes, this should be true. One thing I didn't notice at first glance (docs would help here) is that there are two files:
db/init_structure.sql - used by the first squashed migration, and presumably does NOT contain any schema DDL for unsquashed migrations
db/structure.sql - DOES contain full DDL of all migrations, squashed and unsquashed, and is used by clean installs, db:test:prepare, etc.
It might be more intuitive to rename db/init_structure.sql to db/squashed_migrations_structure.sql or something.
Specifically talking about work_item_types in the DB, we always upsert the types, so it's safe to seed many times and the end result will always be the same. But, I'm not sure if this is true for all of the seed data. Anyway, perhaps this is not a problem since we expect seeds to be run only once for fresh install.
Are we sure we can count on this?
It seems safer to enforce a rule that DML in both migrations and seed data will always be upserts, so either one may be run idempotently multiple times, in any order, and still always result in the identical seed data records in the DB.
@cwoolley-gitlab At the moment, running the same version of the seed will simply upsert, so yes. Of course, if a different version of the seeder, say, removes an entry or renames it, the old record with the different name will remain in the DB. But of course, this is something we handle with migrations for existing installs.
It seems safer to enforce a rule that DML in both migrations and seed data will always be upserts, so either one may be run idempotently multiple times, in any order, and still always result in the identical seed data records in the DB.
Yes, I agree we should enforce this somehow. Perhaps a RuboCop rule that prevents methods like create, new, or save on models, in favor of methods like upsert_all as the only way to create these records. Of course, we should properly document this too.
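For illustration, the kind of idempotent write such a rule would mandate looks something like the sketch below. The rows and the unique key are hypothetical, not the actual importer code:

```ruby
# Idempotent seed write: repeated runs converge on the same rows because the
# write keys on a unique column instead of inserting blindly.
now = Time.current
rows = [
  { name: 'Issue', base_type: 0, icon_name: 'issue-type-issue', created_at: now, updated_at: now },
  { name: 'Task',  base_type: 4, icon_name: 'issue-type-task',  created_at: now, updated_at: now }
]

# unique_by assumes a unique index on base_type; adjust to the real constraint.
WorkItems::Type.upsert_all(rows, unique_by: :base_type)
```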
Hello everyone! So I haven't been paying attention to this one in a while, but it turns out we found another reason to get rid of the code that "makes sure" certain data exists in the DB before performing an action (creating an issue, for example).
We had a discussion recently on https://gitlab.slack.com/archives/C3NBYFJ6N/p1714539272456519 (internal only) around this and found another reason to get rid of it. I have created Don't upsert work item types if not found in th... (!151817 - merged) to ship the change behind a feature flag (enabled by default), just so we can ask users to disable the FF in the event they run into problems like the ones we discussed in the past (though we'd probably point them in the right direction, which would be fixing the install errors that prevented the seeds from running). With the flag shipped in 17.0, we might be able to further analyze what is going wrong with some installs.
This issue will remain open until we make the doc updates discussed in the thread.
FYI Don't upsert work item types if not found in th... (!151817 - merged) was merged and will be released in 17.0 behind a default-enabled FF. I have added a much clearer error message for when the types are not seeded. We can probably remove the flag in 17.1 once we know how this has affected some new installs (hopefully the error message will be enough for them to solve the problem).
The error message has a link to this issue. But in the past we have gotten reports in other issues that people open. Hopefully they will go straight here this time if necessary
Are there any details on what to do to fix this when getting this error on an existing install? The guidance on this ticket only seems to reference what to do for new installs - unless I have missed something. I noticed this issue today when trying to create the first issue on a project.
We are running GitLab: 17.4.2-ee (e85e7bae) EE (Installed via the gitlab-ee package on Ubuntu 20.04)
When clicking "New issue" on a project (Which doesn't have any existing issues open) I get an error 500 and this is logged:
WorkItems::Type::DEFAULT_TYPES_NOT_SEEDED (Default work item types have not been created yet. Make sure the DB has been seeded successfully. See related documentation in https://docs.gitlab.com/omnibus/settings/database.html#seed-the-database-fresh-installs-only. If you have additional questions, you can ask in https://gitlab.com/gitlab-org/gitlab/-/issues/423483):
Hey, @joolswills! Sorry I missed your original message. I wonder how you ended up with those types. It's very weird that you only have the newer ones and that Objective has id 1
Does that mean you don't have any records in the issues table with the issue type? Because there's a FK constraint on issues.work_item_type_id, I don't see any other way. Can you confirm you have the FK constraint on the column?
Because you are running e85e7bae, it's safe to upsert the types and related records, and everything should be back to normal (we cannot do this in 17.5 anymore, as we are now explicit about the IDs, so fixing the records might be a bit different). In a Rails console, you can run the commands below to make sure the work item types are in the state they should be in.
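The commands in question would be something like the following. The class names are assumed from the production fixtures, so verify them against your GitLab version before running:

```ruby
# Re-seed the default work item types plus their related restriction records.
# All of these upsert, so running them against existing data should be safe.
Gitlab::DatabaseImporters::WorkItems::BaseTypeImporter.upsert_types
Gitlab::DatabaseImporters::WorkItems::HierarchyRestrictionsImporter.upsert_restrictions
Gitlab::DatabaseImporters::WorkItems::RelatedLinksRestrictionsImporter.upsert_restrictions
```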
Let me know if that works, as fixing the install should be the priority, but then I'd love it if we could discuss a bit more how this might have happened.
UPDATE:
Are there any details on what to do to fix this when getting this error on an existing install? The guidance on this ticket only seems to reference what to do for new installs
I will update the description to mention this, but for existing installs there might not be a single or unique way to fix it as I said above.
Just before you replied, I managed to fix a copy of our install that I was debugging on. I think my fix does a similar thing to your instructions above, but I will test. I temporarily reverted the changes to app/models/work_items/type.rb from 9c64ea2c so the items were added on issue creation.
I'm glad you resolved it, @joolswills. May I ask what version you upgraded from? Also, do you know what the first version of your install was? The first types should have been created in new instances in 14.3.0 and in existing instances in 14.2.0. In any version after those, you should get the types populated during the seeding process, and up until 9c64ea2c you would also get the types upserted if for any reason you didn't have them already.
I thought I updated from 17.3.5 to 17.4.2, but looking at the logs I went from 17.3.3 to 17.4.2 (which was a mistake). I hope I haven't caused any other issues by skipping 17.3.5.
I am not sure of the first version, but it would have been the current version from around Feb 2022 (when the admin user was created). The install has been running for some time, but has only recently had more use (hence no issues being created until now).
@nanjingfm1 do you know if the DB seeds were executed as part of the installation process?
Those records look good, but there are others that you might have missed if the seeds were not run. At least for work item types, I would advise that you run the following commands in a Rails console (they should be safe to run with the records you already created)
We've just experienced the same bug. We aren't entirely sure when this happened, but we noticed it while running the GitLab Performance Tool; it failed after importing the first large project because it couldn't import the roughly 6k issues.
We've deployed GitLab Ultimate in a 3k reference architecture on-prem with the GitLab Environment Toolkit, upgraded from 17.6.0 to 17.6.1 a few weeks ago, and yesterday to 17.6.2.
The upgrades were deployed using the GET Ansible suite as well.
We've found this issue and ran the following command in the gitlab-rails console
@cpelzer thank you for reporting this. Running those 3 commands on the console should be enough to fix this problem. What installation method did you use with the GitLab Environment Toolkit? I'm not sure if there's an alternative to using Omnibus.
Just to be clear, this was a brand new installation, correct?
There are multiple methods, but we have set it up with the GitLab Debian packages on VMs running on Proxmox (that's the way GET sets it up on-prem).
The installation itself is still a PoC environment and was set up earlier this year, and since then it has gone through two upgrades to newer patch versions.
And yes, the three commands easily fixed the problems!
There is a Rake task (gitlab:db:configure) which will perform all database migrations (except batched background migrations), including seeding. It will only seed the database if the schema is empty. That rake task is called as part of gitlab-ctl reconfigure, assuming that automatic database migrations have not been disabled.
There should never be a reason to manually seed the database.
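Sketching the behavior described above in simplified form (an assumption based on this description, not the actual task source, which lives under lib/tasks/gitlab/):

```ruby
# gitlab:db:configure, roughly: migrate when tables already exist,
# otherwise load the schema and run the seeds.
if ActiveRecord::Base.connection.tables.any?
  Rake::Task['db:migrate'].invoke
else
  Rake::Task['db:schema:load'].invoke
  Rake::Task['db:seed_fu'].invoke
end
```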
So, since we're still in a PoC state, I just removed the entire GitLab deployment, re-deployed it using GET with version 17.8.1 as a fresh installation, and ran into the same problem.