As seen in https://gitlab.com/gitlab-org/gitlab/-/jobs/6097005150, db:migrate:multi-version-upgrade-2 failed with ActiveRecord::StatementInvalid: PG::DuplicateColumn: ERROR: column "encrypted_token" of relation "chat_names" already exists
As such confirmed that approach works as expected.
Nailia Iskhakovamarked the checklist item Validate that data selected approach work - create DB dump for known problematic upgrade path and verify that there is a migration error as completed
marked the checklist item Validate that data selected approach work - create DB dump for known problematic upgrade path and verify that there is a migration error as completed
Explore if there is a way to seed environment with all possible fixtures
As part of testing options in this area I configured local GDK setup. Also using Docker image for 16.3 for additional validation in non-dev env (note that Docker image has tweaks from db:migrate:multi-version-upgrade-1 code.
As long as create or build with &:save are used - ActiveRecord validations are used and unhealthy record won't be written to DB
Notes so far from few crude tests:
When using GitLab Rails console
I created this crude code for cycling through all factories and creating them:
factories=FactoryBot.factoriesresults=factories.each_with_indexdo|factory,i|# Create a new instance for each factorybeginFactoryBot.create(factory.name)rescueException=>eputs"#{factory.name} caught exception #{e}!"elseputs"success #{factory.name}"ensureputs"Factory ##{i}"endend
It worked both in GDK (test with first 5 factories) and Docker image (run full command).
Some of the factories failed with PG::UniqueViolation: ERROR for example:
But the majority finished. When checking ActiveRecords - records are in place.
When running similar script to above as part of Seeder rake
moduleGitlabmoduleDataSeederclass<<self# Seed test data using GitLab Data Seeder# @param [String] seed_file the full-path of the seed file to load (.yml, .rb)defseed(owner,seed_file)....defseed_allFactoryBot.factoriesdo|factory|# Create a new instance for each factoryGitlab::AppLogger.debugfactory.nameFactoryBot.create(factory.name)endendend
The create part silently fails due to missing variables. For example for the first factory
FactoryBot.create(factory.name)NameError:uninitializedconstantGitlab::Pagination::Keyset::Orderfrom/GitLab/gitlab-development-kit/gitlab/config/initializers/active_record_keyset_pagination.rb:22:in`block in reverse_sql_order'
To be clarified.
Failures in rake. I'm not sure yet what is difference between Rails console and running above in Rake. It looks like something is missing.
In general though it might look like that this can be an option to seed everything and then also develop a custom Seeder config with designed data for granular control?
Maintenance concern - if Factory name changes -> our customer Seeder rb file config will start failing/not creating fixtures that uses old name. How to keep it up to date/catch this? (when we move data seeding to a separate project)
The odds of this are very low. Even if this were to happen, a global refactor across the entire codebase would be necessary. Since this seed file would reside in the gitlab-org/gitlab project, this should be picked up.
In my time of Rails development, though - I've not seen any change of factory name. It would require Deletion, or Change of name of a Model of the Rails base. (For instance, if we were to rename Merge Request to Pull Request. (create(:merge_request) => create(:pull_request))
Keep in mind that the seed file must be of this format, as documented here
Yes, I tried adding something like this originally for testing
# test_data.rbclassDataSeederdefseedFactoryBot.factories.first(10)do|factory|# Create a new instance for each factoryGitlab::AppLogger.debugfactory.nameFactoryBot.create(factory.name)endendend
It didn't seed and failed with the same uninitialized constant Gitlab::Pagination::Keyset::Order, I'm probably missing something fundamental here as I'm still reading up about how factories work
I pulled the latest master in my GDK today with gitlab-org/gitlab!137582 (merged) is merged. I'm seeing the same NameError: uninitialized constant errors.
The odds of this are very low. Even if this were to happen, a global refactor across the entire codebase would be necessary.
Got it, thanks for the additional context Good to know that concern should be less as long as seed file lives in the project. I was thinking that perhaps at some point we would want to maintain few different Seeder configs for upgrade testing specifically that will be stored outside of the main repo. Will give it a further think after I get up to speed with this area first.
In the https://gitlab.com/gitlab-org/gitlab/-/jobs/6163413795 we can see that there are 351 successfully created fixtures (search ....success) and 161 failed fixtures (caught exception) if running against 16.3.0 Docker image.
When running same seeder file against local GDK( 16.9.0-pre 3a666cdff3f) I've seen 150 out of 963 fixtures failed with errors.
Fixture errors
1. No such file or directory @ rb_sysopen
Most common error - No such file or directory @ rb_sysopen - /opt/gitlab/embedded/service/gitlab-rails/tmp/tests/gitlab-test.bundle!
Running scripts/gitaly-test-build which should build these bundles in GDK failed:
scripts/gitaly-test-build/Users/niskhakova/GitLab/gitlab-development-kit/gitlab/spec/support/helpers/gitaly_setup.rb:92:in`system': No such file or directory - make (Errno::ENOENT) from /Users/niskhakova/GitLab/gitlab-development-kit/gitlab/spec/support/helpers/gitaly_setup.rb:92:in `run_command' from /Users/niskhakova/GitLab/gitlab-development-kit/gitlab/spec/support/helpers/gitaly_setup.rb:96:in `build_gitaly'fromscripts/gitaly-test-build:21:in`run' from scripts/gitaly-test-build:42:in `<main>'
Also side note - I believe the error with above explains why MR seeded with bulk_data has missing branches:
In gitlab-test.bundle there is 0b4bc9a49b562e85de7cc9e834518ea6828729b9 refs/heads/feature ref - which should have been used in above MR.
2. PG::UniqueViolation: ERROR
Second common error. Most popular exception is for index_ci_namespace_mirrors_on_namespace_id - 41/45 cases.
Debugging:
To be done
3. Postgres factories related errors
Factory #206reindex_action caught exception PG::ObjectNotInPrerequisiteState: ERROR: cannot insert into view "postgres_indexes"DETAIL: Views that do not select from a single table or view are not automatically updatable.HINT: To enable inserting into the view, provide an INSTEAD OF INSERT trigger or an unconditional ON INSERT DO INSTEAD rule.!
I assume it's expected to fail against live envs so can be ignored?
@niskhakova As you've already found, gitaly_setup.rb will attempt to build gitaly from source unless the environment variable CI is set. That is why it is trying to run make. So for this to succeed you'll either need to add all the build tools to the docker image, or build gitaly separately like CI does and set the environment appropriately.
@proglottis many thanks for the clarification on this as well as the tip about bundle exec rspec spec/lib/backup/gitaly_backup_spec.rb that would initiate TestEnv.init in GDK Confirmed that when running this command, the bundle files were created!
The "normal" GDK way where spec_helper.rb calls TestEnv.init which then calls TestEnv.setup_forked_repo/TestEnv.setup_factory_repo that actually do the cloning/ref-setting/bundle-creating
@ddavison could you please expand on this? In this thread we're exploring how to seed all fixtures, and as found in #2354 (comment 1772988958) - there are some errors that doesn't allow to seed the environment. Resolving the errors allows fixtures to be seeded.
The data is the crucial point of catching migrations, we need a way to seed all possible data and looks like Data Seeder + fixtures is the solution here.
In general, I don't think there is concern about multi-version upgrades without building an environment because we're working on preparing PG dump - this is the only place where environment is being used, however the test itself doesn't need an env (since we import the dump). It will be pulling existing PG dump for specific upgrade stop and not rely on any environment.
27 UniqueViolation errors, out of which 23 are index_ci_namespace_mirrors_on_namespace_id.
Reproduced this in my freshly reinstalled GDK. For example rails console:
gdkrailsconsole...FactoryBot.create(:board_group_recent_visit)# factory is created=>#<BoardGroupRecentVisit:0x000000028223f508id: 1,created_at: Wed,28Feb202417:25:45.371008000UTC+00:00,updated_at: Wed,28Feb202417:25:45.371008000UTC+00:00,user_id: 70,board_id: 1,group_id: 96>FactoryBot.create(:board_group_recent_visit)# on second try failed with the same `index_ci_namespace_mirrors_on_namespace_id`ActiveRecord::RecordNotUnique:PG::UniqueViolation:ERROR:duplicatekeyvalueviolatesuniqueconstraint"index_ci_namespace_mirrors_on_namespace_id"DETAIL:Key(namespace_id)=(100)alreadyexists.FactoryBot.create(:board_group_recent_visit)# didn't fail on third time=>#<BoardGroupRecentVisit:0x0000000128e7ec48id: 2,created_at: Wed,28Feb202417:26:11.482766000UTC+00:00,updated_at: Wed,28Feb202417:26:11.482766000UTC+00:00,user_id: 73,board_id: 2,group_id: 102>
Not sure why rerunning the same command fixes this error.
Slightly changing seed_all_fixtures to retry on ActiveRecord::RecordNotUnique - resolved the issue with index_ci_namespace_mirrors_on_namespace_id:
defseed_all_fixturesFactoryBot.factories.each_with_indexdo|factory,i|retries||=0# Create a new instance for each factoryFactoryBot.create(factory.name)rescueActiveRecord::RecordNotUnique=>eputs"#{factory.name} caught exception #{e}! Attempt##{retries}"retryif(retries+=1)<3rescueException=>e# rubocop:disable Lint/RescueException -- catching all possible exceptions# We rescue exception here to make sure seeding proceeds unrelated to various unique exceptions from Factoriesputs"#{factory.name} caught exception #{e} of class #{e.class}!"elseputs"Successfully created Factory #{factory.name}"ensureputs"Factory ##{i+1}"endend