Backfill namespace_ancestry for issues
What does this MR do?
This MR backfills the namespace_ancestry field for all issue documents in the Elasticsearch index. The logic for calculating the field content has been done in !68796 (merged).
Additionally, this MR extracts the logic for future backfills to Elastic::MigrationBackfillHelper and uses it for the BackfillNamespaceAncestryForIssues advanced search migration.
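As a rough illustration of the check such a backfill depends on (finding documents that do not have the field yet), here is a minimal sketch. The method name missing_field_query is hypothetical and is not taken from this MR or from Elastic::MigrationBackfillHelper; the only thing it relies on is Elasticsearch's standard exists query wrapped in bool/must_not.

```ruby
require 'json'

# Hypothetical helper for illustration only (not the code in this MR):
# builds an Elasticsearch query body matching documents that do NOT
# have the given field set.
def missing_field_query(field)
  {
    query: {
      bool: {
        must_not: [
          { exists: { field: field } }
        ]
      }
    }
  }
end

# Print the body that would be sent to a count or search request.
puts JSON.pretty_generate(missing_field_query('namespace_ancestry'))
```

A body like this could be passed to a count request against the issues index to see how many documents are still left to backfill.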
Migration
Note: Ensure you have Elasticsearch enabled and set up for GDK. It is important that the initial Elasticsearch setup is done on the default branch and not on this branch. Otherwise the migration will already be marked as "completed" and you will not be able to test it using the steps below.
- Open the Rails console.
- Run the migration using the background job: Elastic::MigrationWorker.new.perform and ElasticIndexInitialBulkCronWorker.new.perform to flush the changes to Elasticsearch.
- Follow along in the log/elasticsearch.log file:
  {"severity":"INFO","time":"2021-08-25T11:20:11.035Z","correlation_id":null,"message":"MigrationWorker: migration[BackfillNamespaceAncestryForIssues] executing migrate method"}
  {"severity":"INFO","time":"2021-08-25T11:20:11.043Z","correlation_id":null,"message":"[Elastic::Migration: 20210825110300] Checking if there are documents without namespace_ancestry field: 47427 documents left"}
  {"severity":"INFO","time":"2021-08-25T11:20:11.043Z","correlation_id":null,"message":"[Elastic::Migration: 20210825110300] Adding namespace_ancestry field to gitlab-development-issues documents for batch of 5000 documents"}
- Depending on the number of documents left, you will need to run Elastic::MigrationWorker.new.perform and ElasticIndexInitialBulkCronWorker.new.perform a few times until the migration is completed.
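Re-running both workers by hand can be wrapped in a small loop. The sketch below is an assumption-laden convenience, not part of this MR: run_until_completed is a hypothetical helper, and the commented console usage assumes that Elastic::DataMigrationService.migration_has_finished? is available in your checkout and accepts the migration name as a symbol.

```ruby
# Hypothetical helper: calls the given block repeatedly until `completed`
# returns true or `max_rounds` is reached. Returns the number of rounds run.
def run_until_completed(completed:, max_rounds: 20, pause: 0)
  rounds = 0
  until completed.call || rounds >= max_rounds
    yield
    rounds += 1
    sleep(pause) if pause.positive?
  end
  rounds
end

# Assumed usage in the Rails console (not executed here):
#
#   done = -> { Elastic::DataMigrationService.migration_has_finished?(:backfill_namespace_ancestry_for_issues) }
#   run_until_completed(completed: done, pause: 5) do
#     Elastic::MigrationWorker.new.perform
#     ElasticIndexInitialBulkCronWorker.new.perform
#   end
```

The max_rounds cap is there so the loop stops even if the migration never reports completion, for example when it is halted or was already marked as completed.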
Does this MR meet the acceptance criteria?
Conformity
- I have included changelog trailers, or none are needed. (Does this MR need a changelog?)
- I have added/updated documentation, or it's not needed. (Is documentation required?)
- I have properly separated EE content from FOSS, or this MR is FOSS only. (Where should EE code go?)
- I have added information for database reviewers in the MR description, or it's not needed. (Does this MR have database related changes?)
- I have self-reviewed this MR per code review guidelines.
- This MR does not harm performance, or I have asked a reviewer to help assess the performance impact. (Merge request performance guidelines)
- I have followed the style guides.
- This change is backwards compatible across updates, or this does not apply.
Availability and Testing
- I have added/updated tests following the Testing Guide, or it's not needed. (Consider all test levels. See the Test Planning Process.)
- I have tested this MR in all supported browsers, or it's not needed.
- [-] I have informed the Infrastructure department of a default or new setting change per definition of done, or it's not needed.
Security
Does this MR contain changes to processing or storing of credentials or tokens, authorization and authentication methods or other items described in the security review guidelines? If not, then delete this Security section.
- [-] Label as security and @ mention @gitlab-com/gl-security/appsec
- [-] The MR includes necessary changes to maintain consistency between UI, API, email, or other methods
- [-] Security reports checked/validated by a reviewer from the AppSec team
Related to #335825 (closed)