Reindex GitLab.com Global Search Elasticsearch cluster main index

Production Change

Change Summary

We have a list of changes that we want to apply to the GitLab.com main Advanced Search index.

These changes can be applied by reindexing the index.

Change Details

  1. Services Impacted - Elasticsearch global search
  2. Change Technician - @dgruzd (EMEA) @john-mason (AMER)
  3. Change Reviewer - @terrichu
  4. Time tracking - 2880m
  5. Downtime Component - No downtime, but Advanced Search indexing will be paused for the duration of reindexing

Detailed steps for the change

Pre-Change Steps - steps to be completed before execution of the change

Estimated Time to Complete (mins) - 30m

Change Steps - steps to take to execute the change

Estimated Time to Complete (mins) - 2880m

  1. Add a silence via https://alerts.gitlab.net/#/silences/new with matchers on env="gprd", alertname="SearchServiceElasticsearchIndexingTrafficAbsent", alertname="gitlab_search_indexing_queue_backing_up", and alertname="SidekiqServiceGlobalSearchIndexingApdexSLOViolation". Link the comment field back to the Change Request Issue. => https://alerts.gitlab.net/#/silences/c69d2b4e-72e4-44b8-b504-baa52d7fd0d9
  2. Let SRE on call know that we are triggering the re-index in #production: @sre-oncall please note we are doing a reindex of one of our production Elasticsearch cluster indices which will re-index all of our main index to another index in the same cluster using the Elasticsearch reindex API. During the reindex we’ll be pausing indexing to the cluster which will cause the incremental updates queue to grow. We have added a silence for the SearchServiceElasticsearchIndexingTrafficAbsent alert. This will increase load on the Elasticsearch cluster but should not impact any other systems. https://gitlab.com/gitlab-com/gl-infra/production/-/issues/6116
  3. Pause indexing writes: ApplicationSetting.current.update!(elasticsearch_pause_indexing: true) => https://gitlab.slack.com/archives/C101F3796/p1666609373933799?thread_ts=1666608961.045649&cid=C101F3796
  4. In any console, set CLUSTER_URL and confirm that it is the expected cluster with the expected indices:
    1. curl $CLUSTER_URL/_cat/indices
  5. Note the total size of the source gitlab-production-202012160000 index:
    1. 6.85TB
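    2. One way to check the size, assuming the _cat API is reachable on the cluster, is: curl "$CLUSTER_URL/_cat/indices/gitlab-production-202012160000?v&h=index,pri,docs.count,store.size"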
  6. Ensure there's enough capacity for a copy of the gitlab-production index. Note the free space for the cluster:
    1. 43311989678080 bytes free (43.3TB)
    2. If there isn't enough capacity, add data nodes to the cluster using Elastic Cloud console and wait until resizing is completed
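    3. A possible way to check the free disk space per data node, again assuming the _cat API is available: curl "$CLUSTER_URL/_cat/allocation?v&h=node,disk.avail,disk.used,disk.total"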
  7. Take a screenshot of the index advanced metrics for the last 30 days and the last 7 days, and attach it to a comment on this issue
    1. #6116 (comment 1146240300)
  8. Note the total number of documents in the source gitlab-production-202012160000 index:
    1. curl $CLUSTER_URL/gitlab-production-202012160000/_count
    2. {"count":826555716,"_shards":{"total":120,"successful":120,"skipped":0,"failed":0}}
  9. Create the new destination index with the correct settings/mappings from the Rails console:
    1. Gitlab::Elastic::Helper.new.create_empty_index(with_alias: false)
  10. Replace the gitlab-production-NEW_INDEX_SUFFIX index names below with the newly created index name
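    1. If the new index name needs to be looked up, it should appear in the index listing, e.g. (assuming the _cat API is available): curl "$CLUSTER_URL/_cat/indices/gitlab-production-*?v&s=creation.date"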
  11. Confirm the newly created index has new mappings
    1. curl $CLUSTER_URL/gitlab-production-20221024-1119/_settings
  12. Set index settings in destination index to optimize for writes:
    1. curl -XPUT -d '{"index":{"number_of_replicas":"0","refresh_interval":"-1","translog":{"durability":"async"}}}' -H 'Content-Type: application/json' "$CLUSTER_URL/gitlab-production-20221024-1119/_settings"
  13. Increase recovery max bytes to speed up replication:
    1. curl -H 'Content-Type: application/json' -d '{"persistent":{"indices.recovery.max_bytes_per_sec": "400mb"}}' -XPUT $CLUSTER_URL/_cluster/settings
  14. Trigger re-index from source index gitlab-production-202012160000 to destination index gitlab-production-20221024-1119
    1. curl -H 'Content-Type: application/json' -d '{ "conflicts": "proceed", "source": { "index": "gitlab-production-202012160000" }, "dest": { "index": "gitlab-production-20221024-1119" } }' -X POST "$CLUSTER_URL/_reindex?slices=auto&wait_for_completion=false&timeout=72h"
  15. Note the returned task ID from the above: kbL5gZn-RKi2F3k_IaHQzA:3540867
  16. Note the time when the task started: 2022-10-24 12:04 UTC
  17. Track the progress of reindexing using the Tasks API: curl $CLUSTER_URL/_tasks/$TASK_ID
    1. If failures happen only in some slices, it is possible to retry just those slices, following the steps used last time
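    2. A small sketch for pulling out just the progress counters, assuming jq is installed on the machine running curl: curl -s "$CLUSTER_URL/_tasks/$TASK_ID" | jq '.task.status | {total, created, updated, deleted, version_conflicts}'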
  18. Note the time when the task finishes: 2022-10-26 23:25 UTC
  19. Note the total time taken to reindex: 3561 minutes (just under 60 hours)
  20. Change the refresh_interval and number_of_replicas settings on the destination index back to the defaults
    1. curl -XPUT -d '{"index":{"number_of_replicas":"1", "refresh_interval": null}}' -H 'Content-Type: application/json' "$CLUSTER_URL/gitlab-production-20221024-1119/_settings"
  21. Verify that the number of documents in the destination index matches the number of documents in the source index
    1. Be aware it may take up to 60s for the destination index to refresh
    2. curl $CLUSTER_URL/gitlab-production-202012160000/_count => {"count":826555716,"_shards":{"total":120,"successful":120,"skipped":0,"failed":0}}
    3. curl $CLUSTER_URL/gitlab-production-20221024-1119/_count => {"count":826555716,"_shards":{"total":200,"successful":200,"skipped":0,"failed":0}}
  22. Wait for cluster monitoring to show the replication has completed
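    1. One way to watch this from a console, as an alternative to the monitoring UI, is to wait for the index health to reach green: curl "$CLUSTER_URL/_cluster/health/gitlab-production-20221024-1119?wait_for_status=green&timeout=60s"; ongoing shard recoveries can also be listed with curl "$CLUSTER_URL/_cat/recovery/gitlab-production-20221024-1119?v&active_only=true"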
  23. Set recovery max bytes back to default
    1. curl -H 'Content-Type: application/json' -d '{"persistent":{"indices.recovery.max_bytes_per_sec": null}}' -XPUT $CLUSTER_URL/_cluster/settings
  24. Set translog durability on the destination index back to the default (request):
    1. curl -XPUT -d '{"index":{"translog":{"durability":"request"}}}' -H 'Content-Type: application/json' "$CLUSTER_URL/gitlab-production-20221024-1119/_settings"
  25. Note the size of the destination index gitlab-production-20221024-1119 index: 5.1 TB
  26. Update the alias gitlab-production to point to the new index
    1. curl -XPOST -H 'Content-Type: application/json' -d '{"actions":[{"add":{"index":"gitlab-production-20221024-1119","alias":"gitlab-production"}}, {"remove":{"index":"gitlab-production-202012160000","alias":"gitlab-production"}}]}' $CLUSTER_URL/_aliases
  27. Confirm it works: curl $CLUSTER_URL/gitlab-production/_settings
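    1. The alias target can also be checked directly, assuming the _cat API is available: curl "$CLUSTER_URL/_cat/aliases/gitlab-production?v"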
  28. Verify that there are no code search regressions
  29. Unpause indexing writes: ApplicationSetting.current.update!(elasticsearch_pause_indexing: false) => https://gitlab.slack.com/archives/C101F3796/p1666828526346619
  30. Wait until the backlog of incremental updates gets below 10,000
    1. Chart Global search incremental indexing queue depth https://dashboards.gitlab.net/d/sidekiq-main/sidekiq-overview?orgId=1
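    2. The queue depth can also be checked from a Rails console; a minimal sketch, assuming this helper is available in the deployed GitLab version: Elastic::ProcessBookkeepingService.queue_size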
  31. Create a file somewhere, then search for it to ensure indexing still works (it can take up to 2 minutes before it shows up in the search results)
    1. https://gitlab.com/gitlab-org/search-team/test-project/-/blob/71fd8fa84a462cc72da31762408f59b792b6ac64/searchable.txt#L30
    2. Found via https://gitlab.com/search?group_id=9970&repository_ref=master&scope=blobs&search=searchablecomment5
  32. Remove the alert silences https://alerts.gitlab.net/#/silences/c69d2b4e-72e4-44b8-b504-baa52d7fd0d9 and https://alerts.gitlab.net/#/silences/6df29b7f-c2b2-416e-896f-554e5a622f00
  33. Delete the previous index gitlab-production-202012160000 when it is safe to do so
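    1. A hedged example of the deletion command (double-check the index name first, as this is irreversible): curl -XDELETE "$CLUSTER_URL/gitlab-production-202012160000"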

Post-Change Steps - steps to take to verify the change

Estimated Time to Complete (mins) - 1m

  1. Set the change::complete label: /label ~change::complete

Rollback

Rollback steps - steps to be taken in the event of a need to rollback this change

Estimated Time to Complete (mins) - 60

  1. If the ongoing reindex is consuming too many resources, it is possible to throttle the running reindex:
    1. You can check the index write throughput in ES monitoring to determine a sensible throttle. Since it defaults to no throttling at all, it's safe to set some throttle and observe the impact
    2. curl -XPOST "$CLUSTER_URL/_reindex/$TASK_ID/_rethrottle?requests_per_second=500"
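    3. Once load allows, the throttle can be removed again by setting requests_per_second to -1 (Elasticsearch's value for "unthrottled"): curl -XPOST "$CLUSTER_URL/_reindex/$TASK_ID/_rethrottle?requests_per_second=-1"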
    • If the alias update step has already been executed, simply switch the alias back to point to the original index
      1. curl -XPOST -H 'Content-Type: application/json' -d '{"actions":[{"add":{"index":"gitlab-production-202012160000","alias":"gitlab-production"}}, {"remove":{"index":"gitlab-production-20221024-1119","alias":"gitlab-production"}}]}' $CLUSTER_URL/_aliases
      2. Confirm it works: curl $CLUSTER_URL/gitlab-production/_count
    • Ensure that any updates which only went to the destination index are replayed against the source index: search the logs for the updates (https://gitlab.com/gitlab-org/gitlab/-/blob/e8e2c02a6dbd486fa4214cb8183d428102dc1156/ee/app/services/elastic/process_bookkeeping_service.rb#L23) and trigger them again using ProcessBookkeepingService#track, as well as any updates that went through the Sidekiq workers ElasticCommitIndexerWorker and ElasticDeleteProjectWorker; see the sketch below.
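      1. A rough sketch of replaying a missed update from a Rails console, assuming the standard entry points and with the IDs below standing in for whatever is found in the logs: Elastic::ProcessBookkeepingService.track!(Issue.find(ISSUE_ID)) for database records, and ElasticCommitIndexerWorker.perform_async(PROJECT_ID) for repository/commit updates.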

Monitoring

Key metrics to observe

Summary of infrastructure changes

  • Does this change introduce new compute instances?
  • Does this change re-size any existing compute instances?
  • Does this change introduce any additional usage of tooling like Elasticsearch, CDNs, Cloudflare, etc?

Summary of the above

Change Reviewer checklist

C4 C3 C2 C1:

  • The scheduled day and time of execution of the change is appropriate.
  • The change plan is technically accurate.
  • The change plan includes estimated timing values based on previous testing.
  • The change plan includes a viable rollback plan.
  • The specified metrics/monitoring dashboards provide sufficient visibility for the change.

Change Technician checklist

  • This issue has a criticality label (e.g. C1, C2, C3, C4) and a change-type label (e.g. change::unscheduled, change::scheduled) based on the Change Management Criticalities.
  • This issue has the change technician as the assignee.
  • Pre-Change, Change, Post-Change, and Rollback steps have been filled out and reviewed.
  • This Change Issue is linked to the appropriate Issue and/or Epic.
  • Necessary approvals have been completed based on the Change Management Workflow.
  • Change has been tested in staging and results noted in a comment on this issue.
  • A dry-run has been conducted and results noted in a comment on this issue.
  • SRE on-call has been informed prior to change being rolled out. (In #production channel, mention @sre-oncall and this issue and await their acknowledgement.)
  • Release managers have been informed (If needed! Cases include DB change) prior to change being rolled out. (In #production channel, mention @release-managers and this issue and await their acknowledgment.)
  • There are currently no active incidents.