# Roll out queue-per-shard to workers of shard `elasticsearch`

## Introduction
The Scalability team is migrating from the queue-per-worker to the queue-per-shard strategy, so that all workers in the same shard share the same queue. This is an attempt to reduce daily peak CPU saturation on redis-sidekiq. To achieve this goal, we applied the queue routing rules mechanism to determine the destination queue of a particular job when it is scheduled. This mechanism is a drop-in replacement for the queue selector. We already applied and rolled it out for the `default` and `mailers` queues on production (see #1073 (closed) for more information).
We would like to continue to roll out this mechanism for all workers of elasticsearch shard. Some notable information about this shard:
- The `elasticsearch` shard stays on a Kubernetes cluster.
- The queue selector configuration is located here.
- Queue selector rule: `feature_category=global_search&urgency=throttled`
- Workers in this shard, filtered by the queue selector rule and compared with the logs of completed jobs by shard:
| Worker | Feature Category | Current Queue | Maintaining Group | 7-day job completions |
|---|---|---|---|---|
| ElasticClusterReindexingCronWorker | global_search | cronjob:elastic_cluster_reindexing_cron | Global Search | 1,008 |
| ElasticIndexBulkCronWorker | global_search | cronjob:elastic_index_bulk_cron | Global Search | 10,079 |
| ElasticIndexInitialBulkCronWorker | global_search | cronjob:elastic_index_initial_bulk_cron | Global Search | 10,080 |
| Elastic::MigrationWorker | global_search | cronjob:elastic_migration | Global Search | 336 |
| ElasticCommitIndexerWorker | global_search | elastic_commit_indexer | Global Search | 10,080,127 |
| ElasticDeleteProjectWorker | global_search | elastic_delete_project | Global Search | 33,349 |
- Prometheus metrics: https://dashboards.gitlab.net/d/sidekiq-shard-detail/sidekiq-shard-detail?orgId=1&var-PROMETHEUS_DS=Global&var-environment=gprd&var-stage=main&var-shard=elasticsearch
- Kibana logs: https://log.gprd.gitlab.net/goto/9ea13456cbdb3aa851b7bf688a19abe9
- Sentry events: https://sentry.gitlab.net/gitlab/gitlabcom/?query=is%3Aunresolved+type%3Asidekiq+shard%3Aelasticsearch
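For illustration, a routing rule mapping this shard's workers to a single per-shard queue might look like the following in `config/gitlab.yml`. This is a hedged sketch only: the exact keys and values should be checked against the queue routing rules documentation and the actual queue selector configuration linked above.

```yaml
production:
  sidekiq:
    routing_rules:
      # Route all throttled global_search workers to one per-shard queue.
      - ["feature_category=global_search&urgency=throttled", "elasticsearch"]
      # Fallback: everything else keeps going to the default queue.
      - ["*", "default"]
```

Rules are evaluated top to bottom, so the wildcard fallback must come last.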
## Pre-check
Before rolling out, the following checklist must be completed to ensure reliability and safety:
- [ ] All workers do not depend on their queue:
  - They should not implement their own capacity control by checking their own queue size. If they do, we should redirect them to use `LimitedCapacity::Worker` instead.
  - They should not store or cache anything in Redis under the queue name key.
  - If there is a good reason for a particular worker to depend on the queue name, the corresponding documentation should be updated to reflect this semantic change.
- [ ] Maintaining stage groups should be aware of this change. This is especially necessary for shards with a specific purpose, like `elasticsearch`.
- [ ] It's absolutely normal for a worker not to have any logs. We still need to include and inspect such workers.
- [ ] Test both the Sidekiq client and server in the local environment. The best way is to actually apply the configuration and test a full flow in the UI. However, that is too complicated and time-consuming. Instead, we can:
  - Test the Sidekiq client in the local environment. One simple way is to update `config/gitlab.yml` to include the new routing rules, start a console, and inspect the queues of the targeted workers.
  - Test the Sidekiq server in the local environment. It's highly recommended to bring up a real Kubernetes cluster locally and start the Sidekiq cluster with the dry-run flag to compare the listening queues against the existing queues of the aforementioned workers. The dry-run command is `bin/sidekiq-cluster --dryrun ...`. Another method is to inspect the cmdline of the Sidekiq pod's container process with `ps -ww -fp [PID]`.
- [ ] Zero-downtime consideration. As stated in #1136 (comment 607419452), it is feasible that the full rollout may take tens of minutes. We are updating both Sidekiq clients and Sidekiq servers while rolling out to our fleets, so there are the following scenarios:
  - Old clients, before being suspended, still send jobs to the per-worker queues. Those jobs can be captured by both new and old servers. This is not a problem.
  - New clients send jobs to the per-shard queue. Those jobs can only be captured by new servers. As a result, there could be a period of time when jobs stay in the per-shard queue without any new servers pulling from it. Therefore, it's critical to apply the configuration for the servers before the clients.
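To make the selector rule concrete, the following self-contained sketch shows how a query such as `feature_category=global_search&urgency=throttled` can be matched against worker metadata. `match_worker?` and the sample hash are hypothetical simplifications; the real implementation is `Gitlab::SidekiqConfig::WorkerMatcher`, used in the appendix script.

```ruby
# Hypothetical, simplified matcher: every `attribute=value` condition in
# the query (joined by `&`) must hold for the worker's metadata.
def match_worker?(query, worker)
  query.split('&').all? do |condition|
    attribute, expected = condition.split('=', 2)
    worker[attribute.to_sym].to_s == expected
  end
end

# Sample metadata mirroring ElasticCommitIndexerWorker from the table above.
worker = {
  worker_name: 'ElasticCommitIndexerWorker',
  feature_category: :global_search,
  urgency: :throttled
}

match_worker?('feature_category=global_search&urgency=throttled', worker) # => true
match_worker?('feature_category=global_search&urgency=low', worker)       # => false
```

The real matcher also supports negation and set membership, but the principle is the same: a worker is routed to the shard's queue if and only if all conditions hold.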
## Migrations
Please follow the linked issues for the detailed migration steps.
## Appendix

### Script to fetch workers of a shard
```ruby
require 'net/http'
require 'uri'
require 'yaml'

# Fetch the list of stage groups from www-gitlab-com to map feature
# categories to their maintaining groups.
url = URI("https://gitlab.com/gitlab-com/www-gitlab-com/raw/master/data/stages.yml")
request = Net::HTTP::Get.new(url)
response = Net::HTTP.new(url.host, url.port).tap { |http| http.use_ssl = true }.request(request)
groups = YAML.safe_load(response.read_body)["stages"].values.flat_map { |stage| stage["groups"].values }

# Collect the workers matching the shard's queue selector rule and print a
# Markdown table row for each of them.
worker_metadatas = Gitlab::SidekiqConfig::CliMethods.worker_metadatas
matcher = Gitlab::SidekiqConfig::WorkerMatcher.new('feature_category=global_search&urgency=throttled')

worker_metadatas.select { |w| matcher.match?(w) }.each do |w|
  group = groups.find { |g| g['categories'].include?(w[:feature_category].to_s) }
  puts "| #{w[:worker_name]} | #{w[:feature_category]} | [#{group['name']}](http://about.gitlab.com/#{group['group_link']}) |"
end
```

