I can't find Sidekiq-on-K8s logs in GCS
Problem
I wanted to get some older (no longer in Elasticsearch) Sidekiq logs for @rpereira2: scalability#957
I saw that gitlab-gprd-logging-archive/sidekiq was populated, so I followed https://gitlab.com/gitlab-com/runbooks/-/blob/master/docs/logging/logging_gcs_archive_bigquery.md#accessing-fields-that-cant-be-loaded-due-to-invalid-characters. However, the resulting table doesn't contain all Sidekiq shards; I think it only contains the shards that run on VMs. Running this in BigQuery:
SELECT JSON_EXTRACT_SCALAR(json, "$.jsonPayload['shard']") as shard, COUNT(*) FROM `gitlab-production.smcgivern.container_registry_migration_pre` GROUP BY shard ORDER BY shard ASC;
Gives:
| Row | shard | f0_ |
|---|---|---|
| 1 | null | 198 |
| 2 | catchall | 50840499 |
| 3 | catchnfs | 56131 |
| 4 | default | 6575744 |
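For anyone who wants to spot-check a single archive object without loading it into BigQuery, the same per-shard count can be sketched locally. This is only a rough sketch: it assumes the archived objects are newline-delimited JSON with the shard under `jsonPayload.shard` (the same path the query uses), and `count_shards` is a hypothetical helper name, not something from the runbook.

```python
import json
from collections import Counter


def count_shards(lines):
    """Count log entries per jsonPayload.shard.

    Entries without a jsonPayload or without a shard key are counted
    under None, mirroring the null row in the BigQuery result.
    """
    counts = Counter()
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines between records
        entry = json.loads(line)
        payload = entry.get("jsonPayload")
        if not isinstance(payload, dict):
            payload = {}  # assumption: non-object payloads carry no shard
        counts[payload.get("shard")] += 1
    return counts


if __name__ == "__main__":
    # Made-up sample records, for illustration only.
    sample = [
        '{"jsonPayload": {"shard": "catchall"}}',
        '{"jsonPayload": {"shard": "default"}}',
        '{"jsonPayload": {"shard": "catchall"}}',
        '{"jsonPayload": {}}',
    ]
    for shard, n in sorted(count_shards(sample).items(), key=lambda kv: str(kv[0])):
        print(shard, n)
```

To run it against a real object, pass an open file handle for the downloaded object instead of `sample`.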
The jobs I want ran on the database-throttled shard, which doesn't appear in these results: https://thanos.gitlab.net/graph?g0.range_input=3d&g0.end_input=2021-03-18%2000%3A00&g0.moment_input=2021-03-25%2012%3A56%3A08&g0.max_source_resolution=0s&g0.expr=sum%20by%20(shard)%20(increase(sidekiq_jobs_completion_seconds_count%7Bqueue%3D%22background_migration%22%2C%20env%3D%22gprd%22%7D%5B1h%5D))&g0.tab=0
Desired outcome
Someone tells me where the logs for Sidekiq shards running on Kubernetes are stored.
Acceptance criteria
-
...