Tune HPA minimumReplicas for ai-assisted webservice in all zones
What does this MR do?
This is a follow-up to !2969 (merged). It rolls out the change to all zones, allowing us to scale the replicas from 30 down to 3 (or a value close to 3).
refs https://gitlab.com/gitlab-com/gl-infra/scalability/-/issues/2480
Author Check-list
Please read the Contributing document and once you do, complete the following:
-
Check if all of the following apply: - Assign to the correct reviewer per the contributing document
- Apply the correct metadata per the contributing document
- Link to related MRs for applying the changes on other environments
- Link to related Chef changes
- If necessary link to a Criticality 4 Change Request issue
Reviewer Check-list
-
Check if all of the following apply: - Reviewed the diff jobs to confirm changes are as expected
- No changes shown in the diffs not associated with this MR - This may require a rebase or further investigation
Applier Check-list
-
Make sure there is no ongoing deployment for the affected envs before merging (see #announcements slack channel)