chore: hpa requests limits for model-gateway
Setup horizontal pod autoscaling (HPA) for model-gateway to automatically scale replicas as traffic increases. For https://gitlab.com/gitlab-com/gl-infra/reliability/-/issues/23840.
Edited by Chance Feick