Skip to content

chore: hpa requests limits for model-gateway

Chance Feick requested to merge chore/hpa-requests-limits into main

Setup horizontal pod autoscaling (HPA) for model-gateway to automatically scale replicas as traffic increases. For https://gitlab.com/gitlab-com/gl-infra/reliability/-/issues/23840.

Edited by Chance Feick

Merge request reports