chore: hpa requests limits for model-gateway (!211) · Merge requests · GitLab.org / ModelOps / AI Assisted (formerly Applied ML) / Code Suggestions / AI Gateway · GitLab

Chance Feick requested to merge chore/hpa-requests-limits into main Jun 29, 2023

Setup horizontal pod autoscaling (HPA) for model-gateway to automatically scale replicas as traffic increases. For https://gitlab.com/gitlab-com/gl-infra/reliability/-/issues/23840.

Edited Jul 06, 2023 by Chance Feick