Migrate bastion servers to managed instance groups
Summary
We have lb-bastion.prdsub.gitlab.com
which is a load balancer between 3 instances. Sometimes instances become unhealthy because their network goes down.
In the past production#6482 (closed), production#6530 (closed) we had to restart the machines, which is just manual work for someone to do.
Related Incident(s)
Desired Outcome/Acceptance Criteria
Have the instances automatically restart/re-created as soon as they are unhealthy. We can have the bastion servers in instance groups which will being autohealing and recreate the VM as soon as it becomes unhealthy.
The managed instance group should probably be regional so that we can have it on the different zones (as we have it right now).
Associated Services
Corrective Action Issue Checklist
-
Link the incident(s) this corrective action arose out of -
Give context for what problem this corrective action is trying to prevent from re-occurring -
Assign a severity label (this is the highest sev of related incidents, defaults to 'severity::4') -
Assign a priority (this will default to 'priority::4')