Convert gprd redis-cache to C2 machine type
Context: https://gitlab.com/gitlab-com/gl-infra/infrastructure/-/issues/9636
Production Change - Criticality 2

| Change Objective | Change the redis-cache nodes to C2 machine types with better CPUs |
|---|---|
| Change Type | ConfigurationChange |
| Services Impacted | Service::Redis |
| Change Team Members | @igorwwwwwwwwwwwwwwwwwwww @craigf |
| Change Criticality | C2 |
| Change Reviewer | @T4cC0re |
| Tested in staging | #1866 (closed) |
| Dry-run output | N/A |
| Due Date | 2020-03-31 10:15 UTC (engineer @ 12:15) |
| Time tracking | 2hr |
## Detailed steps for the change
1. Merge the terraform MR. **DO NOT APPLY WITH CI**: that would cause all nodes to be restarted, or the apply to fail if Terraform does not allow restarts (likely).
2. Identify the current primary redis-cache node. SSH to each node and find the one that has `PRIMARY-REDIS` in the prompt.
3. Select one of the replicas, and shut it down.
4. Change the machine type: `gcloud --project gitlab-production compute instances set-machine-type redis-cache-N-db-gprd --machine-type c2-standard-30 --zone us-east1-X`, where `N` selects the node and `X` the zone that node is in (01 = c, 02 = d, 03 = b).
5. Change the disk size: `gcloud --project gitlab-production compute disks resize redis-cache-N-db-gprd-data --size 500 --zone us-east1-X`
6. Start the node: `gcloud --project gitlab-production compute instances start redis-cache-N-db-gprd --zone us-east1-X`. (A helper-script sketch combining steps 4-6 appears after this list.)
7. Once the node has started up:
    - Resize the data filesystem: `sudo resize2fs /dev/sdb`
    - Verify that redis and sentinel started with `sudo gitlab-ctl status`. If necessary, start them manually with `sudo gitlab-ctl start`.
8. On the sentinel master (`ssh redis-cache-sentinel-N-db-gprd.c.gitlab-production.internal`), verify:
    - With `sudo tail -f /var/log/gitlab/redis/current /var/log/gitlab/sentinel/current`: that the slave reconnected; that sentinel reported `+sdown` on slave and sentinel before the restart, and a reboot and `-sdown` on slave and sentinel when the services started up again; and that `Synchronization with slave IP:6379 succeeded` appears.
    - With `/opt/gitlab/embedded/bin/redis-cli -p 26379 sentinel replicas gprd-redis-cache`: that two replicas are known.
9. Monitor for 5 minutes, and ensure that nothing unexpected happens on the redis cluster (e.g. further failovers).
10. Force a failover to the modified node:
    - On the replica that is still type n1, temporarily ensure it cannot be failed over to:
        - `REDIS_MASTER_AUTH=$(sudo grep ^masterauth /var/opt/gitlab/redis/redis.conf | cut -d\" -f2)`
        - `/opt/gitlab/embedded/bin/redis-cli -a $REDIS_MASTER_AUTH CONFIG SET replica-priority 0`
    - On any one of the redis-cache-sentinel nodes: `/opt/gitlab/embedded/bin/redis-cli -p 26379 SENTINEL failover gprd-redis-cache`.
    - Verify it failed over to the c2 node with `/opt/gitlab/embedded/bin/redis-cli -p 26379 sentinel master gprd-redis-cache`, looking for the last octet of the IP address in item (4) to be `10X` (X being the index of the node).
    - On the replica that is still of type n1, enable failover again:
        - `REDIS_MASTER_AUTH=$(sudo grep ^masterauth /var/opt/gitlab/redis/redis.conf | cut -d\" -f2)`
        - `/opt/gitlab/embedded/bin/redis-cli -a $REDIS_MASTER_AUTH CONFIG SET replica-priority 100`
11. Monitor for 15 minutes. (A polling sketch appears after this list.)
    - Tail the logs: `sudo tail -F /var/log/gitlab/redis/current` and `sudo tail -F /var/log/gitlab/sentinel/current`.
    - Ensure that nothing unexpected happens on the redis cluster (e.g. further failovers).
    - Monitor performance using [the single_threaded_cpu saturation panel](https://dashboards.gitlab.net/d/alerts-saturation_component/alerts-saturation-component-alert?orgId=1&from=now-3h&to=now&panelId=2&tz=UTC&var-environment=gprd&var-type=redis-cache&var-stage=main&var-component=single_threaded_cpu&fullscreen), which shows the highest single-CPU usage on the cluster, i.e. the master (both before and after). We expect this number to drop; if it rises, roll back immediately.
12. Apply the process for the second instance.
13. Apply the process for the third instance.
14. Locally: `tf plan -target module.redis-cache`. There should be no plan.
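The gcloud commands in steps 4-6 take the same node/zone parameters each time. Below is a minimal sketch of a wrapper that applies them for one node; it is not part of the official runbook, and it assumes zero-padded node names (e.g. `redis-cache-01-db-gprd`, matching the staging names used later in this plan) and the zone mapping from step 4.

```bash
#!/usr/bin/env bash
# Sketch: convert one redis-cache node (steps 4-6 above).
# Assumes zero-padded node names and the zone mapping 01=c, 02=d, 03=b.
set -euo pipefail

N="${1:-}"                   # node index: 1, 2 or 3
case "$N" in
  1) X=c ;;
  2) X=d ;;
  3) X=b ;;
  *) echo "usage: $0 <node index 1-3>" >&2; exit 1 ;;
esac

NODE="redis-cache-0${N}-db-gprd"
ZONE="us-east1-${X}"

# The node must already be shut down (step 3) before the type change.
gcloud --project gitlab-production compute instances set-machine-type "$NODE" \
  --machine-type c2-standard-30 --zone "$ZONE"
gcloud --project gitlab-production compute disks resize "${NODE}-data" \
  --size 500 --zone "$ZONE"
gcloud --project gitlab-production compute instances start "$NODE" --zone "$ZONE"
```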
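For the monitoring in steps 9 and 11, the sentinel's view of the cluster can also be polled rather than eyeballed. A sketch, run on any redis-cache-sentinel node; it assumes redis-cli's raw output mode when piped, where each field of `SENTINEL master` prints on its own line:

```bash
# Sketch: poll sentinel every 30s and print the master's address, flags and
# replica count; a changing ip or any flag other than "master" is suspect.
for i in $(seq 1 10); do
  date
  /opt/gitlab/embedded/bin/redis-cli -p 26379 sentinel master gprd-redis-cache \
    | grep -A1 -E '^(ip|flags|num-slaves)$'
  sleep 30
done
```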
The change of node type for the other 2 nodes (now replicas) will be completed in the following days, once we have confidence there are no unexpected effects (unlikely, but worth being careful).
## Rebuilding nodes whose root disks+filesystems were accidentally grown
You can't shrink ext4 filesystems online, and you can't unmount the root filesystem (well, not without a lot of pivot_root trickery). Since we accidentally grew the root disk+filesystem, we need to rebuild these nodes.
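Whether a given node is affected can be checked before rebuilding by comparing device and filesystem sizes. A minimal sketch, assuming the root filesystem lives on `/dev/sda1` (an assumption; verify with `lsblk` first):

```bash
# Sketch: compare disk/partition sizes against the mounted root filesystem.
lsblk /dev/sda                          # device and partition sizes
df -h /                                 # size of the mounted root fs
sudo dumpe2fs -h /dev/sda1 2>/dev/null | grep -i 'block count'  # ext4 block count
```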
Run these steps on staging first.
1. If the node is a master, initiate a failover (see the steps above).
2. `tf apply -target module.redis-cache`
3. Await chef convergence: `gcloud --project=gitlab-staging-1 compute instances tail-serial-port-output redis-cache-0N-db-gstg --zone=us-east1-X | grep startup-script`. (A post-rebuild verification sketch follows this list.)
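After convergence, the same health checks as in the main procedure apply. A short sketch reusing those commands; note that the master name `gprd-redis-cache` is for production, and the staging equivalent (presumably `gstg-redis-cache`) is an assumption:

```bash
# Sketch: post-rebuild checks, reusing commands from the main steps.
sudo gitlab-ctl status                  # redis and sentinel should show "run:"
/opt/gitlab/embedded/bin/redis-cli -p 26379 \
  sentinel replicas gprd-redis-cache    # expect two replicas (run on a sentinel node)
```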
## Rollback steps
1. Force a failover away from the C2 node: `/opt/gitlab/embedded/bin/redis-cli -p 26379 SENTINEL failover gprd-redis-cache`. (A verification sketch follows this list.)
2. Revert the change in the MR.
3. Shut down the C2 node.
4. Change back to machine type n1-highmem-16: `gcloud --project gitlab-production compute instances set-machine-type redis-cache-N-db-gprd --machine-type n1-highmem-16 --zone us-east1-X`
5. Start up the node: `gcloud --project gitlab-production compute instances start redis-cache-N-db-gprd --zone us-east1-X`
6. Once the node has started up, verify that redis and sentinel started with `sudo gitlab-ctl status`. If necessary, start them manually with `sudo gitlab-ctl start`.
7. Do not roll back the disk resize (it's hard to make this safe).
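After the rollback failover, it is worth confirming that the master actually moved off the C2 node. A sketch using the same sentinel query as step 10 of the main procedure, run on a sentinel node (assumes redis-cli's raw, one-field-per-line output when piped):

```bash
# Sketch: the "ip" field shows the current master; its last octet should no
# longer be the 10X address of the converted node (see step 10 above).
/opt/gitlab/embedded/bin/redis-cli -p 26379 sentinel master gprd-redis-cache \
  | grep -A1 '^ip$'
```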
## Changes checklist
- [ ] Detailed steps and rollback steps have been filled in prior to commencing work
- [ ] Person on-call has been informed prior to the change being rolled out
Based on the work by @cmiskell in #1829 (closed).