Capacity Planning: redis Service, redis_primary_cpu resource

CPU on our primary Redis instance is trending up quite steeply

From the Tamland report at https://gitlab-com.gitlab.io/gl-infra/tamland/saturation.html

This is a placeholder for now, but we need to start thinking about next steps for scaling the Redis service (along with Redis-Sidekiq: #590 (closed), which has its own issue)

Possible actions include:

Watch and wait to see if things flatten out
Investigate the sources of traffic and optimize the application
Optimizations to the Redis instance, including upgrading to Redis 6 with --io-threads
Vertical scaling, if GCP offer more powerful instance types (~~Afaik, I don't think they do have any more powerful Intel machines, and we can't experiment with things like AMD EPYC processors yet, as they're in Beta and don't run in us-east1 yet~~ should we investigate N2D AMD EPYC nodes?)
Break off more bits from the Redis instance.
1. Rack sessions seem like a possible option and would also unblock us from Redis Cluster
Address any Redis Cluster key violations in preparation for Redis Cluster
Start preparing to Redis Cluster

Edited Nov 05, 2020 by Andrew Newdigate