Use repmgr to manage Geo PostgreSQL replication
As discussed in https://gitlab.com/gitlab-com/infrastructure/issues/3423#note_53204703
I don't know all the details here, but it sounds as though repmgr would know about every PostgreSQL server in every Geo node and would automatically handle failover when one server fails. So perhaps this would also solve our current problems with n-node Geo DR, in addition to HA?
@jarv @ahanselka Am I understanding this correctly? What's the current status of repmgr: is it integrated into Omnibus, or is it managed externally?
repmgr
repmgr is an open-source tool suite for managing replication and failover in a cluster of PostgreSQL servers. It enhances PostgreSQL's built-in hot-standby capabilities with tools to set up standby servers, monitor replication, and perform administrative tasks such as failover or manual switchover operations.
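To make the tasks in that description a bit more concrete, here is a rough sketch of the repmgr command lines involved (clone a standby, register it, inspect the cluster, switch over). It only builds the argument lists and prints them; it does not run anything. The host name `pg-primary.example.com`, the `repmgr` user and database names, and the `/etc/repmgr.conf` path are placeholders I made up, and how any of this would map onto Geo nodes or Omnibus is exactly the open question above.

```python
import shlex
from typing import List

# Assumed config location; a real install may keep repmgr.conf elsewhere.
REPMGR_CONF = "/etc/repmgr.conf"


def clone_standby(primary_host: str, user: str = "repmgr",
                  dbname: str = "repmgr", conf: str = REPMGR_CONF,
                  dry_run: bool = True) -> List[str]:
    """Command to clone a new standby from the given primary."""
    cmd = ["repmgr", "-h", primary_host, "-U", user, "-d", dbname,
           "-f", conf, "standby", "clone"]
    if dry_run:
        # Only check prerequisites; do not copy any data.
        cmd.append("--dry-run")
    return cmd


def register_standby(conf: str = REPMGR_CONF) -> List[str]:
    """Command to register the local standby in repmgr's metadata."""
    return ["repmgr", "-f", conf, "standby", "register"]


def show_cluster(conf: str = REPMGR_CONF) -> List[str]:
    """Command to list every node repmgr knows about and its role."""
    return ["repmgr", "-f", conf, "cluster", "show"]


def switchover(conf: str = REPMGR_CONF, dry_run: bool = True) -> List[str]:
    """Command, run on a standby, to promote it and demote the primary."""
    cmd = ["repmgr", "-f", conf, "standby", "switchover"]
    if dry_run:
        cmd.append("--dry-run")
    return cmd


if __name__ == "__main__":
    # Print the commands instead of executing them.
    for cmd in (clone_standby("pg-primary.example.com"),
                register_standby(),
                show_cluster(),
                switchover()):
        print(shlex.join(cmd))
```

Automatic failover (as opposed to the manual switchover above) is handled by the separate `repmgrd` daemon, which this sketch does not cover.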