[META] Database Backup/Redundancy Improvements
Goals: - Faster/More Frequent Staging Refreshes (infrastructure#2718) - Improve SLA numbers - Automate automate automate - Security security dsecurity Track various proposed backup improvements - Documented SLA - Automated Restore Tests (infrastructure#2709 and infrastructure#1265) - Time-Delayed Standby infrastructure#2722 - Logical replica - Replace WAL-E (infrastructure#2721) - Change replication auth from password to something else, probably SSL cert - More reliable replication lag monitoring - Use local Azure blob storage instead of AWS S3 when running on Azure - Use a disk for WAL for backups instead of tar files in blobs so it can be snapshotted and mounted directly? - Use LVM or Azure snapshots for backups instead of tar? c.f. BAD: infrastructure#2671 and !346 and
issue