[META] Database Backup/Redundancy Improvements
Goals:
- Faster/More Frequent Staging Refreshes (infrastructure#2718)
- Improve SLA numbers
- Automate automate automate
- Security security dsecurity
Track various proposed backup improvements
- Documented SLA
- Automated Restore Tests (infrastructure#2709 and infrastructure#1265)
- Time-Delayed Standby infrastructure#2722
- Logical replica
- Replace WAL-E (infrastructure#2721)
- Change replication auth from password to something else, probably SSL cert
- More reliable replication lag monitoring
- Use local Azure blob storage instead of AWS S3 when running on Azure
- Use a disk for WAL for backups instead of tar files in blobs so it can be snapshotted and mounted directly?
- Use LVM or Azure snapshots for backups instead of tar?
c.f. BAD: infrastructure#2671 and !346 and
issue