XLOG generation peak

All times in UTC.

  • 5:39pm: PagerDuty alert: "Postgres is generating XLOG too fast, expect this to cause replication lag" https://gitlab.pagerduty.com/incidents/PGXBWQH
  • 5:44pm: Peak corroborated. Already starting to trend down Captura_de_pantalla_2018-01-21_a_la_s__14.43.50 Some other anormalities at the time:
    • Postgres IO Utilization Captura_de_pantalla_2018-01-21_a_la_s__14.49.23
    • NFS timeout peak for nfs-file-12 Captura_de_pantalla_2018-01-21_a_la_s__14.48.53
  • 5:49pm: Alert resolved. XLOG generation levels back to normal: Captura_de_pantalla_2018-01-21_a_la_s__14.56.41

/cc @gl-infra