Perform various changes to PostgreSQL of which some require downtime
There is a whole list of configuration changes that we need to apply to PostgreSQL. Most of these don't require downtime, but the following do:
- https://gitlab.com/gitlab-com/infrastructure/issues/1561
- https://gitlab.com/gitlab-com/infrastructure/issues/1555

Postponed until we have failovers so we can change this back:

- https://gitlab.com/gitlab-com/infrastructure/issues/1158
The following changes I would like to apply don't require downtime:
- https://gitlab.com/gitlab-com/infrastructure/issues/1630
- https://gitlab.com/gitlab-com/infrastructure/issues/1559
- https://gitlab.com/gitlab-com/infrastructure/issues/1557
- https://gitlab.com/gitlab-com/infrastructure/issues/1553
- https://gitlab.com/gitlab-com/infrastructure/issues/1552
PLANNING THE CHANGE
For more background on when this template should be used, see the infrastructure handbook.
- Context: What is the background of the change? Relevant links?
- Downtime: Will the change introduce downtime, and if so, how much? Yes, 10 minutes at most.
  - What options were considered to avoid downtime? Failovers could be used, but the failover system we currently have is not reliable.
  - What is the downtime estimate based on? Can it be tested in some way? We just need to restart the DBs, which can take up to a minute per DB.
- People:
  - Who will be present? Who will handle communications (twitter, banner, google doc, etc.)? What other roles will be needed? Yorick will be present.
- Pre-checks: What should we check before starting with the change? Consider dashboards, metrics, limits of current infrastructure, etc. Not much, since we're adding settings that weren't used before, but we do need to monitor response timings in case we reduce the buffer sizes too much.
  - Does the change alter how we use Azure or how many of Azure's resources we use? No.
  - Check that you have all the correct versions of the required software installed on the affected hosts. We do.
  - Check that you have the right access level to the required resources. I do.
- Change Procedure:
  - List the steps that are needed for the change; be as granular as possible.
  - Did you do a dry run to test / measure performance and timings? Staging is currently being heavily used for other things, and it is such a different environment that it can't be used reliably for this.
- Preparatory Steps: What can be done ahead of time? How far ahead?
- Post-checks: What should we check after the change has been applied?
  - How do we know the change worked well? How do we know the change did not work well? What monitoring do we need to pay attention to? How do we verify data integrity?
    - Response timings need to be monitored in case we reduced the buffer sizes too much.
    - Memory usage of PostgreSQL should be monitored to see how it is affected by the buffer sizes.
    - See the spot-check sketch at the end of this planning section.
  - Should any alerts be modified as a consequence of this change? No.
- Rollback procedure: In case things go wrong, what do we need to do to recover? The configuration changes can simply be undone, but we can't downgrade PostgreSQL from 9.6.3 to 9.6.1.
- Create an invite using a 4 hr block of time on the "GitLab Production" calendar (link in handbook), inviting the ops-contact group. Include a link to the issue. (Many times you will not expect to need - or actually need - all 4 hrs, but past experience has shown that delays and unexpected events are more likely than having things go faster than expected.)
- Ping the Production Lead in this issue to coordinate who should be present from the Production team, and to confirm scheduling.
- When will this occur? Leave blank until scheduled.
- Communication plan:
  - Tweet: default to tweeting when the schedule is known, then again 12 hrs before, 1 hr before, when starting, during if there are delays, and after when complete.
  - Deploy banner: display a warning 1 hr before.
  - Other?
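For the post-checks above, a minimal spot-check sketch, assuming omnibus-GitLab's `gitlab-psql` wrapper is available on the database hosts; response timings themselves should still be watched on the monitoring dashboards:

```shell
# Connections in use versus the new max_connections limit.
sudo gitlab-psql -c "SELECT count(*) AS in_use,
                            current_setting('max_connections') AS max_conn
                     FROM pg_stat_activity;"

# Total resident memory of the PostgreSQL processes, to watch how it is
# affected by the buffer sizes.
ps -C postgres -o rss= | awk '{ total += $1 } END { print total / 1024 " MiB resident" }'
```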
DOING THE CHANGE
Preparatory steps
- Enable the deploy page
- Stop Sidekiq and Unicorn
- Stop db1, db3, and db4
- Update the package version in Chef
- Add the settings necessary to Chef
- Run `chef-client` / `gitlab-ctl reconfigure` on the servers
- Start the servers again, starting with db1
- Start Sidekiq
- Start Unicorn
- Disable deploy page
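Taken together, a minimal sketch of this sequence, assuming omnibus-GitLab's `gitlab-ctl` on all hosts, `db1` as the primary, and that the new package version and settings are delivered through Chef as described above:

```shell
# On the application servers: show the deploy page and stop the app.
sudo gitlab-ctl deploy-page up
sudo gitlab-ctl stop sidekiq
sudo gitlab-ctl stop unicorn

# On each database server (db1, db3, db4): stop PostgreSQL, then pull in the
# new package version and settings via Chef and apply them.
sudo gitlab-ctl stop postgresql
sudo chef-client
sudo gitlab-ctl reconfigure

# Start PostgreSQL again, beginning with db1, then db3 and db4.
sudo gitlab-ctl start postgresql

# Back on the application servers: restart the app and drop the deploy page.
sudo gitlab-ctl start sidekiq
sudo gitlab-ctl start unicorn
sudo gitlab-ctl deploy-page down
```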
PostgreSQL Changes To Apply
- Set `max_connections` to `300` ⚠ This must be done on the primary first ⚠
- Update the GitLab package on the database servers to upgrade PostgreSQL to 9.6.3, then restart PostgreSQL ⚠ This must be done on the primary first ⚠
- Set `random_page_cost` to `2`
- Set `max_locks_per_transaction` to `128`
- Set `log_temp_files` to `0`
- Set `log_checkpoints` to `on`
- Set `log_min_duration_statement` to `1000`
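A minimal sketch for verifying the applied values afterwards, assuming the `gitlab-psql` wrapper on each database host:

```shell
# Confirm each changed setting took effect after the reconfigure/restart.
for setting in max_connections random_page_cost max_locks_per_transaction \
               log_temp_files log_checkpoints log_min_duration_statement; do
  sudo gitlab-psql -c "SHOW ${setting};"
done

# The server version should report 9.6.3 after the package upgrade.
sudo gitlab-psql -c "SHOW server_version;"
```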
Changing `max_connections`
This requires a careful set of steps to make sure that all databases play nice:
- Stop all databases
- Adjust the settings on the primary
- Start the primary
- Start the secondaries
- Wait a minute
- Adjust it on the secondaries
- Restart the secondaries
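A minimal sketch of that ordering, assuming `db1` is the primary and `db3`/`db4` are the secondaries (hostnames from this issue), and that the new value is applied on each host by running the Chef reconfigure when that host's turn comes:

```shell
# 1. Stop all databases (host order within this step is an assumption).
for host in db3 db4 db1; do ssh "$host" 'sudo gitlab-ctl stop postgresql'; done

# 2-3. Adjust max_connections on the primary and start it.
ssh db1 'sudo gitlab-ctl reconfigure && sudo gitlab-ctl start postgresql'

# 4. Start the secondaries, still on the old value.
for host in db3 db4; do ssh "$host" 'sudo gitlab-ctl start postgresql'; done

# 5. Wait a minute for replication to settle.
sleep 60

# 6-7. Adjust the setting on the secondaries and restart them.
for host in db3 db4; do
  ssh "$host" 'sudo gitlab-ctl reconfigure && sudo gitlab-ctl restart postgresql'
done
```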
Initial Tasks
- Create a google doc to track the progress. This is because, in the event of an outage, Google docs allow for real-time collaboration and don't depend on GitLab.com being available: https://docs.google.com/document/d/164-kq8LdtuP-qNt9E8JP-AZVb4m3ekX4DpYKxGYJhTU/edit#
  - Add a link to the issue it comes from, and copy and paste the content of the issue, the description, and the steps to follow.
  - Title the steps as "timeline". Use UTC time without daylight saving, so that we are all in the same timezone.
  - Link the document in the on-call log so it's easy to find later.
  - Right before starting the change, paste the link to the google doc in the #production chat channel and "pin" it.
- Discuss with the person who is introducing the change, and go through the plan together to fill any gaps in understanding before starting.
- Final check of the rollback plan and communication plan.
- Set a PagerDuty maintenance window before starting the change.
The Change
- Before starting the change:
  - Tweet to publicly notify that you are performing a change in production, following the guidelines.
- Start running the changes. While this happens, one person makes the change and the other person takes notes of when the different steps happen. Make it explicit who will do what.
- When the change is done, whether it finished successfully or not:
  - Tweet again to notify that the change is finished, and point to the change issue.
  - Copy the content of the document back into the issue, redacting any data necessary to keep it blameless, and deprecate the doc.