
Change: reattempt running Network Gitaly in Production

Zoom Call: https://gitlab.zoom.us/j/882946101

Change Log: https://docs.google.com/document/d/1CSLx115Tt36ceBZ5EU-ZMDYv4-Qq978QcCUuiRP5XD4/edit

PLANNING THE CHANGE

For more background on when this template should be used, see the infrastructure handbook.

  • Context: What is the background of the change? Relevant links?

    • The Gitaly team is attempting to run Gitaly over the network.
    • Full details are available in gitlab-org/gitaly#181 (closed)
    • This is a reattempt at https://gitlab.com/gitlab-com/infrastructure/issues/1777, which failed on Friday as the result of the NFS servers running the wrong version of Gitaly
    • A quick summary of the action is:
      • Start Gitaly on NFS servers
      • Reconfigure one Git host git01 to forward Gitaly requests to NFS servers
      • Watch git01 for errors and roll back if anomalies are detected
      • Allow half an hour's worth of data to be collected
      • Revert git01 back to using a local Gitaly configuration, rolling back the change
      • Analyse the data
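    • To confirm the first step of the summary (Gitaly running on the NFS servers), a quick check could look like the sketch below; this assumes Gitaly is managed by the omnibus runit supervisor on those hosts, and the chef role name is a placeholder:

        # Check the Gitaly service status across the NFS fleet (role name is a placeholder)
        knife ssh 'role:gitlab-nfs-cluster' 'sudo gitlab-ctl status gitaly'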
  • Downtime: Will the change introduce downtime, and if so, how much?

    • We do not foresee any downtime.

    • What options were considered to avoid downtime?

      • We are only deploying this change to a single git worker, git01.fe, so that rollback can be performed very quickly
      • Additionally, this will limit the load on the NFS servers, since only a single host will be sending traffic to the 10 backend NFS servers.
    • What is the downtime estimate based on? Can it be tested in some way?

      • We experienced a limited outage in our previous test
      • This downtime was the result of the NFS servers running the wrong version of Gitaly
      • Prior to our tests, we will ensure that the correct version of Gitaly is running on all hosts
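      • As a sketch of that pre-flight check (the role name is a placeholder, and the manifest path assumes Gitaly ships with the omnibus GitLab package):

          # Print the Gitaly version recorded in the omnibus manifest on every NFS host (role name is a placeholder)
          knife ssh 'role:gitlab-nfs-cluster' 'grep gitaly /opt/gitlab/version-manifest.txt'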
  • People:

    • @ahmadsherif will execute the chef changes and will be standing by for rollback
    • @andrewn will handle communications and will monitor dashboards
    • @jacobvosmaer-gitlab will monitor dashboards and will run some manual tests to ensure that the change is working
  • Pre-checks: What should we check before starting with the change? Consider dashboards, metrics, limits of current infrastructure, etc.

    • We will be monitoring the following dashboards during our test:

    • Check that you have all the correct versions of the required software installed on the affected hosts.

    • Check that you have the right access level to the required resources.

    • Does the change alter how we use Azure or how many of Azure's resources we use? If so, consider opening an "advisory ticket" in the Azure portal to get input from their team.

      • This change does not alter how we use Azure
  • Change Procedure:

    • High-level overview of the change:
      1. Install the latest version of Gitaly on the NFS servers: https://dev.gitlab.org/cookbooks/chef-repo/merge_requests/671
      2. Check versions: using knife, confirm that the version of Gitaly running on the NFS servers is up-to-date and correct.
      3. Update git01.fe.gitlab.com with a network Gitaly configuration
      4. Check that smart info refs are working: git -c http.sslVerify=false ls-remote https://git01.fe.gitlab.com/gitlab-org/gitlab-ce.git. On failure, roll back immediately.
      5. Monitor all the dashboards listed above: on any anomalies, roll back immediately
      6. Monitor for alerts from the alerting systems: on alerts, roll back immediately
      7. Continue to monitor progress for 30 minutes
      8. Roll back the configuration change on git01.fe.gitlab.com
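    • A sketch of the commands behind steps 3 and 4 (the knife invocations assume the change is applied by adding the gitaly-over-network-production role named in the rollback section to git01's run list; adjust to however the role is actually managed):

        # Step 3: apply the network-Gitaly role to git01 and converge (assumes role-based run_list management)
        knife node run_list add git01.fe.gitlab.com 'role[gitaly-over-network-production]'
        ssh git01.fe.gitlab.com 'sudo chef-client'

        # Step 4: smoke-test smart info/refs through the reconfigured host
        git -c http.sslVerify=false ls-remote https://git01.fe.gitlab.com/gitlab-org/gitlab-ce.git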
  • Preparatory Steps: What can be done ahead of time? How far ahead?

  • Post-checks: What should we check after the change has been applied?

    • Monitor the dashboards listed above for increased error rates
    • Monitor alerting for Gitaly related alerts
    • Monitor Kibana for increased error rates and request durations in Workhorse
  • Rollback procedure: In case things go wrong, what do we need to do to recover?

    • The change will be rolled back at the conclusion of the test, or earlier if the test is not successful.
    • Remove the gitaly-over-network-production role from git01's run list, followed by a chef-client run (see the sketch below).
    • It's highly unlikely that updating the GitLab app version on the NFS node will cause a problem, but if it does, the version will be rolled back to 9.1.4-ee (the version currently live there).
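    • A sketch of that role removal (assuming the role was applied via the node's run list, as in the change procedure above):

        # Remove the network-Gitaly role from git01 and converge to restore the local Gitaly configuration
        knife node run_list remove git01.fe.gitlab.com 'role[gitaly-over-network-production]'
        ssh git01.fe.gitlab.com 'sudo chef-client'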
  • Create an invite using a 4 hr block of time on the "GitLab Production" calendar (link in handbook), inviting the ops-contact group. Include a link to the issue. (Many times you will not expect to need - or actually need - all 4 hrs, but past experience has shown that delays and unexpected events are more likely than having things go faster than expected.)

  • Ping the Production Lead in this issue to coordinate who should be present from the Production team, and to confirm scheduling.

  • When will this occur? 08h00 to 12h00 UTC on 31 May 2017

  • Communication plan:

DOING THE CHANGE

Preparatory steps

  • Copy/paste items here from the Preparatory Steps listed above.

Initial Tasks

  • Create a Google doc to track the progress. In the event of an outage, Google docs allow for real-time collaboration and don't depend on GitLab.com being available.
    • Add a link to the issue where it comes from, copy and paste the content of the issue, the description, and the steps to follow.
    • Title the steps as "timeline". Use UTC time (no daylight saving); we are all in the same timezone in UTC.
    • Link the document in the on-call log so it's easy to find later.
    • Right before starting the change, paste the link to the google doc in the #production chat channel and "pin" it.
  • Discuss with the person who is introducing the change, and go through the plan to fill any gaps in understanding before starting.
  • Final check of the rollback plan and communication plan.
  • Set PagerDuty maintenance window before starting the change.
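    • A sketch of opening the window via the PagerDuty REST API v2 (the API token, From address and service ID are placeholders; creating it through the PagerDuty web UI works just as well):

        # Open a maintenance window covering the 08:00-12:00 UTC change slot
        # Placeholders: PAGERDUTY_API_TOKEN, oncall@example.com, SERVICE_ID
        curl -s -X POST 'https://api.pagerduty.com/maintenance_windows' \
          -H 'Authorization: Token token=PAGERDUTY_API_TOKEN' \
          -H 'Accept: application/vnd.pagerduty+json;version=2' \
          -H 'Content-Type: application/json' \
          -H 'From: oncall@example.com' \
          -d '{"maintenance_window": {
                "type": "maintenance_window",
                "start_time": "2017-05-31T08:00:00Z",
                "end_time": "2017-05-31T12:00:00Z",
                "description": "Network Gitaly test on git01",
                "services": [{"id": "SERVICE_ID", "type": "service_reference"}]}}'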

The Change

  • Before starting the Change

    • Tweet to publicly notify that you are performing a change in production following the guidelines.
  • Start running the changes. While this happens, one person makes the change and the other takes notes of when each step happens. Make it explicit who will do what.

  • When the change is finished, whether successfully or not

    • Tweet again to notify that the change is finished and point to the change issue.
    • Copy the content of the document back into the issue, redacting any data necessary to keep it blameless, then deprecate the doc.
    • Perform a quick postmortem in a new issue, following the Blameless Postmortem guideline in the infrastructure handbook.
    • If the issue caused an outage or service degradation, label the issue as "outage".