Try out new circuitbreaker updates in canary

The circuitbreaker has been adjusted on several fronts:

  1. https://gitlab.com/gitlab-org/gitlab-ce/merge_requests/15426: The check now happens inside a separate request to the unicorn of each host, triggered every second by a process running on that host. This means the check no longer has to be performed on every user request (a rough sketch of this model follows the list).

  2. https://gitlab.com/gitlab-org/gitlab-ce/merge_requests/15612: Improved metrics for the storage checks become available as soon as the process is running, without the circuitbreaker actually blocking access to storage.

  3. https://gitlab.com/gitlab-org/gitlab-ce/merge_requests/15613: The health page should open again.
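
A minimal sketch of that per-host check model, for illustration only: the local endpoint path, port, and interval handling below are assumptions, and the real behaviour lives in the merge requests above.

```python
# Sketch: a small process on each host asks the local unicorn once per second
# to run the storage access check, so user requests no longer pay for it.
import time
import urllib.request

LOCAL_UNICORN_CHECK_URL = "http://127.0.0.1:8080/-/storage_check"  # hypothetical endpoint
CHECK_INTERVAL_SECONDS = 1


def trigger_storage_check() -> bool:
    """Ask the local unicorn to run its storage access check once."""
    request = urllib.request.Request(LOCAL_UNICORN_CHECK_URL, method="POST")
    try:
        with urllib.request.urlopen(request, timeout=5) as response:
            return response.status == 200
    except OSError:
        # Unicorn unreachable or the check timed out: count it as a failure.
        return False


if __name__ == "__main__":
    while True:
        ok = trigger_storage_check()
        print("storage check", "passed" if ok else "failed")
        time.sleep(CHECK_INTERVAL_SECONDS)
```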

We should try this out again after 10.3 is deployed.

What are we going to do?

Try out the changes to the circuit breaker in canary.

Why are we doing it?

To validate the new behaviour so that we can turn it on in production.

When are we going to do it?

  • Start time: ___

  • Duration: ___

  • Estimated end time: ___

TBD

How are we going to do it?

  1. Enable the process that calls out to the unicorn to perform access checks.
  2. Check that the health page opens: https://gitlab.com/admin/health_check
  3. Check that metrics are arriving in Prometheus (metric name: circuitbreaker_storage_check_duration_seconds); a rough verification sketch follows this list.
  4. Use the network gnome (https://gitlab.com/gl-infra/network-gnome) to block access to the NFS shard where gitlab-ce lives.
  5. Check that the failure count goes up on the health page: https://gitlab.com/admin/health_check
  6. Check the metrics for the failing storage in Prometheus.
  7. Enable the circuitbreaker (https://dev.gitlab.org/cookbooks/chef-repo/merge_requests/1269).
  8. Check that pages of gitlab-ce that do not touch the repository are still accessible.
  9. Check that the repositories of other projects are still accessible.
  10. Re-enable access to the NFS shard.
  11. Check that the service recovers.
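
A rough verification sketch for steps 2–3 and 5–6, not part of the runbook tooling: the Prometheus address, the `_count` histogram suffix, and unauthenticated access to the health page are assumptions; only the metric name comes from the plan above.

```python
import json
import urllib.parse
import urllib.request

HEALTH_CHECK_URL = "https://canary.gitlab.com/admin/health_check"  # assumed canary URL
PROMETHEUS_URL = "http://prometheus.internal.example:9090"          # hypothetical address
METRIC = "circuitbreaker_storage_check_duration_seconds"


def health_page_opens() -> bool:
    """Steps 2 and 5: the admin health check page should return HTTP 200."""
    with urllib.request.urlopen(HEALTH_CHECK_URL, timeout=10) as response:
        return response.status == 200


def metric_is_reported() -> bool:
    """Steps 3 and 6: Prometheus should return samples for the check metric."""
    params = urllib.parse.urlencode({"query": METRIC + "_count"})
    url = f"{PROMETHEUS_URL}/api/v1/query?{params}"
    with urllib.request.urlopen(url, timeout=10) as response:
        payload = json.load(response)
    return bool(payload.get("data", {}).get("result"))


if __name__ == "__main__":
    print("health page opens:", health_page_opens())
    print("metric reported:", metric_is_reported())
```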

How are we preparing for it?

We need to enable this setting on all the nodes of the canary fleet that have unicorns installed.

What can we check afterwards to ensure that it's working?

https://canary.gitlab.com/gitlab-org/gitlab-ce should break on the repository page, while other pages should still work. https://canary.gitlab.com/fdroid/fdroidclient/ should remain available.
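
A quick sketch of that check, assuming that an HTTP 5xx response or a connection error counts as "broken":

```python
import urllib.error
import urllib.request

# Expected outcome while the NFS shard hosting gitlab-ce is blocked.
EXPECTED_AVAILABLE = {
    "https://canary.gitlab.com/gitlab-org/gitlab-ce": False,
    "https://canary.gitlab.com/fdroid/fdroidclient/": True,
}


def is_available(url: str) -> bool:
    """Return True when the page responds with something below HTTP 500."""
    try:
        with urllib.request.urlopen(url, timeout=15) as response:
            return response.status < 500
    except urllib.error.HTTPError as error:
        return error.code < 500
    except OSError:
        return False


if __name__ == "__main__":
    for url, expected in EXPECTED_AVAILABLE.items():
        actual = is_available(url)
        verdict = "as expected" if actual == expected else "UNEXPECTED"
        print(f"{url}: available={actual} expected={expected} ({verdict})")
```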

Impact

  • Type of impact: Production should not be impacted.

  • What will happen: ___

  • Do we expect downtime? (set the override in pagerduty): ___

What is the rollback plan?

Revert the cookbook change (https://dev.gitlab.org/cookbooks/chef-repo/merge_requests/1269).

Monitoring

  • Graphs to check for failures:


  • Graphs to check for improvements:


  • Alerts that may trigger:


[IF NEEDED]

Google Doc to follow during the change (remember to link in the on-call log)


Scheduling

Schedule a downtime in the production calendar twice as long as your worst duration estimate; be pessimistic (better safe than sorry).

When things go wrong (downtime or service degradation)

  • Label the change issue as outage

  • Perform a blameless post mortem

References

cc @ilyaf @DouweM @andrewn