Enable `dynamic_image_resizing_owner` FF via production console for pre-selected 100 users

Production Change

Change Summary

We want to run the Dynamic Image Resizing experiment.
To do that, we need to enable the Feature Flag (`dynamic_image_resizing_owner) for 100 of pre-selected users (a semi-random subset of GL employees).

It will only affect a randomly selected subset of Gitlab Employees' avatars which would be resized in our service before being served. Content images are not affected. We expect there will be no difference in User Experience. We want to monitor the Workhorse performance, stability, and resource utilization.

Change Details

  1. Services Impacted - GL Workhorse
  2. Change Technician - @iroussos
  3. Change Criticality - C3
  4. Change Type - changeunscheduled
  5. Change Reviewer - @alipniagov
  6. Due Date - 10 Sep, 2020, 12:00 UTC
  7. Time tracking - 240 minutes max (180 minutes probably)
  8. Downtime Component - No

Detailed steps for the change

Pre-Change Steps - steps to be completed before the execution of the change

Estimated Time to Complete (mins) - 15

  • Start a sync call with the Memory Team members: Aleksei Lipniagov, Nikola Milojevic, Matthias Kappler, Kamil (if available), and make sure that everyone is on board. Ideal start time would be somewhere in between of 9:00 GMT and 13:00 GMT, but we could discuss.

Change Steps - steps to take to execute the change

Estimated Time to Complete (mins) - 30

Post-Change Steps - steps to take to verify the change

Estimated Time to Complete (mins) - 180

  • Observe the WH performance: CPU, Memory, latency. Verify that there are no visible changes.
  • Check related Prometheus metrics
  • Check related logs in Kibana
  • Post in #production (Slack): /chatops run feature set dynamic_image_resizing_requester 5
  • Check WH, Prometheus, Kibana
  • Increase the dynamic_image_resizing_requester percentage up to 100
  • Check WH, Prometheus, Kibana

Rollback

Rollback steps - steps to be taken in the event of a need to rollback this change

Estimated Time to Complete (mins) - 5

  • Post in #production (Slack): /chatops run feature delete dynamic_image_resizing_requester
  • Verify FF is deleted (copy the Slack link here)
  • Post in #production (Slack): /chatops run feature delete dynamic_image_resizing_owner
  • Verify FF is deleted (copy the Slack link here)

Monitoring

Key metrics to observe

Summary of infrastruture changes

  • [-] Does this change introduce new compute instances? - No
  • [-] Does this change re-size any existing compute instances? - No
  • [-] Does this change introduce any additional usage of tooling like Elastic Search, CDNs, Cloudflare, etc? - No

Changes checklist

Edited by Aleksei Lipniagov