Update Ops and Monitor Visions

In response to this MR and video.

  • Clarify in the Ops Vision that the dashboard user workflow we want to see is more akin to Datadog (set up alert thresholds on metrics, then respond and triage) than to how Grafana is typically used (stare at a dashboard in a NOC)
  • Provide the same clarification, along with an updated vision, in the Monitor Vision
  • Provide rough sketches of the desired experience in each of our monitoring workflows as part of that Monitor update - Due EOD 10/4
    • Instrumentation
    • Triaging
    • Resolve
    • Improve
    • Review - Kenny
    • Review - Sid
  • Review the Monitoring team roadmaps to ensure they are aligned with this updated vision
    • Health - @sarahwaldner
    • APM - @dhershkovitch
    • All - @kencjohnston, Scott, William Chia, Aaron White
  • Accelerate Ops PM hires
  • Prepare FY2021 Ops investment plans to enable the completion of this vision

References

  • Monitor Vision and Workflow Discussion
Edited Oct 21, 2019 by Kenny Johnston