Reliability - Observability Q2 discussion issue

With the overall goals in OKRs now more described in https://docs.google.com/document/d/1ztz-Vj8mj-pKlM7QUAsSl2IXyIaMKaQolOtwE0JKhMI/edit#, creating this issue to update our Vision and gather ideas for what we are focusing on in Q2.

From discussions in 1:1s:

  1. we plan to focus first on how we enable success for both AI and more ephemeral environments to ship metrics to our platform.
  2. Take care of some maintenance items for keeping systems up to date
  3. Focus on spend reduction via reducing usage in our metrics and logging buckets.