Service to create incident or correlate to existing for pipeline failures

Proposal

To support Dogfooding Health components we will need a function to process pipeline webhook events and do the following:

  • Identify if the failure is known or new
    • Create an incident if the failure is new
    • Correlate to known incident if failure is known

Questions

  • How do transient failures fit into the workflow?