Measure and act on flaky specs
Following the [2021-04-20 EP Team meeting](https://docs.google.com/document/d/1yzHuosvykzb_kaldjGP9I6wxpNpMXdNEvEFBkHyZpXs/edit#bookmark=id.6jo5t4fefkda), we want to focus in Q2 on reducing the impact of flaky specs on pipeline stability, which affects the productivity of the Engineering team.

## Benefits

- Improve productivity
  - MR merge time
  - Fewer retries needed during Merge When Pipeline Succeeds (MWPS)
- Improve master stability without manual action
  - Removes doubt about whether master is broken, and removes retrying as the default action.
- Define acceptable thresholds for actions such as quarantining or focusing on refactoring.
- Step towards unlocking merge trains

## Questions

- What actions should be taken based on this information?
- What data is needed to take those actions deterministically?
- How should the data be gathered and stored so that acting on it can be automated?
epic