Follow-up from "Remove :failed category when receiving a non-success webhook log"
About
This issue was created from this discussion !166329 (comment 2402019512):
We set the backoff_count to be 3 less than the recent_failures because we only increment backoff_count after we reach 3 recent_failures. You can see this relationship between the two numbers in the logs https://log.gprd.gitlab.net/app/r/s/7LCpF.
As a follow-up I think we can remove backoff_count and just refer to recent_failures (minus FAILURE_THRESHOLD from it where we would normally refer to backoff_count).
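The relationship described in the comment can be sketched as below. This is a minimal illustration, not the actual GitLab implementation: the method name derived_backoff_count is hypothetical, FAILURE_THRESHOLD is set to 3 only because the comment mentions 3 recent_failures, and clamping at zero is an assumption about how values below the threshold would be handled.

```ruby
# Assumed value, taken from the "3 recent_failures" mentioned in the comment.
FAILURE_THRESHOLD = 3

# Hypothetical helper: derives what backoff_count used to store
# from recent_failures alone.
# Clamping at zero is an assumption: backoff_count is only incremented
# once recent_failures has reached the threshold.
def derived_backoff_count(recent_failures)
  [recent_failures - FAILURE_THRESHOLD, 0].max
end
```

For example, derived_backoff_count(5) returns 2, and any value at or below the threshold returns 0, so the stored column carries no information that recent_failures does not already provide.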
Proposal
The first MR !187077 (merged) to remove backoff_count was merged in 17.11.
The next step is planned for 18.2, the next required stop, because we first need to finalize the MigrateOldDisabledWebHookToNewState background migration, which sets backoff_count for records.
The remaining work is in 18.2 or later:
- Finalize MigrateOldDisabledWebHookToNewState https://docs.gitlab.com/development/database/batched_background_migrations. This will ensure that our migration to drop the backoff_count will not interfere with any remaining batches from that migration.
- Drop backoff_count https://docs.gitlab.com/development/database/avoiding_downtime_in_migrations/#dropping-the-column-release-m1
In the milestone after we drop backoff_count: Remove the ignore rule https://docs.gitlab.com/development/database/avoiding_downtime_in_migrations/#removing-the-ignore-rule-release-m2