gitlab-org/gitlab project returning 404
Please note: if the incident relates to sensitive data, or is security related consider labeling this issue with security and mark it confidential.
Summary
Accidental delete fo the gitlab-org/gitlab project - working on restoring the project.
Service(s) affected : gitlab-org/gitlab project Team attribution : Minutes downtime or degradation :
Timeline
2019-09-24
- 00:10:53 UTC - accidental api request for delete
- 00:12 UTC - reports of 404 on gitlab-org/gitlab
- 00:13 UTC - identified the issues- a team member accidentally marked the project for delete when trying to delete another artifact using the API
- 00:30 UTC - found project in DB as marked for delete- stopping all background processing temporarily
- 00:31 UTC - found project on file server in moved/marked for delete state - backing up
- 00:34 UTC - Looking for the job that would do the delete to make sure it does not process the delete.
- 00:51 UTC - Removing the job that would do the delete
- 00:54 UTC - gitlab-org/gitlab is back with error "This project was scheduled for deletion, but failed with the following message: Sidekiq::Shutdown" due to the actions at 00:30 UTC
- 01:00 UTC - we removed the primary delete job.
- 01:10 UTC - restarted all but best-effort sidekiq nodes (which process deletes)
- 01:10 UTC - working to remove all other delete jobs from the sidekiq queues
- 01:20 UTC - restarted best-effort sidekiq nodes
- 01:45 UTC - the job retried in sidekiq and the project is again 404ing
- 01:50 UTC - stopped all sidekiq jobs and changing user permissions for the job attempting the delete.
- 01:59 UTC - moved back git files on the file stor node and project is not 404 ing
- 02:00 UTC - restarting sidekiq, with permissions changed for user that would perform the delete
- 02:40 UTC - all pending jobs appear to have been cleared, object pools and banners look okay. marking as resovled.