Fix geo-install job failure to start sshd
Overview
If we can resolve the following intermittent error, then we can make the geo-install
job not allowed to fail.
From !3285 (comment 1497782609):
Currently seeing
geo-install
fail with:timeout: down: /home/gdk/gdk2/services/sshd: 1s, want up ...snip... make: *** [Makefile:263: start] Error 1 ERROR: Failed to install.] ERROR: 'cd /home/gdk/gdk2 && gdk install gitlab_repo=/home/gdk/gdk/gitlab' failed. GeoSecondaryHelpers::Abort ./support/geo-add-secondary:278:in `run_command_arr' ./support/geo-add-secondary:156:in `configure_secondary' ./support/geo-add-secondary:115:in `execute' ./support/geo-add-secondary:337:in `<main>'
And in the
sshd
logs:2023-08-01_03:00:42.58057 [36;1msshd : [0mtime="2023-08-01T03:00:42Z" level=fatal msg="failed to load configuration from specified directory" error="open /home/gdk/gdk2/gitlab-shell/.gitlab_shell_secret: no such file or directory"
Impacted categories
The following categories relate to this issue:
-
gdk-reliability - e.g. When a GDK action fails to complete. -
gdk-usability - e.g. Improvements or suggestions around how the GDK functions. -
gdk-performance - e.g. When a GDK action is slow or times out.
Steps to replicate (optional)
Proposal (optional)
Environment (optional)
- Operating system name:
<!-- output of `uname -a` command -->
- Architecture:
<!-- output of `arch` command -->
- The contents of your
gdk.yml
(if any) - Ruby version:
<!-- output of `ruby --version` command -->
- GDK version:
<!-- output of `git rev-parse --short HEAD` command -->
Edited by Michael Kozono