Tooling for GKE troubleshooting does not appear to work
Details
- Point of contact for this request: @skarbek, @msmiley
- If a call is needed, what is the proposed date and time of the call: n/a
- Additional call details (format, type of call): n/a
SRE Support Needed
The tooling documented at https://gitlab.com/gitlab-com/runbooks/-/blob/master/docs/kube/k8s-adhoc-observability.md suffers from problems that prevent it from running:
- The scripts associated with packet capturing get stuck shortly after the toolbox is set up. There is no output, so it is unknown where the hang actually occurs.
- The profiling scripts capture data, but the toolbox then appears to hang, so we must kill the script in order to proceed.
- Both types of scripts get stuck during the initial toolbox setup because the toolbox's apt repository information is out of date. An `apt-get update` is necessary prior to installing the packages required by these scripts.
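
As a possible starting point for a fix, here is a minimal sketch of the setup step with the package index refreshed first. It is not taken from the runbook scripts: the `run` and `install_prereqs` helpers, the `PACKAGES` default, and the `APT_TIMEOUT` and `DRY_RUN` variables are all hypothetical names chosen for illustration. Each command is echoed before it runs and wrapped in `timeout`, so a stall would at least show which step it happened in.

```shell
#!/usr/bin/env bash
# Sketch only: helper names, defaults, and variables below are assumptions,
# not part of the existing runbook tooling.
set -euo pipefail

# Hypothetical package list; substitute whatever the script actually needs.
PACKAGES="${PACKAGES:-tcpdump}"
# Seconds to allow each apt step before giving up (assumed value).
APT_TIMEOUT="${APT_TIMEOUT:-120}"

# Echo the command to stderr, then run it under a timeout.
# With DRY_RUN=1 the command is only printed, not executed.
run() {
  echo "+ $*" >&2
  if [ "${DRY_RUN:-0}" = "1" ]; then
    return 0
  fi
  timeout "$APT_TIMEOUT" "$@"
}

# Refresh the apt index first, then install the required packages.
install_prereqs() {
  run apt-get update
  # PACKAGES is intentionally unquoted so multiple names split into arguments.
  run apt-get install -y $PACKAGES
}
```

Calling `install_prereqs` inside the toolbox would perform the update and install; `DRY_RUN=1 install_prereqs` only prints the commands, which may help when checking what a script would do without touching the node.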