Currently, when the "GitLab Runner" application is installed via the Kubernetes integration, there is no way to update the runner to newer versions.
Further details
This makes it impossible to use newer runner features with the Kubernetes integration. As we introduce new syntax (e.g., for reports), old runners will not be able to run the related jobs.
People who use our integration to run CI/CD will have a hard time accessing those new features if we don't provide an easy way to update the existing runners.
Solution
Automatically upgrade the installed runner application to the latest runner version in sync with GitLab
Show when last upgraded, the chart version, and link to the chart.
Application fails to automatically update
When an application upgrade fails, we should retry # times (same as jobs, which I think is 3 times). When the upgrade fails # times, we should notify the group/project owner of the failure via email.
[Retry upgrade] takes the user to the cluster detail view.
(Note: the project line would not appear for group level clusters)
If the upgrade fails # times, we should revert to the previous working version (when possible) and allow the user to manually retry.
If successful, we show the success alert. If unsuccessful, we show the danger alert once again.
What does success look like, and how can we measure that?
A method to upgrade Kubernetes runners.
Usage ping updated to track how many users have clicked the upgrade button
We installed the runner via GitLab to Kubernetes on GKE yesterday and it installed runner v10.3.0 (2017-12-22). This is a really big issue, please fix ASAP. The Kubernetes integration is not very usable given that outdated versions of things get installed.
Thoughts on updating the live one retroactively with the 11.2 version? Easy to do?
@jlenny Since I'll be preparing a version with 11.3.0 tomorrow (as in gitlab-runner#3579 (closed)) I don't think we need another MR that would contain the update to 11.2 (which will also block the merge of update to 11.3 in case of any problems). Let's just start syncing the versions with 11.3.0.
This feature seems to be more and more important, as we are introducing new syntax (e.g., for reports) that will leave old runners unable to run the related jobs.
People who use our integration to run CI/CD will have a hard time accessing those new features if we don't allow them to update the existing runners in an easy way.
Any chance it could be prioritized in the near future? Thanks!
It seems the chart has already been updated and the only thing left would be to use the new chart version in the k8s integration. Is my read here correct, @tkuah @DylanGriffith?
Is it an automatic upgrade or do we require the user to explicitly hit an upgrade button?
Is there the risk of something breaking for the user? We should automatically upgrade, IMO, but I wonder if further consideration for backward compatibility should be made.
Is there a suitable place to show the current Helm chart and application version that is installed?
I picture an "app info" hover that provides this information when the user requires it.
Is there the risk of something breaking for the user? We should automatically upgrade, IMO, but I wonder if further consideration for backward compatibility should be made.
I think the best thing for us to do is be opinionated and keep things up to date with the versions we set and test in GitLab. Doing this automatically seems preferable to me, but we'd just need some way to present the information to a user if the upgrade failed for some reason. Also, a failed upgrade may cause something to be broken in their cluster, so we'd maybe even want to email them if their previously working Ingress just got upgraded and is now broken (i.e., traffic is no longer reaching their app).
I think being opinionated is important because if people want to manage all of this stuff (and disagree with our opinions) then they can always install it themselves outside of GitLab, but if they install through GitLab we should not put a burden on them to keep things up to date.
I imagine we could have a few details here:
Display the installed version in GitLab's UI
Regular schedule to check your installed applications and update them if they are out of date
Display upgrade errors somewhere
Alert users if an upgrade fails (maybe the user that installed the application should get an email)
I think the best thing for us to do is be opinionated and keep things up to date with the versions we set and test in GitLab. Doing this automatically seems preferable to me, but we'd just need some way to present the information to a user if the upgrade failed for some reason.
I agree, let's go for automatic. I think we can alert users to check the logs of the pod used to upgrade, very similar to how we do this for installs.
Also a failed upgrade may cause something to be broken in their cluster so we'd maybe even want to email them
Notification is going to be interesting. Do we proactively notify or simply leave an error message on the cluster page for that application?
If the upgrade fails, will we automatically retry? How many times?
Yes, we should retry. Sometimes there are temporary connection errors which means re-trying automatically will resolve the situation. We can re-try as many times as we prefer.
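As a rough sketch of what an automatic retry could look like at the Helm level (the release name runner, the chart reference gitlab/gitlab-runner, the TARGET_CHART_VERSION variable, and the limit of 3 attempts are illustrative assumptions; GitLab would drive this from a background job rather than a shell loop):
# Hypothetical sketch: retry the chart upgrade a few times before marking it as failed.
for attempt in 1 2 3; do
  helm upgrade runner gitlab/gitlab-runner --version "$TARGET_CHART_VERSION" && break
  echo "Upgrade attempt $attempt failed"
  sleep 30  # brief back-off; transient connection errors often clear on their own
done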
If it continues to fail, what will the user need to do to mitigate the situation?
Once the user has fixed the issue causing the upgrade to fail, can they have a button to try the upgrade again? That seems sensible to me.
If an upgrade fails, how feasible is it to roll back so that we don't break something in their cluster?
Good question. I think it's feasible as long as we know the previous version to roll back to, which we should have in the history. Are you thinking of an automatic rollback in the event of failure?
My guess is that Helm charts mainly use Kubernetes Deployments, so an upgrade will not break things in the sense that the old version will keep running until a new version can successfully start. Of course, that doesn't mean it's foolproof, so we can still roll back with helm rollback.
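For what a rollback could look like with the Helm CLI, a hedged sketch (the release name runner is assumed, and the revision number is illustrative; it would come from the release history):
# List the release history to find the last revision that worked.
helm history runner
# Roll back to that revision, e.g. revision 2, leaving the failed upgrade behind.
helm rollback runner 2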
Daniel Gruesso changed title from Provide a method for upgrading kubernetes runner to Provide a method for upgrading kubernetes runner application via kubernetes integration
Daniel Gruesso changed title from Provide a method for upgrading kubernetes runner application via kubernetes integration to Upgrade kubernetes runner application via kubernetes integration
Can we assume users will have stock-standard configurations?
Are there any users who may have edited their Helm applications after installing them via GitLab managed apps? (Feels increasingly difficult, since mutual auth.)
For those following this issue, feature freeze is not actually for another 6 days; @gitlab-bot appears to have applied the label incorrectly, so that can be disregarded at this point.
Apologies for the spam, but as with yesterday this issue has not actually missed %11.7 at this point (feature freeze is not for several days). Re-removing the missed:11.7 label.
The description is referencing the application version number, which I think makes sense as that is what is being automatically updated.
Do we know which apps do not have the appVersion?
Alternatively, we could show Upgraded x days ago and include the [appVersion] if it's available. That way, the user knows when the app was last updated even if the version number isn't provided.
Regarding the chart number, how easy is it to determine errors caused by chart bugs? Would it make more sense to display an error to the user, rather than always displaying the chart number? I could see some benefit to always linking to the current chart version (if that's possible for all apps?), but that could potentially be separated out of this issue.
Looks like all the apps we currently have do have appVersion, so we could cross that bridge when we encounter an app that does not have one.
Alternatively, we could show Upgraded x days ago and include the [appVersion] if it's available. That way, the user knows when the app was last updated even if the version number isn't provided.
Sounds good to me.
Regarding the chart number, how easy is it to determine errors caused by chart bugs? Would it make more sense to display an error to the user, rather than always displaying the chart number? I could see some benefit to always linking to the current chart version (if that's possible for all apps?), but that could potentially be separated out of this issue.
Having the information about which chart version is installed is always helpful for isolating the problem. In any case, we lock to one chart version on every GitLab release, so we can infer the chart version if we know the GitLab version(s) involved.
Yes, if we can display the errors, we should. I anticipate that, due to security issues, we might not be able to display the full error, but rather display a summary and then tell them where to find the full details (which are accessible by an admin).
I believe it is probably going to be important to show the chart number. Even if the user can tell the app version is out of date, there is no way to actually figure out whether a new chart has been released for that app version anyway. Since we're using the chart version to install, it's the only indicator we have that an upgrade is available. The app version also seems important, though.
My initial reaction is that it makes sense to show the appVersion and link to the chart. I'm not sure the chart number is important, the actual content within the chart seems important.
Thinking further, maybe the appVersion isn't that important to show. If we tell the user when the app was last updated and link to the chart, they have all the information if and when necessary.
Thanks, this is making it more concrete to me now. Wouldn't linking to the Helm chart with the app version raise questions about which Helm chart version is actually being upgraded to? I can imagine a scenario where the chart developer fixes a bug in the chart but the appVersion stays the same.
It is a lot clearer to me to show the chart version plus the link:
Upgraded 8 days ago to 0.1.44 (appVersion: 11.7.0). View Helm chart
We are upgrading the chart, yes :) The commands would be something like:
helm upgrade runner gitlab/gitlab-runner --version <new-chart-version>
So the chart definitely gets upgraded.
The application may or may not get upgraded; I, as the person upgrading the Helm chart, do not control that. The Helm chart developer controls that by updating the chart to use a new version of the application.
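To make the chart version vs. appVersion distinction concrete, a small hedged example against the public runner chart (the repository URL is the GitLab charts repo; the version numbers are the illustrative ones used elsewhere in this thread):
# Add the GitLab chart repository and refresh the local index.
helm repo add gitlab https://charts.gitlab.io
helm repo update
# The chart's Chart.yaml carries both numbers:
#   version:    the chart version we pin to (e.g. 0.1.44)
#   appVersion: the runner version that chart deploys (e.g. 11.7.0)
helm show chart gitlab/gitlab-runner   # "helm inspect chart" on Helm 2
# Upgrading pins the chart version; the appVersion comes along with it.
helm upgrade runner gitlab/gitlab-runner --version 0.1.44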
The appVersion is displayed for each application and links to the chart. When an application is automatically upgraded, we show the user when it was last updated.
Application fails to automatically update
When an application upgrade fails, we should retry # times (same as jobs, which I think is 3 times). When the upgrade fails # times, we should notify the group/project owner of the failure via email.
(Note: the project line would not appear for group level clusters)
If the upgrade fails # times, we should revert to the previous working version (when possible) and allow the user to manually retry.
If successful, we show a toast message. If unsuccessful, we show the danger alert once again.
cc @pedroms @andyvolpe, please review this use case of alerts as it relates to our recent conversation. Let me know if you have thoughts or improvements! I mimicked job failures as closely as possible, including the email, danger alert, and blue retry button.
@tauriedavis I think this is the appropriate usage of alerts and toasts! The alert (non-dismissable) warns the user a config error has occurred and the toast gives the user immediate feedback and confirmation of their successful attempt at upgrading.
@tauriedavis yes, I agree with this approach of using alerts and toasts.
About the error alert in the app:
Should the retry button be inside the alert?
You change the version text to v1.0 Upgrade failed 2 days ago, but this gives the impression that we failed to upgrade to v1.0. I think we should not change this text and should keep it showing the last successful upgrade. Maybe you can improve the error alert text to “Something went wrong when updating GitLab Runner to v1.0 (2 days ago). …” What do you think?
I'm still porting existing functionality from EE and reconciling lots of duplication, and then I can implement what's not there. This Merge Request adds the ability to upgrade an application; it also starts storing when the application was last updated, and the application upgrade status (updating, updated, update_errored).
I think what's not there are:
Triggering the automatic update on each GitLab release (discussion)
user notification when the upgrade fails
showing when app was last updated
showing app or chart version with link to chart
and a button to allow users to trigger upgrade again
I've updated the description. Please reach out if there are any questions or if I can help.
@jerasmus - I updated the description to use a banner alert for the success message. This should use a toast message, but that component has yet to be implemented by the frontend team. I've created a UX Debt issue to account for this: https://gitlab.com/gitlab-org/gitlab-ce/issues/57017. The UX team has struggled to get this component implemented; it would be great if we were able to work towards adding it to GitLab.
Thanks @tauriedavis, I was going to suggest that we scope that off too since I think this would probably become a discussion point amongst the frontend team before we decide on a dependency for the toast messages.