Remove the feature flag depend_on_persistent_pipeline_ref
Status
This feature is currently behind the depend_on_persistent_pipeline_ref feature flag, which is disabled by default.
We'll evaluate this feature in %12.4 and remove the flag once the feature is confirmed stable.
| Timeline | Action |
| --- | --- |
| Oct 9th 3:00 AM UTC | The feature flag depend_on_persistent_pipeline_ref is enabled for all projects on production |
| Fri Oct 4 07:15:43 UTC 2019 | The feature flag depend_on_persistent_pipeline_ref is enabled for gitlab-com/www-gitlab-com and gitlab-org/gitlab on production |
Problem
When a user uses Pipelines for merged results, pipelines run on a (prospective) merge commit instead of the feature branch. This temporary merge commit is generated at refs/merge-requests/:iid/merge and is overwritten whenever 1. the source branch advances or 2. the target branch advances. This causes the following problem in a development flow, especially in a busy repository:
1. A developer pushes a new commit to a merge request that targets master.
2. The prospective merged result is regenerated. Its SHA is SHA-A.
3. A pipeline for merged results is created and runs on the merged result (SHA-A).
4. Another MR is merged into master, i.e. the master branch advances.
5. The prospective merged result is regenerated. Its SHA is now SHA-B.
6. The pipeline that ran on SHA-A is invalidated (the user sees fatal: reference is not a tree: in the job log) and fails.
7. The developer cannot test the code in the merge request.
This is a crucial problem: the developer cannot see whether the new code passes the tests or not.
Workaround
- Mark the MR as WIP to trigger detached merge request pipelines. These pipelines run on the source branch and therefore finish correctly even when the target branch advances.
- Use a merge train. However, merge trains come into play only once a maintainer has decided to merge something, so they cannot be used during the development flow.
Proposal
- Create a refs/pipelines/:iid ref per pipeline.
- The system ensures a corresponding pipeline ref exists when the build status transitions to running.
- Runners fetch code from the pipeline ref instead of the source branch (refs/heads or refs/merge).
- The system cleans up the pipeline ref when the pipeline finishes, i.e. when there are no more active jobs. (A rough sketch of the runner-side change follows below.)
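As a rough illustration only: under this proposal the runner would fetch the pipeline ref rather than the merge ref, so the persisted SHA stays reachable until cleanup. The pipeline id and the use of CI_COMMIT_SHA below are illustrative assumptions, not the final design.

```sh
# Hypothetical runner-side sketch: fetch the per-pipeline ref instead of refs/merge-requests/:iid/merge.
git fetch origin "+refs/pipelines/1234:refs/pipelines/1234"
# The pipeline's persisted SHA stays reachable via refs/pipelines/1234 until the ref is cleaned up,
# so checking it out no longer races with merge-ref rewrites.
git checkout -q "$CI_COMMIT_SHA"
```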
@ogolowinski This is an important problem to fix for continued dogfooding. Can you take a look and prioritize? Let me know if you need further explanation.
> Mark the MR as WIP to trigger detached merge request pipelines. These pipelines run on the source branch and therefore finish correctly even when the target branch advances.
Would it make sense to fall back to the detached pipeline whenever we can't find the SHA we were expecting? That feels like a workaround as well, given we might fall back to the detached pipeline even on the first pipeline if we're dealing with a busy repository.
@oswaldo The tricky thing is that this happens during pipeline execution: the first job of the pipeline runs on the merge commit, but a later job of the pipeline could fall back to the source ref, which would be very confusing.
```
Fetching changes with git depth set to 10...
Initialized empty Git repository in /builds/gitlab-com/www-gitlab-com/.git/
Created fresh repository.
From https://gitlab.com/gitlab-com/www-gitlab-com
 * [new ref]         refs/merge-requests/29549/merge -> refs/merge-requests/29549/merge
Checking out dcfcbe39 as refs/merge-requests/29549/merge...
fatal: reference is not a tree: dcfcbe393f25412b41798cf6b518512c2aca8401
```
Actually, I think the current situation is not as bad as I thought. Here is what's happening on www-gitlab-com now:
1. A developer pushes a new commit to an MR.
2. The MR creates a merge ref (SHA: AAA).
3. The MR creates a pipeline that runs on the merge ref.
4. The runner clones a fresh repository because project.build_allow_git_fetch == false. This wipes out all previously fetched merge SHAs in the build dir, so only SHA: AAA is available.
5. The target branch advances and the MR creates a new merge ref (SHA: BBB).
6. The developer retries an existing pipeline that ran on SHA: AAA.
7. The runner clones a fresh repository because project.build_allow_git_fetch == false. This wipes out all previously fetched merge SHAs in the build dir, so only SHA: BBB is available, and the retry fails.
So the point is that the runner removes the existing repository every time a job runs. However, if we use GIT_STRATEGY: fetch, or make "git fetch" the default strategy in the project settings (i.e. project.build_allow_git_fetch == true), the runner keeps the existing repository, so SHA: AAA remains available even after the merge ref is rewritten on the Rails/Gitaly side.
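To make that concrete, here is a rough sketch of why the fetch strategy keeps the old SHA usable; the MR iid and the AAA/BBB placeholders are illustrative.

```sh
# First job: fetching the merge ref brings SHA AAA into the local object store.
git fetch origin refs/merge-requests/29549/merge
git checkout -q AAA                # works: AAA was just fetched
# ...the merge ref is rewritten server-side to SHA BBB...
# Retried job in the same build dir: the fetch adds BBB, but AAA's objects remain locally.
git fetch origin refs/merge-requests/29549/merge
git checkout -q AAA                # still works, until a local git gc prunes unreachable objects
```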
So it seems to me that our current solution is to enable the following option in www-gitlab-com.
Remember, this option makes git fetch the default, so it would not affect other projects/organizations. I assume www-gitlab-com has it disabled because it's a very old project.
I'd assume the checked-out repo on the runner side is a best-effort cache and could be wiped at some point, but I think there is a large enough window to ensure the merge ref exists for a reasonable period.
This is a risky assumption. The Runner by default falls back to clone if it doesn't have local sources, and this may happen in many cases. I'd not rely on the fetch strategy for this problem.
> The Runner by default falls back to clone if it doesn't have local sources, and this may happen in many cases
Can you elaborate on when this happens? Could the local repo be removed by any chance aside from the git strategy? How do shared runners share the checked-out repo (if the strategy is fetch), or is it deleted every time a job finishes?
> Could the local repo be removed by any chance aside from the git strategy?
Yes. A simple example: a user manually logged in to a machine and deleted the directory. It can always happen that the directory is removed, for any reason. In that case the script executed by the Runner will detect that the directory does not exist, re-create it, and switch to cloning.
> How do shared runners share the checked-out repo (if the strategy is fetch), or is it deleted every time a job finishes?
It's not shared between different Runners (understood as a [[runners]] entry, so the granularity here is smaller than the Runner host!). Within the same [[runners]] worker on the same Runner host it may be preserved, but this depends highly on the executor and configuration used.
Look at the GitLab.com example:
Jobs for GitLab CE/EE use Shared Runners that re-use autoscaled VMs for up to several jobs. This means that after the first clone, the code is stored on the VM (in a special Docker volume, in our case). But the jobs are distributed across four Shared Runner Managers, and on each manager the machines are autoscaled. Your job may hit a manager and a VM that already contain the code, but it may also get a newly created, fresh machine. In that case the Runner must force a clone; otherwise we would be unable to use autoscaled runners.
Other jobs using the general Shared Runners are distributed across four Shared Runner Managers configured to drop the autoscaled VM right after one job is handled (for security reasons). This means the fetch strategy will never be usable on these Runners. If someone uses our Shared Runners together with Pipelines for Merge Requests, then using fetch is not a solution at all; this workaround is not available for such users.
Yes, I agree with @tmaczukin; it is a very strong assumption.
It seems the problem is that, since the target branch advances often, we can never finish the merge train. But that should never happen if only the merge train controls when changes are merged into master. If different actors advance master, it will behave exactly as described by @dosuken123.
Isn't the problem that we use a hybrid approach: merge train plus other actors advancing master?
Maybe the solution is not to advance automatically on a target branch change, but rather: when we complete the current merge train and check whether we can merge, we find the merge cannot happen, and we restart the train.
As a result we will discover the need to refresh the merge train when the pipeline changes, but it will finish in the end.
@ayufan are you suggesting that the default behavior should be to rerun the merge train in the case where the target branch has advanced while the merge train pipeline is running?
This seems like it could result in a situation where the merge train could rerun many times before it can successfully merge, as you say, because we are using the hybrid approach.
Even given that, this does seem like the correct behavior for this situation. Instead of failing and leaving the process in an unusable state, the system will attempt to recover and the process will succeed eventually. Then, we could potentially address the downsides of the hybrid approach in another issue.
@tmaczukin Thank you for the detailed answer. So, simply put, the probability of a job landing on the same machine is 1 / runner_managers.sum(&:number_of_machines) (excluding the general one-off cases). With four managers, even if the number of machines per manager is 1, the chance of avoiding a clone is 1/4 = 25% at most. This would be unacceptable for www-gitlab-com usage. We'd need a more robust solution than that.
@ayufan @darbyfrey First of all, we're talking about Pipelines for merged results, not Merge Train. Merge Train works fine. The problem is that developers cannot verify their code before it gets on the merge train, because pipelines cannot finish successfully while refs/merge is updated during pipeline execution.
Given that the fetch option is not a viable solution, as discussed above, the next solution would be using refs/train for pipelines for merged results. refs/train is not affected when the target branch advances, because it's outside the GitLab refs/merge lifecycle. The ref is not used until the MR gets on the merge train, and while the MR is on the train, users basically do not create pipelines for merged results. So there are no conflicts from a functionality perspective.
However, there is one limitation: users cannot create multiple MR pipelines in a short interval, because every push rewrites refs/train. For example:
1. A developer pushes a new commit to a merge request. refs/train is rewritten.
2. A pipeline for merged results starts its jobs (Pipeline-A).
3. A developer pushes another commit to the merge request. refs/train is rewritten again.
4. A pipeline for merged results starts its jobs (Pipeline-B). Pipeline-A starts failing because it cannot find the associated SHA on the previous refs/train.
Actually, this makes me think that we don't even need to run pipelines for merged results, because the merged result is evaluated on the merge train anyway. So running pipelines on the source ref is totally fine during development.
So I can think of the following solutions today. Let me know what you think.
Solution 1: Create merge refs per pipeline
Proposal:
The problem is that the system has only one merge ref, refs/merge, and it can be overwritten at an unexpected time. We should create a dedicated merge ref per pipeline to ensure that the pipeline can run to completion. For example, today there is one merge ref, refs/merge-requests/1/merge, but we could create a ref per pipeline, formatted as refs/merge-requests/<mr-iid>/pipelines/<pipeline-id>/merge.
Thoughts:
- This is the safest option. Pipelines are not disturbed when the target/source branch advances.
- Users can retry after the pipeline finishes.
- Creating refs is not a cheap operation. This could greatly impact Gitaly.
- It's hard to determine when the system should garbage collect the old refs.
- As the system accumulates more refs, fetch/clone could become slower.
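A minimal sketch of what creating and cleaning up such a ref could look like, using plain git plumbing for illustration (the iid and pipeline id are made up; in reality this would go through Gitaly):

```sh
# Snapshot the current merged result under a pipeline-scoped ref name.
MERGE_SHA=$(git rev-parse refs/merge-requests/1/merge)
git update-ref refs/merge-requests/1/pipelines/42/merge "$MERGE_SHA"
# Later, whenever we decide to garbage collect the old ref:
git update-ref -d refs/merge-requests/1/pipelines/42/merge
```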
Solution 2: Always check out the latest SHA of the merge ref (Runner)
Proposal:
Pipelines are persisted with the SHA of the ref. This ensures that all pipeline jobs check out the same SHA (i.e. the same code). However, this doesn't work well for dynamic refs such as refs/merge, because they are updated at unexpected times.
Give runners an option so that, when a job runs on refs/merge, the runner always checks out the latest SHA instead of the persisted SHA. That is, we'd replace git checkout <SHA> with git checkout refs/merge-requests/1/merge.
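For illustration, the runner-side difference would look roughly like this, assuming the runner fetches the merge ref right before checking out (the SHA is the one from the job log above):

```sh
# Today: the runner checks out the SHA persisted on the pipeline record.
git checkout -q dcfcbe393f25412b41798cf6b518512c2aca8401
# Solution 2: always check out whatever the merge ref currently points at.
git fetch origin refs/merge-requests/1/merge
git checkout -q FETCH_HEAD
```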
Thoughts:
- This would be one of the cheapest solutions from a performance standpoint. We can use refs/merge directly.
- The SHA shown on the pipeline UI cannot be trusted; the actual SHA used by the runner could be different.
Solution 3: Disable Pipelines for merged results by default and make it opt-in
Proposal:
The system creates detached pipelines during development and a merge train pipeline when the MR is merged. Pipelines for merged results do not run unless users explicitly enable the project-level option (which should be turned on only for non-busy repos).
Thoughts:
- This would be the simplest solution (or, more accurately, a workaround).
- Users cannot use pipelines for merged results by default.
- Interestingly, GitHub does the same: they have a single /merge ref that represents the latest version.
My proposal
I really wonder if it is possible for the runner to simply request the given SHA instead of a refspec, and to say that for using merged results you simply need the latest Git. These loose refs will be recycled after some time. This seems to work:
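Roughly along these lines, assuming a server with uploadpack.allowAnySHA1InWant enabled; the repository URL and SHA below are illustrative:

```sh
git init repo && cd repo
git remote add origin https://gitlab.com/gitlab-com/www-gitlab-com.git
# Request the exact commit by SHA instead of a ref name.
git fetch origin dcfcbe393f25412b41798cf6b518512c2aca8401
git checkout -q FETCH_HEAD
```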
So, we would use a single refs/merge-requests/:iid/merge, but instead of requesting the ref, we could send the exact SHA to request in the refspec.
There's a small window in which these revs could be recycled by GC, but maybe we can create some internal tmp references for them and remove them after some time.
Shinya's proposed solutions
> Solution 1: Create merge refs per pipeline
This is the only option that seems to make sense to me if the above doesn't work. One thing to add: we need to ensure that we clean up after ourselves. If we keep dangling refs for every pipeline and merge request, the whole system will struggle.
> Creating refs is not a cheap operation. This could greatly impact Gitaly.
We are already doing that, so the cost would be identical; we would only store it elsewhere.
> It's hard to determine when the system should garbage collect the old refs.
> As the system accumulates more refs, fetch/clone could become slower.
It should be fine as long as we keep the number of refs minimal, i.e. no dangling refs.
I know that Git has an option to fetch a specific SHA instead of a ref. Maybe this could be used on newer versions, if possible.
Maybe a simple solution would be to degenerate pipelines after some time, like 7 days, and remove these refs at the same time. We already have the option to archive(?) builds, but maybe actually executing this now would make sense. Likely something to think about for the future.
So, maybe just pushing a Sidekiq job scheduled for 7 days from now to remove the given ref would be the way forward to fix the explosion of refs? I don't know if this is a good idea, but some cleanup needs to be there.
Proposal 3 is a workaround and will not work for heavily used repos where people want (or need) to use Pipelines for MRs.
Proposal 2 breaks our design of having all jobs in the pipeline use exactly the same code. If I have jobs tests 1 and tests 2 with all tests distributed across these two jobs, and they use two different codebases (from two different versions of the merge ref), then how can I say that I've fully tested the code?
Proposal 1 seems to be the only proper solution here. Having a defined time for job/pipeline archival connected with the removal of such references should not create that big an impact on the whole system.
I also like the idea of fetching a specific SHA. In most cases the SHA should still be present in GitLab. @ayufan, do you know which version of Git is required to use this?
> I also like the idea of fetching a specific SHA. In most cases the SHA should still be present in GitLab. @ayufan, do you know which version of Git is required to use this?
Something like 2.x. I don't know. We can easily test that.
Not sure if this is at all possible, but is there any way to know when a pipeline starts and stops? If we knew that, then it seems like we could create the ref when the pipeline starts and remove it when it finishes. That way we could always start with a current SHA and wouldn't have to worry about dangling refs.
@darbyfrey It's also not as easy as "remove the ref after the pipeline is finished", because retrying a job would then be impossible. The pipeline has its ref and SHA stored forever, and any retry will try to use this pair forever.
Using a ref dedicated to a pipeline that is removed after some time, or using SHA fetching (where the SHA is available until the next GC run), gives the user some additional time to retry the job if wanted/needed. Using a ref that is removed the moment the pipeline first transitions to a finished state means that any further retry will fail.
That makes sense. I can see why we would want to support the ability to retry. I guess what I was thinking was that if it were possible to create a new ref at the start of the pipeline run and then destroy it at the end, then it could isolate that branch to the specific pipeline run. In the case of a retry, it would just start the process over again by creating a new ref.
However, if creating a new ref for each run is prohibitively expensive, then this probably wouldn't be an option.
@ayufan @tmaczukin @darbyfrey Thank you for the input. I also agree that fetching the SHA instead of fetching the ref would be the best option. I'll take a closer look at it today.
Regarding the ref cleanup timing, we can basically clean it up when the MR is closed/merged. That gives users a large enough window to retry pipelines.
> Regarding the ref cleanup timing, we can basically clean it up when the MR is closed/merged. That gives users a large enough window to retry pipelines.
We do not control that, but we could create bogus links that would prevent some SHAs from being removed for some time. We use that with tmp/ refs today, I think.
> However, if creating a new ref for each run is prohibitively expensive, then this probably wouldn't be an option.
@darbyfrey It is not only expensive; it is practically impossible to reproduce the exact same SHA. You would need to ensure the exact same timing of the operation (a merge commit's SHA depends on its timestamps), and likely some other factors as well. This means it would be flaky to ensure that we always get the same SHA. A pipeline is created for a given SHA, so we cannot change the SHA mid-way.
I was investigating @ayufan's proposal to fetch a specific SHA without specifying refspecs. This is not currently possible because gitlab.com doesn't allow users to set the uploadpack.allowAnySHA1InWant option today, which must be true if we want to accomplish that.
However, there are ongoing discussions about enabling the option on gitlab.com, which seems to be happening for the %12.3 release. If we can ride that wave, this issue becomes basically an easy fix: we'd only change the Rails side to send +<SHA>:<merge-ref> as the refspec to runners. One note: we wouldn't enable it for all projects/users immediately, because it's a fairly expensive operation and there are performance concerns. It could take a long time in the worst-case scenario.
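For illustration, the Rails-to-runner change would amount to a refspec along these lines (the SHA and iid are illustrative, and the server must have uploadpack.allowAnySHA1InWant enabled):

```sh
# Fetch the exact persisted SHA into the local merge ref, instead of asking for the ref by name.
git fetch origin "+dcfcbe393f25412b41798cf6b518512c2aca8401:refs/merge-requests/29549/merge"
git checkout -q refs/merge-requests/29549/merge
```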
If we want to stick with a simple and performant architecture, we'd want to wait for the allowAnySHA1InWant option to be rolled out; otherwise, we'd have to go with the solution 1 I proposed, which creates merge refs per pipeline.
I personally would wait for uploadpack.allowAnySHA1InWant.
Maybe, if this is going to be a feature flag on the project side, we could detect whether allowAnySHA1InWant is available and simply use it, or otherwise fall back to the existing refs/.../merge (which is somewhat broken). If we follow that, the problem solves itself for all projects that have the option enabled.
There is an alternative approach: downloading the source code via the archive API. Something like:
```sh
# In runner
wget -O archive.zip https://gitlab.com/gitlab-com/www-gitlab-com/-/archive/3674ffc3967c73bfd160d640be151ab590d59e2a
tar -xvf archive.zip
mv archive <builds-dir>
```
This way, we don't need to specify refs at all. However, since the archive is just a set of files, users cannot use git commands in the builds dir, which could be a significant limitation, so the effort wouldn't pay off.
I wonder. Maybe we could still "link" the relevant refs to the pipeline lifecycle. This would just generate quite a bit of pressure on info/refs, which would be discarded often if we cache it. It should be possible: since the SHA should be held by a keep-around ref, it should work to dynamically link it to the expected exposed ref and remove that once the pipeline transitions to the success state.
Maybe even do it on the build transition to pending, and clean it up on the pipeline transition to a finished state.
It seems that we could hook into after_transition pending => :running and ensure that the given ref exists for the build. If not, link it to :sha. If we cannot link :sha, we could fail the build with an appropriate failure_reason.
Then, when the pipeline transitions to a finished state, we could remove this auto-generated ref.
This should handle retries pretty well too, and it doesn't require using git fetch <SHA>.
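A rough shell-level sketch of that lifecycle, assuming an illustrative refs/pipelines/:id layout; in reality both steps would be Gitaly operations driven by the state machine hooks above:

```sh
PIPELINE_REF="refs/pipelines/$PIPELINE_ID"
# On after_transition pending => :running: ensure the ref exists for the build.
if ! git show-ref --verify --quiet "$PIPELINE_REF"; then
  if git cat-file -e "$PIPELINE_SHA^{commit}"; then
    git update-ref "$PIPELINE_REF" "$PIPELINE_SHA"   # link the ref to the pipeline's persisted SHA
  else
    echo "SHA no longer exists: fail the build with an appropriate failure_reason" >&2
  fi
fi
# On pipeline transition to a finished state: remove the auto-generated ref.
git update-ref -d "$PIPELINE_REF"
```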
@ayufan Actually, I came up with yet another approach.
Solution 5: Retain history on the ref
We use refs/merge-requests/:iid/retain instead of refs/merge-requests/:iid/merge (a rough sketch follows the steps below):
1. When a user pushes a new commit or creates a pipeline for merged results, the system generates refs/retain from the HEADs of the target and source branches.
2. If the ref already exists, it continues with the next steps instead.
3. It merges the source branch into refs/retain (or cherry-picks the diff commits), if the source has advanced.
4. It merges the target branch into refs/retain (or cherry-picks the diff commits), if the target has advanced.
5. It creates a pipeline on refs/retain with the latest SHA.
6. If a force-push happens on the source or target branch, it regenerates refs/retain.
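A rough sketch of that maintenance in plain git, with illustrative branch names (the real logic would live in Gitaly and would choose between merge and cherry-pick as described above):

```sh
RETAIN="refs/merge-requests/1/retain"
if git show-ref --verify --quiet "$RETAIN"; then
  git checkout --detach "$RETAIN"          # ref exists: continue from the retained history
else
  git checkout --detach origin/feature     # first creation: start from the source HEAD
fi
git merge --no-edit origin/feature         # fold in the source branch if it advanced
git merge --no-edit origin/master          # fold in the target branch if it advanced
git update-ref "$RETAIN" HEAD              # the next pipeline runs on this new HEAD
```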
Thoughts:
- The number of refs will not increase significantly, compared to the create-ref-per-pipeline approach.
- The additional git operations wouldn't be cheap.
- If a force-push/squash happens, the previous pipelines would fail. (But this is already the case with branch pipelines.)
I think it is in general a nice feature that "we control" the force-push behavior; still, I think dynamically creating refs would likely be the better solution.
You still have the problem with git_depth, as a lot of new commits can prevent us from fetching older revs.
I'm also uncertain whether we might hit merge conflicts even without a force push, just a regular push, in cases where we would otherwise merge things just fine (unlikely, but who knows).
In general this seems interesting, but I still wonder whether creating refs dynamically wouldn't simply be better.
Here is a rough estimate if we go with creating a ref per pipeline and advertising it, e.g. refs/pipelines/:iid. The number of refs increases by around 1,000 per day on the gitlab-org/gitlab project, for instance.
If we retain the dedicated refs for 7 days (mainly for retry purposes), the number of additional advertised refs would stay at around 7,000.
Here is a comparison with the other ref types:
```
$ grep -a 'refs/merge-requests/' /home/shinya/Downloads/refs | wc -l
14272
$ grep -a 'refs/environments/' /home/shinya/Downloads/refs | wc -l
32578
$ grep -a 'refs/heads/' /home/shinya/Downloads/refs | wc -l
2120
```
I wonder if we should schedule #26201 (closed) as well, to reduce the total advertised ref size and make some room for the new pipeline-dedicated refs.
Well, actually I think refs/pipelines/:sha might be better in this case. That way, pipelines that run on the same SHA share the same ref, so the total number of refs would be smaller.
The expiration policy is simple: hook into PipelineFinishedWorker as described in #14863 (comment 217553367).
The simplest and the most stable option. Consider, though, that if we hook into the pipeline lifecycle and use a shared ref like refs/pipelines/:sha, multiple pipelines end up managing the same ref, which will result in race conditions (e.g. one pipeline finishing and deleting the ref while another pipeline on the same SHA is still running). Using refs/pipelines/:iid would be more stable in such cases, as you are sure you are controlling only your own pipeline.