PoC job concurrency limit for explorative purpose (!100383) · Merge requests · GitLab.org / GitLab

Fabio Pitino requested to merge poc-jobs-concurrency-limit into master Oct 06, 2022

What does this MR do and why?

This is a naive PoC to implement job concurrency limit and it's submitted for explorative purpose. It took me 1h of explorative work and wanted to share the idea.

⚠ This MR should not be merged ⚠

👎 This solution has some known, possibly solvable, problems:

The limit check is not concurrent safe. If multiple jobs are submitted at the same time there is a risk that many of these jobs get to the runner queue.
It contains some inefficiencies, like scheduling the pipeline processing later in the future if the whole pipeline is stuck waiting for the concurrency to have capacity.
It uses SQL to count running jobs.

👍 Positive aspects:

We keep the builds in a waiting state outside the runner queue. Builds don't get to pending state until there is capacity in their concurrency quota.
The SQL query for the fair scheduling remains untouched.
The SQL query could possibly keep performing very well at scale. Since only a limited number of builds will make it to the queue, the query will have less builds to query.

flowchart LR

created --> capacity{full capacity?}
capacity -- Y --> waiting
capacity -- N --> pending
waiting --> capacity
pending --> running

Again, the purpose of this MR is purely to explore the solution space, understand the complexity involved and where there can be performance bottlenecks.

Screenshots or screen recordings

Screenshots are required for UI changes, and strongly recommended for all other merge requests.

How to set up and validate locally

Numbered steps to set up and validate the change are strongly suggested.

MR acceptance checklist

This checklist encourages us to confirm any changes have been analyzed to reduce risks in quality, performance, reliability, security, and maintainability.

I have evaluated the MR acceptance checklist for this MR.

Edited Oct 14, 2022 by Fabio Pitino

PoC job concurrency limit for explorative purpose

What does this MR do and why?

Screenshots or screen recordings

How to set up and validate locally

MR acceptance checklist

Merge request reports