Consider using `parallel_tests` to speed up the test suite on the CI

@rymai I think this issue could be closed, right? We now use the `parallel:` CI job parameter, so I don't think using `parallel_tests` would add any further improvement?

@ddieulivol This is different: `parallel_tests` allows running tests in parallel on multiple CPU cores within each job (as opposed to across multiple jobs).
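To make the distinction concrete, here is a minimal sketch of the in-job idea: split the spec files into one group per CPU core and hand each group to its own process. The round-robin split and the file names are assumptions for illustration only; `parallel_tests` has its own grouping strategies, and this is not its implementation.

```ruby
require "etc"

# Split a list of spec files into `group_count` groups, round-robin.
# Illustrative only: parallel_tests uses its own grouping strategies.
def partition_files(files, group_count)
  groups = Array.new(group_count) { [] }
  files.each_with_index { |file, index| groups[index % group_count] << file }
  groups
end

# Hypothetical file names, for illustration.
spec_files = %w[spec/a_spec.rb spec/b_spec.rb spec/c_spec.rb spec/d_spec.rb spec/e_spec.rb]

# One group per CPU core visible to this job.
groups = partition_files(spec_files, Etc.nprocessors)

groups.each_with_index do |group, index|
  next if group.empty?

  # A real runner would start one test process per group here.
  puts "process #{index + 1}: #{group.join(' ')}"
end
```

With the `parallel:` job parameter, each of those groups would instead be a separate job with its own checkout and setup.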

Ah yes! Thanks for pointing this out

I think we might want to prioritize this issue.

Context: https://gitlab.slack.com/archives/CMA7DQJRX/p1701335399791259?thread_ts=1701332100.889769&cid=CMA7DQJRX (Internal)

There is an rspec-core issue tracking the feasibility of adding this, and some interesting things have come up. Two gems have surfaced:

- turbo_tests, extracted from Discourse because of the same need
- flatware, which is more geared toward Cucumber tests but might be helpful

Perhaps these could be investigated and we can branch this off into its own epic? It would be nice to dodge `fatal: remote error: GitLab is currently unable to handle this request due to load.` with more regularity.

The road can be fraught with peril: https://blog.appsignal.com/2022/03/16/the-perils-of-parallel-testing-in-ruby-on-rails.html

> It would be nice to dodge `fatal: remote error: GitLab is currently unable to handle this request due to load.` with more regularity

Could you please elaborate on this? I thought that running more tests in parallel would increase the load on GitLab.com, making it more difficult to handle requests.

I also don't really understand how running tests in parallel within a single job is better than running tests in parallel across multiple jobs, like we're doing now. If our runners have multiple cores, then that could indeed utilize more CPU. I thought they were all virtualized, though, so maybe it's not too different from using more parallel jobs?

That error comes from Gitaly being unable to handle the many git clones.

The idea is that we lessen the number of clones needed if we parallelize within a job.
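As a rough sketch of that arithmetic, assuming each job performs one clone: the number of clones equals the number of jobs, which shrinks as more test processes run inside each job. The figures below are hypothetical and are not measurements from our pipelines.

```ruby
# Number of jobs (and therefore repository clones, assuming one clone per job)
# needed to run `total_processes` test processes when each job runs
# `processes_per_job` of them. Uses integer ceiling division.
def clones_needed(total_processes, processes_per_job)
  (total_processes + processes_per_job - 1) / processes_per_job
end

# Hypothetical numbers, for illustration only.
puts clones_needed(100, 1) # one test process per job
puts clones_needed(100, 4) # four test processes per job
```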

Thanks, that makes sense.

Given the complexity (mostly around the database and other persistent state), I might try using artifacts to pass the repository first; I think that will be simpler.

(I edited and removed your quoted messages)

Yeah, if it's only to lessen load on Gitaly, I think it's not worth the complexity.

> I might try using artifacts to pass the repository first then

We already implemented something like this before using CI_PRE_CLONE_SCRIPT to fetch the repo from a GCS bucket. But we removed it when we implemented caching at the Gitaly level. #39134 (comment 804417099)

I think the main benefit here is shortening the feedback loop during development. Improving CI will be a side effect. We have powerful multi-core machines but still execute tests in a single thread, which is quite painful.

> We already implemented something like this before using CI_PRE_CLONE_SCRIPT to fetch the repo from a GCS bucket. But we removed it when we implemented caching at the Gitaly level. #39134 (comment 804417099)

Yeah, I am aware and I wonder if we want to introduce something similar back.

> I think the main benefit here is reducing the feedback loop during development. Improving CI will be a side effect. We have powerful multi-core machines, but still execute single-threaded tests, which is quite painful.

This sounds like the main goal is for local development? In that case we need to update GDK as well. Since the issue title says "on the CI", I was always under the impression that this was about speeding up CI (as well as the epic about pipeline).

@godfat-gitlab

> This sounds like the main goal is for local development? In that case we need to update GDK as well. Since the issue title says "on the CI", I was always under the impression that this was about speeding up CI (as well as the epic about pipeline).

If we introduce parallelization with parallel_tests, turbo_tests, or some other gem, then it should work in both CI and GDK with minimal effort. My intention was to highlight the importance of this change and to remind us that we should keep local development in mind. I don't think it matters much which epic we put this issue in.

My point is that, looking at https://github.com/grosser/parallel_tests#add-to-configdatabaseyml:

~~It does not contain scripts to set up multiple databases, including PostgreSQL~~ (Edited: Actually, it probably does contain scripts to set up multiple databases: https://github.com/grosser/parallel_tests#create-additional-databases.) It still does not contain scripts to set up Redis, Gitaly, and so on, as mentioned in the issue description. Setting this up in CI and in GDK can be quite different as well.
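For reference, the `database.yml` approach in that README appends the `TEST_ENV_NUMBER` environment variable to the database name, so each test process gets its own database. Below is a minimal sketch of the same idea in plain Ruby. The `myapp_test` base name and both helper functions are hypothetical, and the Redis mapping is only an assumption about how a similar scheme might be extended to other services; it is not something `parallel_tests` provides.

```ruby
# Build a per-process database name from TEST_ENV_NUMBER.
# By default parallel_tests sets TEST_ENV_NUMBER to "" for the first
# process, then "2", "3", and so on.
def test_database_name(base, env = ENV)
  "#{base}#{env['TEST_ENV_NUMBER']}"
end

# Hypothetical extension: give each process its own Redis logical database.
# This helper is an illustration, not part of parallel_tests.
def test_redis_db(env = ENV)
  number = env["TEST_ENV_NUMBER"].to_s
  number.empty? ? 0 : number.to_i - 1
end

# Example with an explicit environment hash instead of the real ENV.
second_process = { "TEST_ENV_NUMBER" => "2" }
puts test_database_name("myapp_test", second_process)
puts test_redis_db(second_process)
```

Gitaly and other persistent services would need a comparable per-process isolation scheme, which is where most of the complexity lies.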

We should start with either CI or GDK. They solve very different problems, and we need to know which problem we're trying to solve in order to decide which to try first and whether it actually solves the problem.

Specifically, I am unsure what we can solve for the CI issue. Local development is a separate concern, and if that's the main goal here, we can repurpose the issue (or create a new one so that we don't mix ideas).

> Given the complexity (mostly around database and other persistent states), I might try using artifacts to pass the repository first then I think this will be simpler.

It's not relevant to this issue, but since it's brought up, I made a merge request to do this for RSpec jobs: !140330 (merged)
