Runner Fleeting / Taskscaler / GRIT Test Plan
Description
The Next Runner Auto-scaling Architecture replaces Docker Machine with a system composed of Fleeting for instance provisioning, Taskscaler for autoscaling, and two new executors (Instance and Docker-Autoscaler) for running jobs in the new architecture.
These new components need to be tested at the unit, integration, and end-to-end levels. This issue outlines what testing is in place, what is needed for completeness, and which environments / dimensions we should test.
Proposal
Where do we start runner manager?
______________________________________/\____________________________________
/ \
| |
+------------------------------------------------+-----+-----+-----+-----+-----+ ___
| GitLab.com (runner-incept) | GKE | GCE | EC2 | EKS | ... | \
| +---------------------+-----+-------+-------+ | | | | | | |
| | Runner Binary | | | | | | | | | | |
___ | | +-----------------+ | | | | | | | | | | |
Runner / | | | Runner Packages | | ... | ... | ... | | | | | | | | Runner
Integ. | | | +-----------------+ | | | | | ... | ... | ... | ... | ... | | E2E
Tests | | | | Taskscaler | | | | | | | | | | | | Tests
\___ | | +-----------------+ | | | | | | | | | | |
| +---------------------+-----+-------+-------+ | | | | | | |
| | Fleeting Plugin AWS | GCP | Azure | Local | | | | | | | |
| +---------------------+-----+-------+-------+ | | | | | | ___/
+------------------------------------------------+-----+-----+-----+-----+-----+
| |
\___________________ ____________________/
\/
Where do we run the job (runner)?
Test Types
The runner binary can be started in a variety of environments. For testing purposes we start the runner binary inside a GitLab job, hence the name "runner-incept". A GitLab job is started in a runner; that job downloads the runner binary (and a plugin), registers it with GitLab, and then runs a "hello world" type job to verify it works (a runner in a runner).
However, this doesn't truly replicate a setup customers would use. A true end-to-end test would set up an environment such as a cluster in Google Kubernetes Engine (GKE) or Amazon Elastic Kubernetes Service (EKS), or a VM in Google Compute Engine (GCE) or Amazon EC2.
We also have integration tests for each component. The runner integration tests are standard Go tests with a file suffix of integration_test.go and a //go:build integration build tag. These runner tests execute jobs in an actual environment (local shell, Kubernetes cluster, etc.), but they stub out GitLab, inject the JobResponse payload directly, and verify the job output logs without sending them upstream. Likewise, each plugin can have an integration test which stubs out the fleeting API calls and verifies the resources created in the target environment.
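As a rough illustration, the sketch below shows the general shape such a runner integration test takes. It is not code from the runner repository: the local-shell execution here is a placeholder standing in for the runner's real helpers, which build a full JobResponse payload and run it through an actual executor while capturing the trace locally.

```go
//go:build integration

// The file name would carry the integration_test.go suffix,
// e.g. shell_integration_test.go (hypothetical name for this sketch).
package executor_test

import (
	"os/exec"
	"strings"
	"testing"
)

// runHelloWorldJob is a placeholder for "inject a JobResponse and execute it
// with a local executor"; here it simply runs the job script in a local shell.
func runHelloWorldJob(t *testing.T) string {
	t.Helper()
	out, err := exec.Command("sh", "-c", "echo hello world").CombinedOutput()
	if err != nil {
		t.Fatalf("job failed: %v\n%s", err, out)
	}
	return string(out)
}

func TestHelloWorldJob(t *testing.T) {
	trace := runHelloWorldJob(t)

	// Verify the job output locally instead of sending logs to GitLab.
	if !strings.Contains(trace, "hello world") {
		t.Fatalf("expected trace to contain %q, got:\n%s", "hello world", trace)
	}
}
```

A plain go test skips this file because of the build tag; running go test -tags integration ./... includes it.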
And each package (and sometimes each file) has unit tests covering small units of functionality.
So there are four kinds of tests relevant to the runner:
- runner-incept, a pseudo end-to-end test
- end-to-end tests (not yet implemented)
- integration tests (partially implemented)
- unit tests
We won't test the combination of every runner environment, every job environment, and every scenario in every test type. Instead, we will get comprehensive test coverage of each dimension in a single test type.
Runner Incept
We will test a single "hello world" scenario in each of the plugins in runner-incept:
- AWS plugin (Runner Incept test for fleeting AWS plugin (#29437 - closed) • Unassigned • 16.4)
- GCE plugin (Runner Incept test for GCE Fleeting Plugin (#36791 - closed) • Unassigned • 16.5)
- Azure plugin (Runner Incept test for Azure Fleeting Plugin (#36792) • Unassigned • 17.2)
- Kubernetes plugin
- Static plugin
End-to-End
We will test a single, realistic user scenario end-to-end in each environment in which a user might set up the runner:
- DIND (DIND E2E testing (gitlab-org/ci-cd/runner-tools/grit#68) • Unassigned)
- GKE (End-to-End test runner manager in GKE (#36793) • Unassigned)
- GCE (End-to-End test runner manager in GCE (#36794) • Adrien Kohlbecker • 17.2)
- EC2 (End-to-End test of runner manager on EC2 (#36798 - closed) • Joe Burnett • 16.7)
- EKS (End-to-End test of runner manager on EKS (#36797) • Unassigned)
- AKS (End-to-End test of runner manager of AKS (#36795) • Unassigned)
- Azure VMs (End-to-End test of runner manager on Azure VM (#36796) • Unassigned)
- etc.
Integration
We will test a variety of autoscaling cases in the Taskscaler integration tests (a toy sketch of the test shape follows this list):
- Stable. Verify a stable rate of jobs produces a stable number of instances.
- Scale-up. Verify an increasing number of jobs increases the number of instances.
- Scale-down. Verify a decreasing number of jobs decreases the number of instances (after the idle period).
(Integration tests (gitlab-org/fleeting/taskscaler#3 - closed) • Arran Walker • 16.5)
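To make the shape of these cases concrete, here is a toy sketch. It deliberately does not use Taskscaler's real API: fakeInstanceGroup and desiredCapacity are hypothetical stand-ins for a stubbed fleeting provider and the autoscaling policy, which the actual integration tests exercise through Taskscaler itself against a fake instance group.

```go
//go:build integration

package taskscaler_test

import "testing"

// fakeInstanceGroup stands in for a stubbed fleeting plugin: it only records
// how many instances are currently requested. Hypothetical, not the fleeting
// provider interface.
type fakeInstanceGroup struct{ instances int }

func (g *fakeInstanceGroup) resize(n int) { g.instances = n }

// desiredCapacity mimics the core autoscaling decision: one instance per
// pending task plus a fixed idle count. Taskscaler's real policy is richer;
// this exists only to make the test structure visible.
func desiredCapacity(pendingTasks, idleCount int) int {
	return pendingTasks + idleCount
}

func TestScaleUpAndScaleDown(t *testing.T) {
	group := &fakeInstanceGroup{}
	idle := 1

	// Scale-up: an increasing number of jobs should increase the instance count.
	group.resize(desiredCapacity(10, idle))
	if group.instances <= idle {
		t.Fatalf("expected scale-up beyond %d idle instance(s), got %d", idle, group.instances)
	}

	// Scale-down: once the queue drains (and the idle period passes), only
	// the idle instances should remain.
	group.resize(desiredCapacity(0, idle))
	if group.instances != idle {
		t.Fatalf("expected scale-down to %d idle instance(s), got %d", idle, group.instances)
	}
}
```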
And we will test each autoscaled executor in the runner integration tests:
- Instance
- Docker-Autoscaler
(Taskscaler-based executor integration tests (#30880 - closed) • Arran Walker • 16.6)
Unit
And of course we aim for 100% unit test line coverage, but in practice we set the bar at around 85-90%:
- Taskscaler (Taskscaler (autoscaler) executor unit tests (#29318 - closed) • Davis Bickford • 16.0)
- Internal Autoscaler Executor (Taskscaler (autoscaler) executor unit tests (#29318 - closed) • Davis Bickford • 16.0)
- AWS Plugin (gitlab-org/fleeting/fleeting-plugin-aws#6)
- GCE Plugin (Unit tests in GCP plugin (#29438 - moved) • Unassigned • 16.5)
- Azure Plugin (Unit test Azure plugin (#36800) • Unassigned)
- Static Plugin
- Kubernetes Plugin