Cluster Image Scanning (Vulnerability Scans against Running Containers) (#3410) · Epics · GitLab.org

Cluster Image Scanning (Vulnerability Scans against Running Containers)

### Release notes  ### Problem to solve  Although customers are able to do container scanning as part of their pipeline jobs today, there is no guarantee that the images for the containers that are running in production have been scanned recently. Some customers have production container images that were deployed several years ago and have not been updated. Users need a way to regularly re-scan the container images that are actually running in production so they can understand their current security risk. ### Intended users  Primary Personas: * [Devon (DevOps Engineer)](https://about.gitlab.com/handbook/marketing/product-marketing/roles-personas/#devon-devops-engineer) * [Delaney (Development Team Lead)](https://about.gitlab.com/handbook/marketing/product-marketing/roles-personas/#delaney-development-team-lead) * [Cameron (Compliance Manager)](https://about.gitlab.com/handbook/marketing/product-marketing/roles-personas/#cameron-compliance-manager) Secondary Personas: * [Sasha (Software Developer)](https://about.gitlab.com/handbook/marketing/product-marketing/roles-personas/#sasha-software-developer) * [Sam (Security Analyst)](https://about.gitlab.com/handbook/marketing/product-marketing/roles-personas/#sam-security-analyst) * [Alex (Security Operations Engineer)](https://about.gitlab.com/handbook/marketing/product-marketing/roles-personas/#alex-security-operations-engineer) ### User experience goal  ### Proposal  1. Users will be able to schedule regular container scans of the images that were used to initialize the containers that are running in their production environment 1. The container scan will identify known vulnerabilities (CVEs) in the OS and in the packages that are installed on those images 1. The scan findings will be displayed on the Vulnerability Report under a new `Operational Vulnerabilities` tab. 1. Users will be able to filter the operational vulnerability results by which Kubernetes cluster those results came from. The "cluster" will map back to clusters connected via GitLab Kubernetes Agents. 1. The scanner will not require additional credentials beyond what is already collected when connecting a Kubernetes cluster to GitLab. 1. Users will be able to view vulnerabilities related to a specific cluster in a new `Security` tab when viewing agent-connected clusters. This tab will not be available for certificate-connected clusters. **Note:** These requirements have been updated from the original list to remove support for Kubernetes clusters connected via the certificate method given that connection method has been deprecated. #### Designs (please see [Design issue](https://gitlab.com/gitlab-org/gitlab/-/issues/219173) for more details) - 🎨 [Figma file](https://www.figma.com/file/w1foPwswNuOKWgi2IvLHZm/Running-container-vulnerabilities?node-id=175%3A0) - 📽 [Video walkthrough](https://www.loom.com/share/904a1b795833416b9e251cea68ddcab7) - 🎟 [Design issue](https://gitlab.com/gitlab-org/gitlab/-/issues/219173) **Vulnerability report** | `Development vulnerabilities` (this is the existing vulnerability report) | `Operational vulnerabilities` | Empty state (no policies) | Empty state (no vulnerabilities) | | ------ | ------ | ------ | ------ | | ![vuln-list-project](/uploads/eafdf7a628936ca1893dd40473edb043/vuln-list-project.png) | ![environment-vulns-default-filter](/uploads/9cb7e48b1750dcdc4cc66461b495bac2/environment-vulns-default-filter.png) | ![environment-vulns-empty-state](/uploads/58be95692853d688cf82c5b2b897170b/environment-vulns-empty-state.png) | ![environment-vulns-no-results](/uploads/25680fefd5434f6b16ed0ebbe9876b1c/environment-vulns-no-results.png) | **Cluster detail page** | Agent managed cluster (security tab) | Agent managed cluster (access tokens tab) | | ------ | ------ | | ![Agent_managed_cluster_-_security](/uploads/2ada2fef057be1005b7cf6bc87276d99/Agent_managed_cluster_-_security.png) | ![Agent_managed_cluster_-_access_tokens](/uploads/4d8a851ee5b72761fdfb60f4954d768c/Agent_managed_cluster_-_access_tokens.png) | ### Further details  ### Implementation Details #### 1. Iteration 1 - add ability to start Cluster Image Scanning job Requirements: * user can get vulnerability reports from the cluster where Starboard Operator is configured, * user can provide additional token in Kubernetes cluster settings in GitLab to provide token to `vulnerability-viewer` service account (with `get`/`list` permissions to `vulnerabilityreports.aquasecurity.github.io`), * documentation is updated with information on how to install and use Starboard Operator with GitLab, How we can achieve that? 1. Implement new template (`lib/gitlab/ci/templates/Security/Cluster-Image-Scanning.gitlab-ci.yml`) that will responsible for performing the scan. 1. Implement new analyzer (like https://gitlab.com/gitlab-org/security-products/analyzers/cluster-image-scanning) to get the results from the cluster. 1. Update the documentation with additional information about cluster image scanning 1. Extend the `Enums::Vulnerability::REPORT_TYPES` const with new report type `cluster_image_scanning`. #### 2. Iteration 2 - add ability to schedule Cluster Image Scanning job periodically Requirements: * user can schedule Cluster Image Scanning job using Scheduled Scan Execution Policies, * documentation is updated with information how to configure Scheduled Scan Execution Policy to start Cluster Image Scanning job, How we can achieve that? 1. Extend service responsible for scheduling security jobs (implemented in https://gitlab.com/gitlab-org/gitlab/-/issues/325230) with ability to schedule Cluster Image Scanning Scan #### 3. Iteration 3 - use Kubernetes Agent to fetch results **Note:** Before doing this iteration we need to understand and find a place to put these vulnerabilities from Kubernetes Agent. In previous iterations, we are reusing the current mechanism (successful pipeline for default branch -> creates vulnerabilities from scanner JSON report into the database), in this iteration we cannot use that mechanism. We need to detach vulnerability from the pipeline and store it (both on the UI and on the backend side) differently. Currently, we are working on the design for that: https://gitlab.com/gitlab-org/gitlab/-/issues/219173/ Requirements: * user can fetch vulnerabilities from Starboard Operator through Kubernetes Agent, * user sees vulnerabilities found in running containers in Security Dashboard in Container Tab (https://gitlab.com/gitlab-org/gitlab/-/issues/219173/) How we can achieve that? 1. Extend `kas` to support reading vulnerabilities from Starboard Operator (similar to what we did with Cilium Alerts: https://gitlab.com/gitlab-org/cluster-integration/gitlab-agent/-/merge_requests/211) (we need to add new `ClusterRole` to `list`/`get` `vulnerabilityreports.aquasecurity.github.io`) 1. Extend `API::Internal::Kubernetes` (https://gitlab.com/gitlab-org/gitlab/blob/master/ee/lib/ee/api/internal/kubernetes.rb#L12) to support creating vulnerabilities for related projects. ### Permissions and Security  There will be no change to current permission levels for the Security Dashboard or the Vulnerability Report. ### Documentation  1. Documentation will be added to describe how to start using this feature and to schedule a container scan against a production environment 1. Existing documentation about the [Security Dashboard and Vulnerability Report](https://docs.gitlab.com/ee/user/application_security/security_dashboard/) will be edited to note that findings from Container Scans against production environments will be displayed. ### Availability & Testing  1. Tests will be performed to verify that this feature continues to work when users have enabled Container Network Security and Container Host Security with the default settings 1. Tests will be performed to assess the performance impact of running a scan against a cluster 1. Tests will be performed to verify that running a scan does not interfere or prevent the production application from continuing to run and service requests during the duration of the scan ### What does success look like, and how can we measure that?  ### What is the type of buyer?  Use of the scans will be available down to ~"GitLab Core" Viewing the scan findings in the Security Dashboard and Vulnerability Report will be limited to ~"GitLab Ultimate" ### Is this a cross-stage feature?  ### Links / references  *This page may contain information related to upcoming products, features and functionality. It is important to note that the information presented is for informational purposes only, so please do not rely on the information for purchasing or planning purposes. Just like with all projects, the items mentioned on the page are subject to change or delay, and the development, release, and timing of any products, features, or functionality remain at the sole discretion of GitLab Inc.*

epic