Spike Research: How can insights.yml be enhanced to serve as a GitLab customizable analytics dashboard?

What is the future of the insights YAML format? Is there a new version? Should we upgrade to the new version? Will that cause breaking changes and disruption for current Insights users

The insights YAML file format was developed in-house, there is no new version. For introducing YAML "schema" changes should happen in a backwards-compatible manner.

Can insights.yml embed in: VSA? Issues?

Do you mean a list of issues? There is no support for embedding lists. Lists are problematic due to pagination, probably something like top 10/20 issues would work.

Can we add custom drill-down link?

We can extend the YAML schema to support linking, assuming that the underlying charting library can handle links.

Can we add auto page refresh to load data every 15 minutes?

Currently, the insights feature uses real-time API calls so the data can be considered fresh on page load. I'd advise against adding an auto page refresh feature because it'd add unnecessary load to our backend (a user leaves the browser open).

Can we add counters tiles in Insights with near real-time measurements - data load every 15 minutes.

See my answer above.

Do you mean like total issue count? Should be doable, keep in mind that large COUNT() queries will be slow and can cause page load failures. See my related research: &7964

Can we configuration insights to allow for a collection of charts per page?

There is a grouping feature built into insights, users can define different pages with different chart configurations.

Can we add markdown Table of Contents (ToC)?

Do you mean switching from the Page dropdown to having a clickable ToC? Or do you mean adding more text, like a description for each page/chart?

Thanks for working on this @ahegyi! We've removed the Seeking community contributions label to avoid having multiple people working on the same issue.

removed [deprecated] Accepting merge requests label

Thanks @ahegyi for the clarification!

Can insights.yml embed in: VSA?

In VSA for example:

Can we giving users the option to embed below the Tasks by type chart a customize insights.yml report?
Can we give users the option to replace the VSA overview tiles with embed customize insights.yml tiles?

Do you mean like total issue count? Should be doable, keep in mind that large COUNT() queries will be slow and can cause page load failures

Yes.

Why not adding limitations to the API to avoid the slow queries?

Can we configuration insights to allow for a collection of charts per page?

There is a grouping feature built into insights

These metrics dimensions are requested by customers:

Can we support these dimensions for grouping?

Can we support these dimensions for filtering?

Other than performance, is there a limit to the number of reports per page?

Can we add markdown Table of Contents (ToC)?

Do you mean switching from the Page dropdown to having a clickable ToC? Or do you mean adding more text, like a description for each page/chart?

Both, but also to gives users an overview of the page contents, how it organized and to allow the user to go directly to a specific report within the page.

cc @danmh

@hsnir1,

Can we giving users the option to embed below the Tasks by type chart a customize insights.yml report?

The Task by Type chart shows the daily distribution of issues and MRs by the top 10 labels in the Group. If the user who configures the YML "knows" the top labels, then the Task by Type chart can be replicated in Insights.

Can we give users the option to replace the VSA overview tiles with embed customize insights.yml tiles?

It should be doable however, it requires careful planning:

Chart definition in insights
- Extend the YML definition
Add a chart to a page
- Where to place the chart?
- Sizing, will the chart fit into the page?
- CSS positioning issues
- Persisting the charts. We need to store the chart positions somewhere.

Why not adding limitations to the API to avoid the slow queries?

We already have limits on the queries, if the query runs 15s, the backend will cancel the query. The page will show an error message (broken feature).

These metrics dimensions are requested by customers:

AFAIK insights don't support dimensioning: it can "emulated" to some extent by running insights from a specific group or subgroup. Dimension/grouping is a quite expensive operation on the DB level, I don't think it would be performant enough with the current DB setup.

Other than performance, is there a limit to the number of reports per page?

No limits.

mentioned in issue gitlab-org/manage/general-discussion#17523

As a test case for this research, here is an excellent customer example for a customizable dashboard.

Base on this example, here is my proposal for a POC that will include only VSA and DORA:

In VSA Overview stage, at the bottom of the page, add the insight page dropdown.
Add a new insight page "DORA Metrics Tredns" with line chart for each DORA metrics.
- For this POC, the charts date range is the past 30 days.
- No filters for this POC.
The VSA URL will include the new DORA insight page so users can shared the link across the organization.

@ahegyi Based on your findings, is this POC feasible? Can you estimate the effort involved?

cc @danmh

@hsnir1,

In VSA Overview stage, at the bottom of the page, add the insight page dropdown.

The VSA landing page already makes several API calls to load the VSA data (aggregated data). Adding more API calls, and more things to load can slow down the page considerably => increase the error budget

We already have a date range selector for VSA, a different date range and the lack of filtering for this chart might confuse the users.

Add a new insight page "DORA Metrics Tredns" with line chart for each DORA metrics.

The Insights feature is configured by the end-user in the project's git repository using a YAML file. One way to make this work without configuration is to build an Insights configuration structure in memory and call the Insights' data collector.

Breakdown:

Adapt the Insights YAML to support different query types (DORA). Weight: 8
Create an endpoint where we can request DORA insights data (4 DORA metrics). Weight: 4

The Insights feature is configured by the end-user in the project's git repository using a YAML file

Correction: we provide a default YAML file. Users can replace it by committing a new YAML file in their repo.

@hsnir1, do you think we can conclude this spike issue?

@ahegyi Yes, could you briefly summarize the main pros & cons and your conclusion?

@hsnir1,

The Insights feature can be extended to use different data sources for rendering the charts. Having a standard way to describe charts is a big win IMHO. This gives us the ability to add or embed charts to various GitLab pages.

Pros:

We already use a YAML-based schema. Extension in a backwards-compatible manner is possible.
The chart rendering is already there, if we don't want to introduce new chart types then UI work will be minimal. Only the backend needs to change.

Cons:

In-house built, non-standard. Users might find it difficult to configure the charts.
The feature may become very complex once we start adding more features, linking/drill-down, new chart types, and new data sources.

I appreciate your summary of all these options, @ahegyi . This research is very helpful thanks!

@m_gill @danmh @nagyv-gitlab base on these results, I think we need to move forward and leverage insight to solve the Customizable Value Stream Dashboard & Reports -... (&8335) requests.

LMKWYT

Thank you for your thorough investigation @ahegyi.

base on these results, I think we need to move forward and leverage insight to solve the Customizable Value Stream Dashboard & Reports -... (&8335) requests.

@hsnir1 I agree, as insights is already for customisable reports I think it makes sense conceptually.

Spike:

Let's see a simple chart configuration from our docs:

bugsCharts:
  title: "Charts for bugs"
  charts:
    - title: "Monthly bugs created"
      description: "Open bugs created per month"
      type: bar
      query:
        issuable_type: issue
        issuable_state: opened
        group_by: month
        period_limit: 12

The outer key (bugsCharts) provides grouping for the charts (`array) for example, exposing DORA could be exposed as a group:

bugsCharts:
  title: "Charts for bugs"
  ...
dora:
  title: DORA4 metrics
  ...

The charts array contains the chart definitions, these are currently very specific to the Issuable models (Issue, MergeRequest).

A possible extension to the schema would be the following:

bugsCharts:
  title: "Charts for bugs"
  charts:
    - title: "Monthly bugs created"
      description: "Open bugs created per month"
      type: bar
      query:
        data_source: issuable
        params:
          issuable_type: issue
          issuable_state: opened
          group_by: month
          period_limit: 12
    - title: "DORA DF"
      description: "Deployment frequency"
      type: bar
      query:
        data_source: dora
        params:
          metric: deployment_frequency
          environment_tiers: 
            - production
          period_limit: 12

Within the codebase the parameter handling is implemented in the InsightsActions module which is mixed into the controller.

Idea:

Controller -> InsightsDataRequest -> Data -> Serializer

Move out query parameter related logic from the controller into the new InsightsDataRequest class. This class is responsible for validating the incoming query params and deciding which data source to invoke (and how to invoke it). The returned data is passed into the Serializer.

Pseudo code:

if params[:query][:data_source] == 'dora'
  # validate
  Dora::AggregateMetricsService...
  # serialize
elsif params[:query][:data_source] == 'issuable'
  # validate
  Gitlab::Insights::Finders::IssuableFinder...
  # call reducers
  # serialize
else
  # backward compatibility for the old YAML schema, call the issuable code
end

POC screenshot:

marked this issue as related to #366729 (closed)

removed the relation with #366729 (closed)

marked this issue as related to #367178 (closed)

mentioned in issue #349679

mentioned in issue #354653 (closed)

mentioned in issue #367248 (closed)

@ahegyi This issue looks like it may slip this current milestone. Can you leave a or to signify if you are on track to deliver this issue? Please also consider updating the issue's Health Status or Milestone to reflect its current state, and communicate with your Product Manager as appropriate.

Bot policy.

@ahegyi I think this can be closed, right?

Yep, closing it.

closed

mentioned in issue #364432 (closed)

mentioned in epic &6877

mentioned in issue #372215 (closed)

added devopsmonitor label and removed devopsfoundations label

mentioned in epic gitlab-org#6877

Spike Research: How can insights.yml be enhanced to serve as a GitLab customizable analytics dashboard?

Overview

Problem to solve

Investigation and clarification questions:

Supported materials

Expected Outcome

Designs

Child items ...

Activity

Spike Research: How can insights.yml be enhanced to serve as a GitLab customizable analytics dashboard?

Overview

Problem to solve

Investigation and clarification questions:

Supported materials

Expected Outcome

Relates to

Activity