Currently, the spin-up of this instance is gated by the entire test stage, which may not need to pass in full for the review app to be up and running. We should consider deploying the review app even when some tests are failing.
Proposal
Have review apps be deployed in a more aggressive manner. This can be done once we have resolved all the dependencies and achieved a stable pass rate. We need to determine which tests are allowed to fail. E.g.
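As an illustration only, here is a minimal `.gitlab-ci.yml` sketch of what such decoupling could look like. It reuses the job names mentioned later in this thread (`review-build-cng`, `review-deploy`), but the stage layout, the lint job, and the script paths are placeholders, not the actual gitlab-org/gitlab configuration:

```yaml
stages:
  - build
  - test
  - review

review-build-cng:
  stage: build
  script:
    - ./scripts/trigger-build-cng        # hypothetical build-trigger script

# A test we might decide is allowed to fail without blocking the review app.
lint:
  stage: test
  script:
    - yarn run lint                      # placeholder lint command
  allow_failure: true

review-deploy:
  stage: review
  # `needs` starts the deploy as soon as the build job succeeds,
  # instead of waiting for every job in the test stage to finish.
  needs: ["review-build-cng"]
  script:
    - ./scripts/review-apps/deploy       # hypothetical deploy script
```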
thanks @meks. I'll copy my original statement here:
Quick question related to review apps. Are we able to make review apps spin up even if tests do not pass?
Review apps can be seen as a test themselves. The experience and usefulness of review apps is severely degraded when a review app becomes unavailable only because, for example, some linting/ee-compat/code-quality tests fail. The iteration cycle between dev and review should be as short as possible.
@vkarnes it would be great if we could give this some priority from a gitlab-ce~2024184 perspective. Often the review apps are not working/deployed for this reason. A simple test failure often proves to be a blocker for deploying an environment which is meant to be a test in itself.
If it only needs a rearrangement of the CI stages/configuration, the Engineering Productivity team can do this. We are trying to make review apps faster and more stable before allowing more usage.
@meks this issue was discussed in the UX/FE meeting of 22 Aug 2019. Do we have a new update regarding progress on this? It would be very welcome for the functional and visual review process.
Is there a possibility to give this issue a bit more visibility?
Once these 2 workstreams are completed, we can open this up more. If we enabled it now, it would add more complexity to the configuration. I have optimistically put %12.5 on this.
@kwiebers I am assigning this to you for now since we need someone to drive this from Eng Prod. I am optimistically setting this to %12.5; by then we should have more stable deploys, with metrics to back it up, so review apps can be expanded further.
review-build-cng and review-deploy were adjusted to run with fewer dependencies in !24803 (merged). Are you still experiencing the same friction with how often review apps are deployed?
@kwiebers bringing visibility to the fact that this topic was discussed in the UX weekly yesterday and it still seems to be a problem. Would love for you to look into this.
I'd like to hear more about Review App success challenges. I know the success rate had been lower than 70% since February but improved to 90% in April.
@dimitrieh @mvanremmerden - Are you and the other UX team members still seeing challenges since some of the corrective actions taken during April?
Some of the other items that drew me in were related to these topics:
Feature flags for review apps
There are a few ways to set Feature Flags in a review app:
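For illustration only, one such approach might be a manual CI job that toggles a flag on the running review app through GitLab's Features API (`POST /api/v4/features/:name`); the job name, token variable, flag name, and URL variable below are all hypothetical:

```yaml
review-enable-feature-flag:
  stage: review
  needs: ["review-deploy"]
  when: manual                           # run on demand once the review app is up
  script:
    # The Features API requires an administrator token on the target instance.
    # REVIEW_APP_URL, REVIEW_APP_ADMIN_TOKEN, and my_feature_flag are placeholders.
    - >
      curl --request POST
      --header "PRIVATE-TOKEN: ${REVIEW_APP_ADMIN_TOKEN}"
      --data "value=true"
      "${REVIEW_APP_URL}/api/v4/features/my_feature_flag"
```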
I think this could be improved and created #216838 (closed) to track this.
Slow cycle time for quick changes
This definitely falls under the Engineering Productivity area. The quote that resonated with me was "CSS took 15 minutes to change and MR has lingered for a couple of days".
I appreciate hearing cases of very small MRs where the MR process seems to take longer than it should.
@hollyreynolds and @pedroms - Could you share some examples of these MRs or changes to look at a little closer?
In addition to that, most of my MRs are around the Web IDE, and in the past I also had trouble with getting that to work in review apps, but that might have been solved as we also made some changes to be more flexible for the gdk, as can be seen here: gitlab-development-kit!982 (comment 315333982)
This type of review app failure would not surface in the Deployment Success Rate metric that I referred to above. If you observe issues where review-deploy passes but the deployment is not working how you would expect, please let us know either on Slack (#g_qe_engineering_productivity) or in an issue in gitlab-org/gitlab.
In addition to that, most of my MRs are around the Web IDE, and in the past I also had trouble with getting that to work in review apps
@mvanremmerden - Can you expand on the trouble that you've had with Web IDE for Review Apps? Is it the same CORS error or something else?
but that might have been solved as we also made some changes to be more flexible for the gdk, as can be seen here:
Review apps use the GitLab Charts and aren't leveraging the GDK for these types of improvements. I'd be curious whether the issues are the same, because it may be a Review App implementation problem or an upstream GitLab Charts problem.
If you observe issues where review-deploy passes but the deployment is not working how you would expect please let us know on either Slack (#g_qe_engineering_productivity) or an issue in gitlab-org/gitlab.
I will try to do that more again, thanks!
Can you expand on the trouble that you've had with Web IDE for Review Apps?
The Web IDE was relying on a certain host that was configured somewhere in the environment. As you can see in the gdk example, that was something that broke in a couple of places over the last year (and if I remember correctly, at a certain point, the review apps were one of them), but we made some changes to make that more flexible, so it might already be fixed.
@kwiebers Here are a couple of MRs that hung around for me for a while but this was primarily due to pipeline failures and addressing problems there. Not sure if that helps? !29212 (merged) !30636 (merged)
@kwiebers @caalberts yesterday I opened 4 merge requests with minor changes (changing icons or replacing a button with a new component). All of them failed to spin up the review app:
Replace fa-link icons with GitLab SVG link icon (!36973 (merged))
Replace fa-plus icons with GitLab SVG plus icon (!36972 (merged))
Replace with in app/assets/javascripts/pipelines/components/graph/linked_pipeline.vue (!36968 (merged))
Updates deprecated button in in app/assets/javascripts/pipelines/components/graph/action_component.vue with Pajamas component (!36966 (closed))
What can I do differently here to make sure the review apps are reliable?
Additionally, there is an interesting conversation in Slack:
I’d like to understand our product testing capabilities better. Should either gdk or a review app be viewed as a better way to test? Or are they equal? Why are review apps created from some MRs, but not others?
Review apps are created via a CI job. If the job doesn't run or fails before the review app is deployed, no review app will be available. Projects that are not using CI or do not have the necessary job will not have a review app created.
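For context, a bare-bones sketch of how such a job is typically wired up in `.gitlab-ci.yml` (the script path, URL, and rule are placeholders, not the gitlab-org/gitlab configuration); the environment and its URL only exist if this job actually runs and succeeds:

```yaml
review-deploy:
  stage: review
  script:
    - ./scripts/deploy-review-app "$CI_COMMIT_REF_SLUG"   # hypothetical deploy script
  environment:
    name: review/$CI_COMMIT_REF_SLUG
    url: https://$CI_ENVIRONMENT_SLUG.review.example.com
  rules:
    - if: '$CI_MERGE_REQUEST_IID'        # only create review apps for merge request pipelines
```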
In my experience, review apps are great if you are reviewing something in the application that is fairly basic and the job runs/passes. Review apps are not set up to include every feature, but we have this issue open to populate review apps further #25297 (closed)
This is where GDK comes in, because we have special setups that allow us to configure certain features to test locally.
Imo review apps are the superior testing method in an ideal world.
They allow anyone to spin up the testing environment in an instant just by using a browser. No complicated tools necessary. This is incredibly powerful!
This makes it possible to evangelise functional reviews to additional stakeholder personas such as PMs, tech writers, marketing, etc.
They will also soon be made available to non-project members with #22090 (closed)
However, there are currently some drawbacks to using a review app:
1. They rely on a successful pipeline, meaning that if it fails or is not kicked off there is no review app. (This happens fairly often.)
2. Review apps are only kept online for 2 days after spinning up, AFAIK. This means any review app spun up on Friday is not available on Monday. Alternatively, this requires activity on the merge request within a 2-day period (this is a recurring theme). A configuration sketch for this follows the list.
3. Review apps are indeed, as stated above, not always provisioned with the data needed to test the feature in dev. Additionally, any data setup is deleted after a new commit is pushed and a new review app is spun up (again a recurring theme).
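As an aside on point 2, a minimal sketch of how that lifetime maps to GitLab CI's `environment:auto_stop_in` keyword, assuming placeholder job names and scripts rather than the actual gitlab-org/gitlab configuration:

```yaml
review-deploy:
  stage: review
  script:
    - ./scripts/deploy-review-app        # hypothetical deploy script
  environment:
    name: review/$CI_COMMIT_REF_SLUG
    url: https://$CI_ENVIRONMENT_SLUG.review.example.com
    on_stop: review-stop
    auto_stop_in: 2 days                 # tear the review app down 2 days after the last deploy

review-stop:
  stage: review
  when: manual                           # also lets reviewers stop the environment on demand
  script:
    - ./scripts/stop-review-app          # hypothetical teardown script
  environment:
    name: review/$CI_COMMIT_REF_SLUG
    action: stop
```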
In the past I have opened issues to improve 1. and 2. (I would need to look them up, though). However, for review apps to become a true tool in our arsenal, this needs greater prioritisation.
Hope that helps!
Thanks for all that. So overhead aside, it sounds like review apps are preferred, but gdk would be more robust?
Review apps save you from checking out a branch and spinning up gdk so I can see why it would be preferred by many who are not doing development.
I don't personally prefer one over the other. I often want to make changes in the codebase while reviewing, so I usually default to gdk. They both have their place, depending on the MR you are reviewing.
@dimitrieh - How can Engineering Productivity share these types of breaking issues for Review Apps in a more efficient manner? Would communication in #ux on Slack be the best place?
@kwiebers I think that would be a great place to start.
The problem here is that every review app deploy that fails without a good reason decreases the trust placed in review apps in general. My assumption about the general thinking is: can I rely on this or not?
@kwiebers @caalberts @rymai FYI, here are the results from the poll. I would love to know how I can help further and to see how we can further prioritize and improve our review app offering here!
Thanks for this useful information, @dimitrieh. The friction aspect is very helpful. We are actively planning how to seed the data in a more efficient manner; this will likely be a Q3 OKR for us, and it will be used in review apps as well: gitlab-com/www-gitlab-com#7397 (closed)
The reliability aspect is being measured as a KPI. I wanted to call out that some of these errors also occur because we are dogfooding our cloud-native installer, which still has room for improvement.
We will have work along these lines to improve the situation. cc @kwiebers @tpazitny for the demo data aspect.
That sounds great, @meks! Wondering if a collaboration with UX might prove useful here to keep tabs on the perceived experience vs. the raw data, as that will ultimately influence how much review apps are used as well. A platform for error reporting might also prove helpful, so failures do not go without reason/background knowledge.
@dimitrieh do we have a known catalog of seed/demo data that the UX dept uses frequently? We could look at expediting some of this data into the review apps.
If I remember correctly, having demo projects correctly set up for testing features is something that the Secure team is also struggling with, so this might be interesting as well for @jmandell.
Thanks for the ping, @mvanremmerden. Actually, both of my teams have had issues getting a GDK setup that supports their use cases. Configure/Monitor struggle with getting clusters and Terraform set up etc. to see those UIs in action, and Sec/Def also need dummy projects to see their UI in action.
For Configure features, you need to set up QA Tunnel access in order to create a cluster and install GitLab Runner. This allows you to test Auto DevOps or other features such as Serverless and Prometheus. This isn't as simple as having a demo project but maybe someone smarter than me can figure out a way to automate this setup further.
If using a review app, you could skip the QA tunnel portion needed for local development but I imagine a blocker could potentially be the cost of automatically spinning up a cluster for each review app?
@gl-quality/managers please kindly follow along with the discussions here. We are working on seed data this Q3, but the valuable feedback here points to more opportunities besides just static project/issue/MR/CI setup.
At Monitor we're experiencing the same challenges @tauriedavis described in her comment above. I need to have QA Tunnel set up to create a cluster, install Prometheus, and deploy to an environment to monitor Metrics.
@svistas @ddavison given the nice work we have done with K3s to decrease the cost of setting up Auto DevOps tests, would it be possible to use K3s to help with some efficiencies here?
k3s uses the GitLab Tunnel and the test app is deployed using Auto DevOps, but we would at least save the cost of running time in GKE clusters.
Mek Stittri changed title from "Remove friction for review apps to be deployed spin up" to "Remove friction for review apps to be deployed so environments are available for UX and PM"
A very small (I think) improvement that I've mentioned to @meks would be adding a description to each demo project of what's seeded/configured. For example, in this screenshot, replace "My awesome project" with what's actually available in the project.
In theory, review apps would be more accessible than local GDK or Gitpod, but responses show that review apps are largely unused (31.82% have never used them).
Pros: Useful to review documentation changes (Pajamas or GitLab handbook).
Cons: Some mentioned their lack of familiarity with review apps. They are seen as unreliable even though they are much more reliable than they were in the past. As mentioned in the local GDK section, using review apps often means setting up the environment from scratch.
Engineering Productivity has greatly improved the stability and usefulness of the Review App implementation in gitlab-org/gitlab since this issue was created.
I'm going to close this based on the improvements listed below. Please open new issues and mention me if there are specific usefulness opportunities to improve UX/PM interaction with Review Apps that aren't captured in &606 or a child epic.