Refactor Golden Master spec approach for testing GFM support (!68671)

Chad Woolley requested to merge 338268-refactor-golden-master-spec-approach-for-testing-gitlab-flavored-markdown-support-in-the into master Aug 20, 2021

Problem to solve

GitLab Flavored Markdown (GFM) is large and has many edge cases to test. The Content Editor currently supports rendering some GFM content, but we eventually need to support all of it. Whenever something is added to GFM, we need a way to test our support against a Golden Master (GM) specification.

Background

See this original MR description for definitions and explanation of what "Golden Master Testing" (AKA "Characterization Testing") is, and how we are using it.

Requirements

Frontend: We should have test coverage that the Content Editor can properly serialize HTML to Markdown for all GFM source elements which it currently supports.
Frontend: We should have test coverage that the Content Editor can properly render the expected HTML for all GFM source elements which it currently supports (not supported until we implement this).
Backend: We should ensure that the backend always renders the expected HTML for all supported GFM source elements.
If any of this ever changes unexpectedly, tests will start failing and force the same change to be made on both the backend and the frontend.

Implementation Plan

What we are planning to do is a type of Golden Master testing, with modifications:

The original markdown examples used to drive the tests are taken from the YAML, and can be considered a form of "fixture" in this case.
The HTML in the YAML is the "Golden Master", but we are going to use it to assert against TWO different implementations of markdown rendering:
1. The frontend one implemented as Jest specs.
  1. This will assert both HTML -> markdown serialization (what it currently does), as well as
  2. Markdown -> HTML rendering (not currently supported until we implement this). This will likely be a standalone module outside of the Content Editor.
2. The backend one, implemented as request specs.
  1. This will assert markdown -> HTML conversion by the backend.
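
The plan above can be sketched in plain Ruby. This is only an illustration under stated assumptions, not the implementation in this MR: the fixture layout (an example name mapping to `markdown` and `html` keys), the `render_markdown` stand-in renderer, and the `failing_examples` helper are all hypothetical. In the real specs, the renderer would be the backend API or the frontend module under test.

```ruby
require 'yaml'

# Hypothetical fixture layout: each named example pairs its markdown
# source (the "fixture") with the expected HTML (the "Golden Master").
EXAMPLES_FIXTURE = <<~FIXTURE
  bold:
    markdown: "**bold**"
    html: "<p><strong>bold</strong></p>"
  italic:
    markdown: "_italic_"
    html: "<p><em>italic</em></p>"
FIXTURE

# Stand-in renderer for illustration only; it handles just bold and italic.
def render_markdown(markdown)
  inner = markdown
    .gsub(/\*\*(.+?)\*\*/, '<strong>\1</strong>')
    .gsub(/_(.+?)_/, '<em>\1</em>')
  "<p>#{inner}</p>"
end

# Returns the names of examples whose rendered HTML differs from the Golden Master.
def failing_examples(examples)
  examples.reject do |_name, example|
    render_markdown(example.fetch('markdown')) == example.fetch('html')
  end.keys
end

examples = YAML.safe_load(EXAMPLES_FIXTURE)
puts failing_examples(examples).inspect
```

Because both implementations would read the same examples, a change to the expected HTML in the fixture would surface as a failure on whichever side has not yet been updated.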

Tasks

References

Comments on !55856 (merged)
#323766 (closed)

Does this MR meet the acceptance criteria?

Conformity

I have included changelog trailers, or none are needed. (Does this MR need a changelog?)
I have added/updated documentation, or it's not needed. (Is documentation required?)
I have properly separated EE content from FOSS, or this MR is FOSS only. (Where should EE code go?)
I have added information for database reviewers in the MR description, or it's not needed. (Does this MR have database related changes?)
I have self-reviewed this MR per code review guidelines.
This MR does not harm performance, or I have asked a reviewer to help assess the performance impact. (Merge request performance guidelines)
I have followed the style guides.
This change is backwards compatible across updates, or this does not apply.

Availability and Testing

I have added/updated tests following the Testing Guide, or it's not needed. (Consider all test levels. See the Test Planning Process.)
I have tested this MR in all supported browsers, or it's not needed.
I have informed the Infrastructure department of a default or new setting change per definition of done, or it's not needed.

Security

Does this MR contain changes to processing or storing of credentials or tokens, authorization and authentication methods or other items described in the security review guidelines? If not, then delete this Security section.

Label as security and @ mention @gitlab-com/gl-security/appsec
The MR includes necessary changes to maintain consistency between UI, API, email, or other methods
Security reports checked/validated by a reviewer from the AppSec team

Related to #338268 (closed)

Example of using Nokogiri for normalization

```diff
-        post api_url, params: { text: markdown, gfm: true }
+        post api_url, params: { text: example_markdown, gfm: true }
         expect(response).to be_successful
-
         response_html = Gitlab::Json.parse(response.body).fetch('html')
-        unescaped_response = CGI.unescape(response_html)
-        normalized_html = normalize_html(html)
-        normalized_unescaped_response = normalize_html(unescaped_response)
-        expect(normalized_unescaped_response).to eq(normalized_html)
+        normalized_response = normalize_html(response_html)
+
+        expect(normalized_response).to eq(normalized_example_html)
       end
```

```ruby
def normalize_html(html)
  p html
  # Parse the HTML, dropping blank text nodes.
  html_doc = Nokogiri::XML(html) do |config|
    config.noblanks
  end
  p html_doc.to_xhtml(indent: 2)

  # TODO: Provide a way to do substitutions of variable values (urls, ids, etc) in the HTML.
  # Work in progress: the input is still returned unchanged.
  html
end
```

Edited Nov 11, 2021 by Chad Woolley
