Revamp reply emails parsing

This is an attempt to summarize all the issues with parsing reply emails in one place, so that clues are easier to find.

All related issues

Challenges

  • Different email clients (e.g. gitlab-ce#18388)
  • Different languages
  • HTML emails
  • Auto-generated emails
  • Signatures

Suggested solutions

  • Leave markers in outgoing emails that we can recognize later in the replies (I think Discourse does this, as do a lot of support ticket systems)
  • Keep a list of the different quoting formats email clients may use (some clients use | for quoting)
  • Don't use Markdown, just plain text (GitHub does this, but the result can still be pretty bad. Here's an example of the woes)
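
To make the first two suggestions concrete, here is a minimal sketch of how a marker and a list of quote prefixes could work together. Everything in it is an assumption for illustration: the marker text, the `extract_reply` name, and the English-only attribution pattern are hypothetical and are not taken from GitLab, Discourse, or any existing parser.

```python
import re

# Hypothetical marker line that would be added to every outgoing notification.
REPLY_MARKER = "## Reply above this line ##"

# Assumed quote prefixes: ">" (common) and "|" (used by some clients).
QUOTE_PREFIX = re.compile(r"^\s*[>|]")

# Assumed attribution line. English only, so other languages would need
# their own patterns (see "Different languages" under Challenges).
ATTRIBUTION = re.compile(r"^On .+ wrote:\s*$")


def extract_reply(body: str) -> str:
    """Return the part of a plain-text email body the user most likely wrote."""
    # If the marker survived, keep only what comes before it.
    # If it is missing, partition() returns the whole body as the head.
    head, _, _ = body.partition(REPLY_MARKER)

    # Drop quoted lines and attribution lines from what is left.
    kept = [
        line
        for line in head.splitlines()
        if not QUOTE_PREFIX.match(line) and not ATTRIBUTION.match(line)
    ]
    return "\n".join(kept).strip()


if __name__ == "__main__":
    email_body = (
        "Looks good to me.\n"
        "\n"
        "On Mon, Jan 1, 2024, GitLab wrote:\n"
        "> ## Reply above this line ##\n"
        "> Original comment\n"
    )
    print(extract_reply(email_body))
```

Note the trade-off: treating every line that starts with | as a quote would also drop Markdown tables in a reply, which is one reason the plain-text suggestion and the quoting-format list interact. The sketch does not handle HTML emails or signatures.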

Reference implementation

Some stopped effort


/cc @smcgivern @DouweM @MrChrisW @dblessing
