[Unicode] Non-ASCII characters break syntax highlighting and autocompletion
Summary
Non-ASCII characters in identifier names break syntax highlighting in several languages, like Julia, XML, HTML, YML, CSS, Javascript… When you are writing code directly, it also breaks autocompletion (identifier not found when closing tag, for example).
Example Project
https://gitlab.com/BenjaminGalliot/non-ascii-character-problem-test
What is the current bug behavior?
The colors are cut or wrong for syntax hightlighting, and names are not shown for autocompletion.
What is the expected correct behavior?
The colors must behave same as pure ASCII character string, and autocompletion must work the same too.
Possible fixes
Maybe change some regex patterns in code highlighter, with full Unicode classes rather than basic ASCII ones.
Useful link
Edited by Benjamin Galliot