[Dot] Unicode parsing bug
Ported Issue from Mantis Original ID: 989 Reported By: Karl Chen
SEVERITY: MINOR Submitted: 2005-10-26 20:27:40
OS: --
VERSION: 2.2.1
DESCRIPTION
The following tooltip: "GauÃ\n" (that's a U+00DF before a \n)
becomes "GauÃ\n" when outputting postscript.
Non-Unicode characters before \n are OK; other formats are OK.
STEPS TO REPRODUCE
GAUSS [label="Carl GauÃ\n1799"];
ADDITIONAL INFORMATION
[quarl] It works fine normally, at least for the latin1 subset, to simply encode it in UTF-8. I worked around the Gauss\n problem by adding a space between ss and \n -- I didn't mention this in the bug report because I thought you knew Unicode worked most of the time.
From looking at the postscript output it seems like a bug in parsing \n, because everywhere else \n is split into multiple PS commands; here it was encoded as one PS line with "\n".