Ticket #197 (new defect)
Non-BMP unicode characters are dumped using surrogate code units if python was not configured with UCS4
|Reported by:||travis.mcleskey@…||Owned by:||xi|
If python is built with UCS2 (the default, for example, on OS X), you get:
The output should instead be "\U0001D10C", since the surrogate code units are not valid unicode characters.
- Summary changed from Dump 32-bit "\U"-style escape sequences, even if python was compiled without UCS4 to Dump 32-bit "\U"-style escape sequences for non-BMP characters, even if python was compiled without UCS4