Modify

Ticket #197 (new defect)

Opened 3 years ago

Last modified 34 hours ago

Non-BMP unicode characters are dumped using surrogate code units if python was not configured with UCS4

Reported by: travis.mcleskey@… Owned by: xi
Priority: normal Component: pyyaml
Severity: normal Keywords:
Cc:

Description

If python is built with UCS2 (the default, for example, on OS X), you get:

yaml.dump(u'\U0001D10C')

"\uD834\uDD0C"

The output should instead be "\U0001D10C", since the surrogate code units are not valid unicode characters.

Attachments

Change History

comment:1 Changed 3 years ago by anonymous

  • Summary changed from Dump 32-bit "\U"-style escape sequences, even if python was compiled without UCS4 to Dump 32-bit "\U"-style escape sequences for non-BMP characters, even if python was compiled without UCS4

comment:2 Changed 3 years ago by anonymous

  • Summary changed from Dump 32-bit "\U"-style escape sequences for non-BMP characters, even if python was compiled without UCS4 to Non-BMP unicode characters are dumped using surrogate code units if python was not configured with UCS4

comment:3 Changed 34 hours ago by simas <kerdos@…>

 Cipto Junaedy is a customer research expert. "There are plenty of safe, effective HCA nutritional supplement out there. Finding one with  Unit Link Terbaik di Indonesia Commonwealth Life Investra Link 60% HCA with no food additives is vital."

View

Add a comment

Modify Ticket

Change Properties
<Author field>
Action
as new
as The resolution will be set. Next status will be 'closed'
to The owner will be changed from xi. Next status will be 'new'
The owner will be changed from xi to anonymous. Next status will be 'assigned'
Author


E-mail address and user name can be saved in the Preferences.

 
Note: See TracTickets for help on using tickets.