Re: numeric character references and Unicode surrogate pairs: part of my review of 8 The HTML syntax

Robert Burns:
> >I believe this is not consistent with existing browser behavior. That is  
> >that while surrogate pairs, expressed as pairs of numeric character  
> >references, are not supposed to resolve to the non-BMP character,  
> >browsers do it anyway.

Anne van Kesteren:
> Do you have any tests to demonstrate that?

Here’s one:

  data:text/html,%26%23xD800%3B%26%23xDC00%3B

Shows as a single U+10000 character in Firefox 2.0.0.5 and Opera 9.23,
at least.

-- 
Cameron McCormack, http://mcc.id.au/
 xmpp:heycam@jabber.org  ▪  ICQ 26955922  ▪  MSN cam@mcc.id.au

Received on Monday, 20 August 2007 13:14:17 UTC