Re: numeric character references and Unicode surrogate pairs: part of my review of 8 The HTML syntax

From: Cameron McCormack <cam@mcc.id.au>
Date: Mon, 20 Aug 2007 23:14:07 +1000
To: public-html WG <public-html@w3.org>
Message-ID: <20070820131407.GA14640@arc.mcc.id.au>

Robert Burns:
> >I believe this is not consistent with existing browser behavior. That is  
> >that while surrogate pairs, expressed as pairs of numeric character  
> >references, are not supposed to resolve to the non-BMP character,  
> >browsers do it anyway.

Anne van Kesteren:
> Do you have any tests to demonstrate that?

Here’s one:

  data:text/html,%26%23xD800%3B%26%23xDC00%3B

The URI decodes to the markup &#xD800;&#xDC00;, which displays as a single
U+10000 character in at least Firefox 2.0.0.5 and Opera 9.23.
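
For reference, here is a rough sketch of the arithmetic those browsers appear
to be applying. It uses the standard UTF-16 surrogate-pair formula; the helper
name combine_surrogates is my own and purely illustrative, and this is not
taken from any browser's source. It also percent-decodes the test URI's payload
to show the markup the parser actually sees.

```python
from urllib.parse import unquote


def combine_surrogates(high: int, low: int) -> int:
    """Combine a UTF-16 surrogate pair into a single code point.

    Hypothetical helper for illustration; standard UTF-16 decoding formula.
    """
    if not 0xD800 <= high <= 0xDBFF:
        raise ValueError("not a high (lead) surrogate")
    if not 0xDC00 <= low <= 0xDFFF:
        raise ValueError("not a low (trail) surrogate")
    # Each surrogate carries 10 bits; the result is offset by 0x10000.
    return 0x10000 + ((high - 0xD800) << 10) + (low - 0xDC00)


# Payload of the data: URI from the test, percent-decoded.
markup = unquote("%26%23xD800%3B%26%23xDC00%3B")

# The two numeric character references in that markup, combined.
code_point = combine_surrogates(0xD800, 0xDC00)

print(markup)           # &#xD800;&#xDC00;
print(hex(code_point))  # 0x10000
```

D800 and DC00 are the lowest lead and trail surrogates, so the pair maps to
the first non-BMP code point, which matches the U+10000 observed.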

-- 
Cameron McCormack, http://mcc.id.au/
	xmpp:heycam@jabber.org  ▪  ICQ 26955922  ▪  MSN cam@mcc.id.au
Received on Monday, 20 August 2007 13:14:17 UTC
