W3C home > Mailing lists > Public > www-international@w3.org > October to December 2000

surrogates for XML

From: Yves <yves@opentag.com>
Date: Mon, 09 Oct 2000 15:15:05 +0900
Message-Id: <>
To: www-international@w3.org

I have a question about Unicode surrogates and XML:

The XML specifications define the range of valid characters to be:

Char ::= #x9 | #xA | #xD | [#x20-#xD7FF] | [#xE000-#xFFFD] | [#x10000-#x10FFFF]

Explicitely excluding the surrogates blocks. But the scalar values 0x10000 
to 0x10FFFF seems to indicate that surrogates are supported... I'm not sure 
I understand. In addition, The Unicode version 3.0 also gives formulas to 
go back and forth between surrogates pairs and scalar values, mentioning 
their need for XML (section 3.7).

I would appreciate a lot if someone could someone cold give me more 
information on how surrogates are supported on XML?


-yves savourel
Received on Monday, 9 October 2000 08:20:54 UTC

This archive was generated by hypermail 2.3.1 : Wednesday, 21 September 2016 22:37:20 UTC