W3C home > Mailing lists > Public > www-international@w3.org > October to December 2000

surrogates for XML

From: Yves <yves@opentag.com>
Date: Mon, 09 Oct 2000 15:15:05 +0900
Message-Id: <4.2.0.58.J.20001009151455.009c83f0@sh.w3.mag.keio.ac.jp>
To: www-international@w3.org

I have a question about Unicode surrogates and XML:

The XML specifications define the range of valid characters to be:

Char ::= #x9 | #xA | #xD | [#x20-#xD7FF] | [#xE000-#xFFFD] | [#x10000-#x10FFFF]

Explicitely excluding the surrogates blocks. But the scalar values 0x10000 
to 0x10FFFF seems to indicate that surrogates are supported... I'm not sure 
I understand. In addition, The Unicode version 3.0 also gives formulas to 
go back and forth between surrogates pairs and scalar values, mentioning 
their need for XML (section 3.7).

I would appreciate a lot if someone could someone cold give me more 
information on how surrogates are supported on XML?

Thanks.

-yves savourel
Received on Monday, 9 October 2000 08:20:54 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Tuesday, 2 June 2009 19:16:55 GMT