RE: Surrogate blocks as block escapes in CR-xmlschema-2-20001024

From: Biron,Paul V <Paul.V.Biron@kp.org>
Date: Tue, 5 Dec 2000 11:13:42 -0800
Message-Id: <376E771642C1D2118DC300805FEAAF43014BA82A@pars-exch-1.ca.kp.org>
To: "'Tony Graham'" <tgraham@mulberrytech.com>, www-xml-schema-comments@w3.org
> -----Original Message-----
> From:	Tony Graham [SMTP:tgraham@mulberrytech.com]
> Sent:	Friday, November 24, 2000 12:57 PM
> To:	www-xml-schema-comments@w3.org
> Subject:	Surrogate blocks as block escapes in CR-xmlschema-2-20001024
> 
> The regular expression syntax includes block and category escapes of
> the form '\p{IsX}', where 'X' is either a one- or two-character
> character property identifier or a Unicode character block name with
> spaces stripped out.
> 
> The table of character properties in the CR excludes the 'Cs' property
> and notes that "surrogate" characters 'do not occur at the level of
> the "character abstraction" that XML instance documents operate on.'
> 
> The CR refers to the Unicode 3.0 blocks but does not list them.  The
> Unicode 3.0 blocks include three that cover the Surrogates area: "High
> Surrogate", "High Private Use Surrogates" and "Low Surrogates".
> 
> Since Surrogates 'do not occur at the level of the "character
> abstraction" that XML instance documents operate on', should the CR
> note that the surrogate-related blocks should not be used in block
> escapes in XML Schema regular expressions?
> 
Yes, that's a good catch.  I will make that change to the note on surrogates.
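
For illustration only, here is a minimal Python sketch of why those block escapes could never match anything.  It is not part of the CR.  The block ranges are the Unicode 3.0 values as I recall them from Blocks.txt, the character ranges follow the XML 1.0 Char production, and the names SURROGATE_BLOCKS, is_xml_char and matchable_code_points are hypothetical helpers made up for this example.

```python
# Unicode 3.0 blocks covering the surrogates area, with spaces stripped
# from the names as a block escape would spell them (assumed from Blocks.txt).
SURROGATE_BLOCKS = {
    "HighSurrogates": (0xD800, 0xDB7F),
    "HighPrivateUseSurrogates": (0xDB80, 0xDBFF),
    "LowSurrogates": (0xDC00, 0xDFFF),
}

# Code point ranges allowed by the XML 1.0 Char production.
XML_CHAR_RANGES = [
    (0x9, 0x9),
    (0xA, 0xA),
    (0xD, 0xD),
    (0x20, 0xD7FF),
    (0xE000, 0xFFFD),
    (0x10000, 0x10FFFF),
]


def is_xml_char(code_point):
    """Return True if the code point is a legal XML 1.0 character."""
    return any(lo <= code_point <= hi for lo, hi in XML_CHAR_RANGES)


def matchable_code_points(block_name):
    """List the code points in a surrogate block that XML could ever contain."""
    lo, hi = SURROGATE_BLOCKS[block_name]
    return [cp for cp in range(lo, hi + 1) if is_xml_char(cp)]


if __name__ == "__main__":
    for name in SURROGATE_BLOCKS:
        # Each block falls entirely inside the gap D800-DFFF in Char.
        print(name, len(matchable_code_points(name)))
```

Every code point in the three blocks falls in the D800-DFFF gap that the Char production leaves out, so an escape naming one of them would have no legal character to match.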

thanx,

pvb
Received on Tuesday, 5 December 2000 14:27:28 GMT
