- From: Biron,Paul V <Paul.V.Biron@kp.org>
- Date: Tue, 5 Dec 2000 11:13:42 -0800
- To: "'Tony Graham'" <tgraham@mulberrytech.com>, www-xml-schema-comments@w3.org
> -----Original Message-----
> From: Tony Graham [SMTP:tgraham@mulberrytech.com]
> Sent: Friday, November 24, 2000 12:57 PM
> To: www-xml-schema-comments@w3.org
> Subject: Surrogate blocks as block escapes in CR-xmlschema-2-20001024
>
> The regular expression syntax includes block and category escapes of
> the form '\p{IsX}', where 'X' is either a one- or two-character
> character property identifier or a Unicode character block name with
> spaces stripped out.
>
> The table of character properties in the CR excludes the 'Cs' property
> and notes that "surrogate" characters 'do not occur at the level of
> the "character abstraction" that XML instance documents operate on.'
>
> The CR refers to the Unicode 3.0 blocks but does not list them. The
> Unicode 3.0 blocks include three that cover the Surrogates area: "High
> Surrogate", "High Private Use Surrogates" and "Low Surrogates".
>
> Since Surrogates 'do not occur at the level of the "character
> abstraction" that XML instance documents operate on', should the CR
> note that the surrogate-related blocks should not be used in block
> escapes in XML Schema regular expressions?
>
Yes, that's a good catch. I will make that change to the note on surrogates.
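
For illustration only, here is a minimal sketch of how a schema processor
might flag the surrogate-related block names when they appear in a block
escape. Nothing here comes from the CR: the function name and the regular
expression are hypothetical, and the block names are assumed to be the
Unicode 3.0 names ("High Surrogates", "High Private Use Surrogates",
"Low Surrogates") with spaces stripped.

```python
import re

# Assumption: surrogate-related block names as listed for Unicode 3.0,
# with spaces stripped as the block-escape syntax requires.
SURROGATE_BLOCKS = {
    "HighSurrogates",
    "HighPrivateUseSurrogates",
    "LowSurrogates",
}

# Matches block escapes such as \p{IsBasicLatin} or \P{IsLowSurrogates}.
# Simplification: does not account for an escaped backslash before 'p'.
BLOCK_ESCAPE = re.compile(r"\\[pP]\{Is([A-Za-z0-9\-]+)\}")


def surrogate_block_escapes(pattern):
    """Return the surrogate-related block names used in block escapes."""
    return [
        name
        for name in BLOCK_ESCAPE.findall(pattern)
        if name in SURROGATE_BLOCKS
    ]
```

A processor could treat a non-empty result as an error or a warning; which
of the two is appropriate is for the revised note to say, not this sketch.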
thanx,
pvb
Received on Tuesday, 5 December 2000 14:27:28 UTC