- From: Biron,Paul V <Paul.V.Biron@kp.org>
- Date: Tue, 5 Dec 2000 11:13:42 -0800
- To: "'Tony Graham'" <tgraham@mulberrytech.com>, www-xml-schema-comments@w3.org
> -----Original Message----- > From: Tony Graham [SMTP:tgraham@mulberrytech.com] > Sent: Friday, November 24, 2000 12:57 PM > To: www-xml-schema-comments@w3.org > Subject: Surrogate blocks as block escapes in CR-xmlschema-2-20001024 > > The regular expression syntax includes block and category escapes of > the form '\p{IsX}', where 'X' is either a one- or two-character > character property identifier or a Unicode character block name with > spaces stripped out. > > The table of character properties in the CR excludes the 'Cs' property > and notes that "surrogate" characters 'do not occur at the level of > the "character abstraction" that XML instance documents operate on.' > > The CR refers to the Unicode 3.0 blocks but does not list them. The > Unicode 3.0 blocks include three that cover the Surrogates area: "High > Surrogate", "High Private Use Surrogates" and "Low Surrogates". > > Since Surrogates 'do not occur at the level of the "character > abstraction" that XML instance documents operate on', should the CR > note that the surrogate-related blocks should not be used in block > escapes in XML Schema regular expressions? > Yes, that's a good catch. I will make that change to the not on surrogates. thanx, pvb
Received on Tuesday, 5 December 2000 14:27:28 UTC