[Bug 5486] Regex: Names of Unicode code blocks

From: <bugzilla@wiggum.w3.org>
Date: Sat, 16 Feb 2008 15:20:31 +0000
To: www-xml-schema-comments@w3.org
Message-Id: <E1JQOq7-00070g-HX@wiggum.w3.org>


------- Comment #1 from mike@saxonica.com  2008-02-16 15:20 -------
Looking at the list more closely, it seems that new blocks that have been added
since Unicode 3.1 are present in the table in the 1.1 spec, but blocks that
have been renamed in Unicode have not been renamed in the table. This is
probably the right thing to do, but it merits a note. The statement (part of a
Definition no less) "The set containing all characters that have block name X
(with all white space stripped out), can be identified with a block escape
\p{IsX}." appears not in fact to be normative; I think we must assume that it
is intended that the table should contain the normative names of the blocks.
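To make the quoted definition concrete, here is a minimal Python sketch of the mapping it describes: take a Unicode block name, strip all white space, and wrap the result in \p{Is...}. The function name block_escape is hypothetical and not taken from either spec, and the sketch only builds the escape string (Python's standard re module does not itself support \p{Is...} escapes). It illustrates the rule as stated in the Definition, not the normative table.

```python
import re


def block_escape(block_name: str) -> str:
    # Remove every run of white space from the block name,
    # then wrap the result as a block escape of the form \p{IsX}.
    stripped = re.sub(r"\s+", "", block_name)
    return r"\p{Is" + stripped + "}"


print(block_escape("Musical Symbols"))
print(block_escape("Basic Latin"))
```

Under this rule a block that Unicode later renames would yield a different escape, which is why it matters whether the rule or the table is the normative source of names.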

I'm also puzzled by a bit of history. XML Schema 1.0 First Edition contained a
number of blocks in the non-BMP area, such as MusicalSymbols. These disappeared
in the second edition of 1.0, but there appears to have been no erratum, and
the changes are not highlighted in the change-marked version of the (1.0 2e)
spec. How can this have happened?
Received on Saturday, 16 February 2008 15:20:36 UTC
