[Bug 5486] Regex: Names of Unicode code blocks

http://www.w3.org/Bugs/Public/show_bug.cgi?id=5486

------- Comment #1 from mike@saxonica.com  2008-02-16 15:20 -------
Looking at the list more closely, it seems that new blocks that have been added
since Unicode 3.1 are present in the table in the 1.1 spec, but blocks that
have been renamed in Unicode have not been renamed in the table. This is
probably the right thing to do, but it merits a note. The statement (part of a
Definition no less) "The set containing all characters that have block name X
(with all white space stripped out), can be identified with a block escape
\p{IsX}." appears not in fact to be normative; I think we must assume that the
table is intended to contain the normative names of the blocks.
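
To make the mismatch concrete, here is a minimal sketch of the rule as the quoted Definition states it: strip all white space from the block name and wrap the result in \p{Is...}. The helper name block_escape is hypothetical, and the Greek example assumes the Unicode 3.2 renaming of the "Greek" block to "Greek and Coptic"; nothing here is taken from the spec's table itself.

```python
def block_escape(block_name: str) -> str:
    # Hypothetical helper: apply the quoted Definition literally by
    # removing all white space from the Unicode block name.
    stripped = "".join(block_name.split())
    return "\\p{Is" + stripped + "}"


# A non-BMP block that was listed in XML Schema 1.0 First Edition.
print(block_escape("Musical Symbols"))

# Assumed renaming example: the old and new Unicode names yield
# different escapes, so only one can match a name kept in the table.
print(block_escape("Greek"))
print(block_escape("Greek and Coptic"))
```

If the table keeps the old name, an escape derived from the current Unicode name will not appear in it, which is why the Definition cannot be read as normative on its own.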

I'm also puzzled by a bit of history. XML Schema 1.0 First Edition contained a
number of blocks in the non-BMP area, such as MusicalSymbols. These disappeared
in the second edition of 1.0, but there appears to have been no erratum, and
the changes are not highlighted in the change-marked version of the (1.0 2e)
spec. How can this have happened?

Received on Saturday, 16 February 2008 15:20:36 UTC