W3C home > Mailing lists > Public > www-dom@w3.org > April to June 2001

Re: Unicode character classes and XML parser

From: John Cowan <jcowan@reutershealth.com>
Date: Thu, 19 Apr 2001 16:59:35 -0400
Message-ID: <3ADF51B7.6050500@reutershealth.com>
To: "John G. Spragge" <jgs@dancing-cat-software.com>
CC: "'www-dom@w3.org'" <www-dom@w3.org>
John G. Spragge wrote:

 > Characters which have a font or compatibility decomposition
 > (i.e. those  with a "compatibility formatting tag" in field 5
 > of the database -- marked by field 5 beginning with a "<")
 > are not allowed.

 > Question from an implementor of the parser: does this mean
 > xml excludes characters with decompositions altogether (presumably
 > to avoid normalisation issues), or does it mean xml identifiers
 > exclude such characters? Thanks...

The constraint is on XML names (identifiers) only.  Compatibility
characters elsewhere in XML documents are only discouraged, not

There is / one art             || John Cowan <jcowan@reutershealth.com>
no more / no less              || http://www.reutershealth.com
to do / all things             || http://www.ccil.org/~cowan
with art- / lessness           \\ -- Piet Hein
Received on Thursday, 19 April 2001 16:58:38 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Friday, 22 June 2012 06:13:48 GMT