Re: Unicode character classes and XML parser

John G. Spragge wrote:

 > Characters which have a font or compatibility decomposition
 > (i.e. those with a "compatibility formatting tag" in field 5
 > of the database -- marked by field 5 beginning with a "<")
 > are not allowed.

 > Question from an implementor of the parser: does this mean
 > XML excludes characters with decompositions altogether (presumably
 > to avoid normalisation issues), or does it mean XML identifiers
 > exclude such characters? Thanks...

The constraint is on XML names (identifiers) only.  Compatibility
characters elsewhere in XML documents are only discouraged, not
forbidden.
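
For an implementor, the "field 5 begins with a '<'" test quoted above
can be sketched roughly as follows. This is only an illustration, not
the rule as the XML spec states it: the function names are made up for
this example, and Python's unicodedata module reflects whatever Unicode
version your interpreter ships with, which may differ from the database
version the XML name classes were derived from. A real parser should
rely on the explicit character class tables in the spec; this check
only shows the one criterion discussed here, applied to a name.

```python
import unicodedata


def has_compat_decomposition(ch):
    """Return True if the character's decomposition field starts with '<'.

    unicodedata.decomposition() returns the decomposition mapping as a
    string, e.g. '<compat> 0066 0069' for a compatibility mapping,
    '0065 0301' for a canonical one, or '' when there is none.
    """
    return unicodedata.decomposition(ch).startswith("<")


def find_compat_chars(name):
    """Hypothetical helper: list the characters in a candidate XML name
    that carry a font or compatibility decomposition."""
    return [ch for ch in name if has_compat_decomposition(ch)]


if __name__ == "__main__":
    # U+FB03 (LATIN SMALL LIGATURE FFI) has a <compat> mapping;
    # U+00E9 (e with acute) has only a canonical decomposition.
    for candidate in ("o\ufb03ce", "caf\u00e9"):
        print(repr(candidate), find_compat_chars(candidate))
```

Note that the check is applied to names only; the same characters in
character data or attribute values would pass through untouched.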


-- 
There is / one art             || John Cowan <jcowan@reutershealth.com>
no more / no less              || http://www.reutershealth.com
to do / all things             || http://www.ccil.org/~cowan
with art- / lessness           \\ -- Piet Hein

Received on Thursday, 19 April 2001 16:58:38 UTC