What an XML processor must/may not tell a processor

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

I've just searched the XML spec. for what it says about information
passed to processors.  Here's a tabulation of my findings --
comments/corrections welcome:

Required for all processors

 PIs
 Non-markup character data
 Attribute+default if declared and missing
 Name and external idents of notations in attr val, attr def or entity
  decl

 [implicitly, attribute values, since we have the prose "Before the
  value of an attribute is passed to the application"]

 [implicitly, attributes, since we have their values]

Required for validating processors

 EC-Whitespace status
 Unparsed entity SYS/PUB Idents and Notation

Required for non-validating

 Unread-entity notification

Optional

 Content of comments
 [implicitly, comments as such ??]

Unclear

 Start and end tags
 Declarations and definitions

I believe that it's been remarked before that start/end tags are not
explicitly required to be reported, but everybody acts as if they are.
In any case, I'd be very interested to hear arguments either way wrt
these last two. . .

ht
- -- 
       Henry S. Thompson, School of Informatics, University of Edinburgh
      10 Crichton Street, Edinburgh EH8 9AB, SCOTLAND -- (44) 131 650-4440
                Fax: (44) 131 651-1426, e-mail: ht@inf.ed.ac.uk
                       URL: http://www.ltg.ed.ac.uk/~ht/
 [mail from me _always_ has a .sig like this -- mail without it is forged spam]
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.2.6 (GNU/Linux)

iD8DBQFMPHzjkjnJixAXWBoRAvcdAJ997Y5Rf3Khl/Dyvmh7kX+UowsFagCdGu/t
/oSGl4KdwXcPA6svv/tbaDM=
=heLp
-----END PGP SIGNATURE-----

Received on Tuesday, 13 July 2010 14:49:37 UTC