Re: XML question for the experts

/ John Cowan <cowan@ccil.org> was heard to say:
| Grosso, Paul scripsit:
|
|> If a serialized XML document contains:
|> 
|> <!--This is a comment &#x2014; pbg-->
|> 
|> or
|> 
|> <?myproc pseudoatt="this is part of a pi &#x2014; pbg"?>
|> 
|> then when that is read by an XML processor, is the
|> &#x2014; considered to be a seven character string 
|> or the Unicode em-dash character?
|
| Clearly the former.  Comments and PIs contain simply Chars, which means
| that NCRs are not recognized in them.  Compare productions 15 (Comment)
| and 16 (PI) with 10 (AttValue) and 43 (Content).

John is clearly right. Not that I'd have implemented it correctly off
the top of my head :-)

                                        Be seeing you,
                                          norm

-- 
Norman Walsh <ndw@nwalsh.com> | All professional men are handicapped by
http://nwalsh.com/            | not being allowed to ignore things
                              | which are useless.-- Goethe

Received on Friday, 7 December 2007 22:08:11 UTC