
[Bug 10014] New: XHTML1 Doctypes cause DOM diff in XML parsers with external entity handling

From: <bugzilla@jessica.w3.org>
Date: Fri, 25 Jun 2010 19:43:44 +0000
To: public-html-bugzilla@w3.org
Message-ID: <bug-10014-2486@http.www.w3.org/Bugs/Public/>
http://www.w3.org/Bugs/Public/show_bug.cgi?id=10014

           Summary: XHTML1 Doctypes cause DOM diff in XML parsers with
                    external entity handling
           Product: HTML WG
           Version: unspecified
          Platform: Macintosh
        OS/Version: Mac System 9.x
            Status: NEW
          Severity: normal
          Priority: P2
         Component: HTML/XHTML Compatibility Authoring Guide (ed: Eliot
                    Graff)
        AssignedTo: eliotgra@microsoft.com
        ReportedBy: xn--mlform-iua@xn--mlform-iua.no
         QAContact: public-html-bugzilla@w3.org
                CC: mike@w3.org, public-html@w3.org,
                    eliotgra@microsoft.com


Henri in bug 9965, regarding CDATA and non-CDATA attributes:

Comment #12 From Henri Sivonen 2010-06-22 07:17:09 -------
With <!DOCTYPE html>, all attributes are CDATA attributes.

However, if you use one of the permitted XHTML 1.0 doctypes and the XML
processor processes external entities, some attributes aren't CDATA attributes.
As for it being an issue, the document tree would be *different* which implies
not polyglot.
----------------
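
A minimal sketch of the difference Henri describes, in Python (whose
xml.etree.ElementTree is backed by expat). Assumptions: an internal DTD subset
stands in for the external XHTML 1.0 DTD so that no network fetch is needed,
and the single ATTLIST line (headers declared as IDREFS on td) only mimics the
kind of non-CDATA declaration that DTD makes. The helper name headers_value is
hypothetical.

```python
import xml.etree.ElementTree as ET

# Assumption: this internal subset mimics an XML processor that has read a
# DTD declaring `headers` as a tokenized (non-CDATA) attribute.
WITH_DTD = (
    '<!DOCTYPE td [<!ATTLIST td headers IDREFS #IMPLIED>]>'
    '<td headers="  h1   h2  "/>'
)

# Same element, but no declaration is seen, so `headers` is treated as CDATA.
WITHOUT_DTD = '<!DOCTYPE td><td headers="  h1   h2  "/>'


def headers_value(source: str) -> str:
    """Return the `headers` attribute value as reported by the XML parser."""
    return ET.fromstring(source).get("headers")


if __name__ == "__main__":
    # repr() makes leading, trailing, and repeated spaces visible.
    print(repr(headers_value(WITH_DTD)))
    print(repr(headers_value(WITHOUT_DTD)))
```

When the declaration is seen, the processor should trim and collapse the
spaces in the value; without it, the value is reported as written. That is the
DOM difference at issue.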

So, there should be 4 options:

1) Say that the DOM differences are acceptable
2) Variant of 1): Say that the polyglot spec is only meant for HTML parsers and
XML processors without external entity handling
3) Declare XHTML 1.x doctypes incompatible with the polyglot spec.
4) Impose authoring requirements strict enough that the document is treated the
same way, regardless of DTD/DOCTYPE

Option 3) would also determine which DTD-based doctypes could be permitted:
only those that declare nothing _but_ CDATA attributes.

This is a judgement call: even CDATA attributes differ between the two kinds of
parser, because HTML parsers do not perform any XML-like attribute-value
normalization. So one could argue that even CDATA attributes cannot be made
similar enough.
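
The CDATA case can be sketched the same way. Assumptions: Python's html.parser
is used here only as a stand-in for an HTML parser (it is not an HTML5
conformance implementation), and the names AttrCollector, html_attr and
xml_attr are hypothetical. The XML side relies on the XML 1.0 rule that literal
tabs and line breaks in an attribute value are normalized to spaces.

```python
import xml.etree.ElementTree as ET
from html.parser import HTMLParser

# One start tag whose attribute value contains a literal tab and newline.
MARKUP = '<p title="a\tb\nc"></p>'


class AttrCollector(HTMLParser):
    """Collects the attributes of every start tag into a dict."""

    def __init__(self):
        super().__init__()
        self.attrs = {}

    def handle_starttag(self, tag, attrs):
        self.attrs.update(attrs)


def html_attr(source: str, name: str) -> str:
    """Attribute value as seen by the HTML-style parser."""
    collector = AttrCollector()
    collector.feed(source)
    collector.close()
    return collector.attrs[name]


def xml_attr(source: str, name: str) -> str:
    """Attribute value as seen by the XML parser."""
    return ET.fromstring(source).get(name)


if __name__ == "__main__":
    print(repr(html_attr(MARKUP, "title")))
    print(repr(xml_attr(MARKUP, "title")))
```

The HTML-side value should keep the tab and newline, while the XML-side value
should have each replaced by a single space, even though no DTD is involved.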

Received on Friday, 25 June 2010 19:43:46 GMT
