W3C home > Mailing lists > Public > public-html-bugzilla@w3.org > January 2011

[Bug 11910] @id values in polyglot markup should be XML-valid (or not?)

From: <bugzilla@jessica.w3.org>
Date: Fri, 28 Jan 2011 17:48:04 +0000
To: public-html-bugzilla@w3.org
Message-Id: <E1PisQW-0007rD-OV@jessica.w3.org>

--- Comment #4 from Leif Halvard Silli <xn--mlform-iua@xn--mlform-iua.no> 2011-01-28 17:48:04 UTC ---
(In reply to comment #3)
> (In reply to comment #2)

> So long as the
> document sticks to the single aim of specifying how to get conforming html
> documents to have equivalent parse trees if parsed as XML, then xml validity is
> irrelevant.

I agree that it would be very good if it could stick to that single goal.
Because then this document would fit on *all* flavours of XML/XHTML. And would
thus become a true replacement for  Appendix C of XHTML 1.

But if so, then Henri's idea that it should be a subset of XML 1.0 and HTML5,
does not 100% fit. It only fits because it is HTML5 that defines how the HTML
DOM looks like.

Even HTML5 defines things as forbidden, that does not constitute a problem with
regad to DOM-equality. One thing I mentioned in this very bug is the fact that
HTML5 forbids space characters inside its @id attribute. This is not a problem
from a DOM-equality pointof view.

Perhaps one way to get this document turned into that direction, could be to
split it sharply in two parts: One part which describes the general rules. And
another part which uses HTML5 as an example. It would then be up to those that
uses other flavours of XHTML to find out what it would mean for them to be

In that regard, the use of <?xml version="1.0" encoding="UTF-8" ?>, is not an
DOM issue - except in one particular version of Internet Explorer - IE6 (and
earlier). It is far worse that if you place a <!--comment--> in front of the
<!DOCTYPE html>, then this triggers quirks-mode in *all* versions of Internet
Explorer. And this even happens if the comment comes between the XML
declaration and the DOCTYPE. Thus, this triggers quirks-mode in IE6 to IE9,
whether you remove the XML declaration or not:

<?xml version="1.0" encoding="UTF-8" ?>
<!--Hello, IE -->
<!DOCTYPE html>
<html xmlns="http://www.w3.org/1999/xhtml" xml:lang="en" lang="en">

(Test it yourself or read

In other words, the forbiddance of the xml declaration is only a conformance
thing. Anyway, what do you think about splitting the document as I suggested?

Configure bugmail: http://www.w3.org/Bugs/Public/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the QA contact for the bug.
Received on Friday, 28 January 2011 17:48:06 UTC

This archive was generated by hypermail 2.3.1 : Wednesday, 7 January 2015 16:31:04 UTC