W3C home > Mailing lists > Public > public-html-bugzilla@w3.org > June 2010

[Bug 9965] White space in attributes - please justify why multiple spaces/linebreaks are problematic - or delete this requirement

From: <bugzilla@jessica.w3.org>
Date: Mon, 21 Jun 2010 15:49:26 +0000
To: public-html-bugzilla@w3.org
Message-Id: <E1OQjFW-0006F4-Kp@jessica.w3.org>
http://www.w3.org/Bugs/Public/show_bug.cgi?id=9965





--- Comment #6 from Leif Halvard Silli <xn--mlform-iua@xn--mlform-iua.no>  2010-06-21 15:49:26 ---
(In reply to comment #5)
> (In reply to comment #4)

> Just to be clear: _multiple_ consequive spaces are taken care of in XML,
> according to the DOM inspectors I have consulted.

My conclusion: if there are problems with regard to what multiple spaces, then
it is related to differences between XML parsers. Thus, multiple spaces in
_CDATA_ attributes, is not a polyglot HTML/XML issue. The polyglot spec might
still forbid it, but if so, it has to justify it by pointing to differences
inside XML - and not to differences between XML and HTML. It is not, I think,
the task of the Polyglot spec to solve all problems on the XML side.

See: http://www.usingxml.com/Basics/XmlSpace#AttributeWhiteSpaceHandling

By the way, it seems like XML Canonicalisation (C14N) is related to the
Polyglot spec 

See: http://www.usingxml.com/Basics/XmlSpace#TreatingWhiteSpaceinaUniformWay
And: http://www.w3.org/TR/xml-c14n

For example, according to C14N, 

* then <li/> should be written as <li></li>
* in C14N, then only UTF-8 is valid.
* in C14N, then class="   one     two" becomes class="one two" 
   see http://www.w3.org/TR/xml-c14n#Example-Chars

If the Polyglot spec really should be more of "XHML/HTML with helmets" kind of
spec (Sam used that idio once) , rather than a spec that identfies the common
ground between XHTML5 and HTML5, then it would be a good idea to take C14N as
starting point.

C14N also lays out rules for how to write numeric character reference ... e.g.
trailing zeros are forbidden. E.g. &#x0d; becomes &#xD; Again, see
http://www.w3.org/TR/xml-c14n#Example-Chars
Removing trailing zeros in numeric character references *is* more IE
compatible, for instance ...

I am not certain that I want the strict rules of C14N, though.

-- 
Configure bugmail: http://www.w3.org/Bugs/Public/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the QA contact for the bug.
Received on Monday, 21 June 2010 15:49:28 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Monday, 21 June 2010 15:49:31 GMT