Canonical XHTML

Hi,

   The canonical form of XML documents af defined by [XMLC14N] is not
usable for XHTML since it doesn't take the additional whitespace
normalization requirements of [XHTML10] into account. I encourage the
HTML WG to publish a note/recommendation that defines "Canonical XHTML".
This is basically very simple since it could simply say

  The canonical form of an XHTML document is the result of applying the
  normalizing whitespace according to to section 3.2 of XHTML 1.0 and
  applying the rules of Canonical XML 1.0 to the document.

This would be usable for applications focussed on XHTML documents as for
example search engines trying to determine whether something significant
was changend in a certain XHTML document.

[XMLC14N] http://www.w3.org/TR/2001/REC-xml-c14n-20010315
[XHTML10] http://www.w3.org/TR/2000/REC-xhtml1-20000126 

regards,
-- 
Björn Höhrmann { mailto:bjoern@hoehrmann.de } http://www.bjoernsworld.de
am Badedeich 7 } Telefon: +49(0)4667/981028 { http://bjoern.hoehrmann.de
25899 Dagebüll { PGP Pub. KeyID: 0xA4357E78 } http://www.learn.to/quote/

Received on Sunday, 12 August 2001 16:51:13 UTC