
Canonical XHTML

From: Bjoern Hoehrmann <derhoermi@gmx.net>
Date: Sun, 12 Aug 2001 22:50:37 +0200
To: www-html-editor@w3.org
Message-ID: <kvqdnt8v1cb3roshma8e8j3jsglk9m0t9h@4ax.com>

   The canonical form of XML documents as defined by [XMLC14N] is not
usable for XHTML, since it does not take the additional whitespace
normalization requirements of [XHTML10] into account. I encourage the
HTML WG to publish a note or recommendation that defines "Canonical XHTML".
This would be straightforward, since it could simply say:

  The canonical form of an XHTML document is the result of normalizing
  whitespace according to section 3.2 of XHTML 1.0 and then applying
  the rules of Canonical XML 1.0 to the document.

This would be useful for applications focused on XHTML documents, for
example search engines trying to determine whether something significant
has changed in a given XHTML document.
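
To make the idea concrete, here is a rough sketch in Python of the two
steps and of the change-detection use case. Everything in it is an
assumption for illustration, not part of either specification: the names
canonical_xhtml and fingerprint are made up, the whitespace step is a
simplification of section 3.2 (it collapses whitespace runs in character
data and attribute values, and ignores xml:space and the block/inline
distinction), and the canonicalization step uses the standard library's
xml.etree.ElementTree.canonicalize, which implements C14N 2.0 rather than
Canonical XML 1.0. The intermediate serialization may also rewrite
namespace prefixes, so the result is only meant for comparing documents
processed the same way.

```python
import hashlib
import re
import xml.etree.ElementTree as ET

# XML white space characters: SPACE, TAB, CR, LF.
_WS_RUN = re.compile(r"[ \t\r\n]+")


def collapse(text):
    """Reduce each run of XML white space to a single space."""
    return _WS_RUN.sub(" ", text) if text else text


def canonical_xhtml(source):
    """Hypothetical helper: normalize whitespace, then canonicalize.

    Simplified reading of XHTML 1.0 section 3.2; xml:space="preserve"
    and block-level trimming are NOT handled here.
    """
    root = ET.fromstring(source)
    for elem in root.iter():
        elem.text = collapse(elem.text)
        elem.tail = collapse(elem.tail)
        # Attribute values: collapse runs, strip leading/trailing space.
        for name, value in list(elem.attrib.items()):
            elem.set(name, collapse(value).strip(" "))
    # Stand-in for Canonical XML 1.0: the stdlib offers C14N 2.0 only.
    return ET.canonicalize(ET.tostring(root, encoding="unicode"))


def fingerprint(source):
    """Hash of the canonical form, usable for change detection."""
    return hashlib.sha256(canonical_xhtml(source).encode("utf-8")).hexdigest()
```

A crawler could store fingerprint(document) for each page and treat a
page as changed only when the value differs, so that re-indentation or
re-wrapping of the markup alone would not count as a change.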

[XMLC14N] http://www.w3.org/TR/2001/REC-xml-c14n-20010315
[XHTML10] http://www.w3.org/TR/2000/REC-xhtml1-20000126 

Björn Höhrmann { mailto:bjoern@hoehrmann.de } http://www.bjoernsworld.de
am Badedeich 7 } Telefon: +49(0)4667/981028 { http://bjoern.hoehrmann.de
25899 Dagebüll { PGP Pub. KeyID: 0xA4357E78 } http://www.learn.to/quote/
Received on Sunday, 12 August 2001 16:51:13 UTC
