W3C home > Mailing lists > Public > www-tag@w3.org > December 2005

Re: IRIEverywhere-27

From: Jon Hanna <jon@hackcraft.net>
Date: Tue, 13 Dec 2005 14:45:07 +0000
Message-ID: <439EDE73.8010207@hackcraft.net>
To: www-tag@w3.org

Bjoern Hoehrmann wrote:
> As XML and most formats based on XML allow use of non-Unicode encodings,
> allowing IRIs in such formats would make the formats inconsistent with
> the architectural requirements set forth in the reference processing
> model http://www.w3.org/TR/2005/REC-charmod-20050215/#sec-RefProcModel
> and http://www.w3.org/TR/2005/REC-charmod-20050215/#C014 in particular.

Strings in XML documents are in the UCS. They may be encoded in 
encodings that do not directly handle the full UCS, using character 
entities as necessary, but that is a matter already handled by XML.

When an IRI is contained in an XML document it is the UCS string 
conveyed by the XML, not the other-wise encoded string that is an 
artefact of the XML document's encoding, that is the IRI, as has always 
been the case with URIs (and in HTML documents back when we talked of 
URLs rather than URIs).

Encoding is irrelevant.
Received on Tuesday, 13 December 2005 14:40:02 UTC

This archive was generated by hypermail 2.4.0 : Friday, 17 January 2020 22:56:10 UTC