W3C home > Mailing lists > Public > ietf-charsets@w3.org > October to December 2002

RE: Comments on draft-yergeau-rfc2279bis-00.txt

From: Francois Yergeau <FYergeau@alis.com>
Date: Sat, 05 Oct 2002 12:43:30 -0400
To: ietf-charsets@iana.org
Message-id: <F7D4BDA0E5A1D14B99D32C022AEB73660EB2F8@alis-2k.alis.domain>

Simon Josefsson wrote:
> Those protocols should be extended to send this information, since it
> is not possible to handle text encoded in multiple encodings
> otherwise.

That would be fine in principle but mostly useless in practice.  The
situation with HTTP shows that very clearly.

>  I don't think it is for UTF-8 to solve these problems,
> which BOM partially tries to do.

The BOM is one instance of charset metainformation located *inside* the
entity (others are the HTML <meta> element and the XML encoding declaration)
instead of outside and therefore detachable.  Experience has shown such
metainformation to be useful and often the only one available.

>  (HTTP supports this, the current FTP
> extensions addresses it partially, lying implementations cannot be
> standardized away though.)

Indeed, protocol standards have limited powers.  In practice, even correct
implementations of correctly internationalized protocols cannot convey
information that is simply not available to them.  Better have belts *and*
suspenders in those cases.

Regards,

-- 
François
Received on Saturday, 5 October 2002 12:45:49 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Monday, 5 June 2006 15:10:54 GMT