W3C home > Mailing lists > Public > www-zig@w3.org > March 2002

Octet Strings and utf-8

From: Ray Denenberg <rden@loc.gov>
Date: Wed, 13 Mar 2002 16:21:51 -0500
Message-ID: <3C8FC2EF.CDCF9F8F@loc.gov>
CC: zig <www-zig@w3.org>
"LeVan,Ralph" wrote:

> Since all the clients I talk to send me their terms in the general option,
> and since I must somehow interpret the bytes in that field, I have to do a
> conversion anyway.  So, why shouldn't the UTF-8 negotiation apply?

Because the character set negotiation definition explicity applies to
InternationalString only.  If you arbitrarily decide that a particular
octetString type should be affected, well then what about for example the
referenceId?  Is that subject to utf-8 negotiation?  "What a silly question",
you say, "the reference id is binary". Well don't forget, the term can be
binary too.  How do you decide whether a given octet string is binary or
character?

--Ray
Received on Wednesday, 13 March 2002 16:21:14 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Thursday, 29 October 2009 06:12:22 GMT