- From: Klaus Weide <kweide@tezcat.com>
- Date: Tue, 10 Dec 1996 02:44:14 -0600 (CST)
- To: Koen Holtman <koen@win.tue.nl>
- cc: http-wg%cuckoo.hpl.hp.com@hplb.hpl.hp.com, www-international@w3.org
On Sun, 8 Dec 1996, Koen Holtman wrote: > Klaus Weide: > > > [...using feature negotiation to negotiate on UTF-8....] > > >Maybe it is the most practical way. But no mechanism is in place yet, > >while overloading the language header (and associated inventiveness with > >new HTML tags) can be done now... > > Overloading a HTTP header and adding HTML tags will take _much_ more time > than waiting for feature negotiation to be in place. Let's hope so :). However, with overloading I meant treating {Content,Accept}-Language headers (and related HTML tags or attributes) as carrying character repertoire meaning - which is happening now. > But skimming the UTF-8 specification, I gather that UTF-8 is an encoding > mechanism, not a character set. HTTP offers the > Accept-Encoding/Content-encoding headers to negotiate on this. Or does > using Accept-Encoding only shift the problem to negotiating which part > of UCS you can render? It would be *nice* if UTF-8 could be treated that way, like a C-E (or C-T-E, for mail). It could then be used for labelling and negotiation of character encoding orthogonal to the question of repertoire. But that isn't the case. > When we reviewed the Accept-* header definitions for HTTP/1.1 early this > year, we did not discuss the particular problem of character sets which > could only be partially rendered, as would often be the case with unicode > stuff. It is certainly possible that HTTP/1.1 cannot solve this problem, > and maybe HTTP/1.1 + feature negotation also can't solve it. [...] > > I don't think that a HTTP/HTML/Web specific > >feature tag registration can take over the IANA charset registry's > ^^^^^^^^^ > >function. > > We are not aiming to take over any existing IANA registry. I didn't mean that you were trying to do that. But using feature tags for negotiating (labelling) sub-repertoire, i.e. "which characters can be (are) used" with utf-8, would effectively amount to using them for a function that could up to recently be done using IANA registered charsets alone. Klaus
Received on Tuesday, 10 December 1996 03:44:09 UTC