- From: Mark Nottingham <mnot@mnot.net>
- Date: Wed, 26 Mar 2008 00:15:38 +1100
- To: HTTP Working Group <ietf-http-wg@w3.org>
Trying to summarise, I think there are two separable issues here; * 1. Should HTTPBIS continue to accommodate historical clients that assume that an unlabeled text/* type is iso-8859-1, rather than MIME's default ASCII? Roy has argued that this is an important distinction that we should continue to make, as otherwise existing implementations will become non-conformant. Frank points out that no such implementations are in common use today, and that those implementations which did make this assumption have greater problems (e.g., lacking Host headers). It would be good to hear if anyone else has an opinion, especially if they have experience with / information about such clients, or content which relies upon this default. The conservative thing to do seems to be to keep the status quo. If we do that, rather than just close the issue as WONTFIX, we could modify the current text to clarify the defaulting (the original question was one of precedence between HTTP defaulting and that defined by the media type in question), and perhaps give a bit of the history. * 2. Should HTTPBIS countenance sniffing for character set on text/* types? a. ...when the charset parameter is not present? b. ...when the charset parameter is iso-8859-1 c. ...at other times? A few people have noted a security issue in a widely-used browser that requires (b). However, I haven't seen a reference to a vulnerability report, etc. yet; is anyone aware of one? Some people have spoken in favour of (a), but I note with interest this text in p3, 3.1.1; > Some HTTP/1.0 software has interpreted a Content-Type header without > charset parameter incorrectly to mean "recipient should guess." > Senders wishing to defeat this behavior MAY include a charset > parameter even when the charset is ISO-8859-1 ([ISO-8859-1]) and > SHOULD do so when it is known that it will not confuse the recipient. If we allow either (a) or (b), this will have to be re-worked. Also, it's notable that allowing (a) may make (1) easier to resolve in favour of dropping the HTTP-specific default; i.e., the default would shift from iso-8859-1 to "sniff". Does anyone think that these are so intertwined that (2) should not be a separate issue? If we can resolve it, (1) should follow. Cheers, -- Mark Nottingham http://www.mnot.net/
Received on Tuesday, 25 March 2008 13:16:27 UTC