- From: Philip Jägenstedt <philipj@opera.com>
- Date: Tue, 07 Sep 2010 15:16:28 +0200
On Tue, 07 Sep 2010 14:56:38 +0200, Boris Zbarsky <bzbarsky at mit.edu> wrote: > On 9/7/10 4:11 AM, Philip J?genstedt wrote: >> It's garbage in at least UTF-8, Big5 and GBK. > > Thanks. I assume that applies to the OggS\0 sequence too, right? I > appreciate the data! UTF-8, Big5 and GBK are all (as far as I know) ASCII supersets. Do real-world text documents include \0 bytes? (I don't know.) >> I'm not sure what infrastructure is in place, but perhaps one could >> *not* sniff if Content-Type also indicates an encoding? > > As long as "indicates an encoding" doesn't include UTF-8 or ISO-8859-1 > (thanks, Apache!), that should be reasonable, I think. Are you saying that Apache has, at various times, set the default character encoding to UTF-8 or ISO-8859-1? I was hoping that no encoding parameter at all would be sent :/ -- Philip J?genstedt Core Developer Opera Software
Received on Tuesday, 7 September 2010 06:16:28 UTC