Re: Fallback to UTF-8

On Thu, 29 Nov 2007, Frank Ellermann wrote:

>> (b) Take ISO-8859-1 as fallback encoding (the default of RFC 2616).
>>     This will "work" if no bytes from 0x80 to 0x9F are present -
>>     hence with many of the traditional 8-bit character sets.
>>     Otherwise (if some bytes from 0x80 to 0x9F are found),
>>     give the usual errors about "non SGML character number ..."
>
> That's a variation of the current UTF-8 default,

No, it isn't! Please try it!

When the fallback is UTF-8 and the bytes cannot be decoded as UTF-8,
then there is only this illogical report and no further validation.

When the fallback is ISO-8859-1 and some bytes from 0x80 to 0x9F
occur, then the page is really checked and - possibly among others -
errors are reported about "non SGML characters".
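
To illustrate the difference, here is a rough sketch in Python. It is not
the validator's actual code; the function names, the sample bytes and the
wording of the messages are made up for this example. It only shows that a
strict UTF-8 decode can fail outright, while an ISO-8859-1 decode always
succeeds and the bytes from 0x80 to 0x9F can then be reported one by one.

```python
def check_with_utf8_fallback(data: bytes):
    """Hypothetical fallback (a): strict UTF-8, give up if decoding fails."""
    try:
        text = data.decode("utf-8")
    except UnicodeDecodeError as exc:
        # Decoding failed, so there is no text left to validate.
        return None, [f"cannot decode as UTF-8 at byte offset {exc.start}"]
    return text, []


def check_with_latin1_fallback(data: bytes):
    """Hypothetical fallback (b): ISO-8859-1, flag bytes 0x80 to 0x9F."""
    # Every byte value maps to a character, so this decode cannot fail.
    text = data.decode("iso-8859-1")
    errors = [
        f"non SGML character number {byte} at byte offset {offset}"
        for offset, byte in enumerate(data)
        if 0x80 <= byte <= 0x9F
    ]
    # The decoded text is still available for further checking.
    return text, errors


# Assumed sample: a page saved as windows-1252 with curly quotes (0x93, 0x94).
sample = b"<p>caf\xe9 \x93quoted\x94</p>"

utf8_text, utf8_errors = check_with_utf8_fallback(sample)
latin1_text, latin1_errors = check_with_latin1_fallback(sample)

print(utf8_text, utf8_errors)
print(latin1_text is not None, latin1_errors)
```

With the UTF-8 fallback the sample yields a single decoding error and no
text, whereas with the ISO-8859-1 fallback the text stays available and the
two offending bytes are listed individually.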

Received on Thursday, 29 November 2007 16:12:10 UTC