Re: Fallback to UTF-8

On 24 Apr 2008, at 21:04, Jukka K. Korpela wrote:
> Henri Sivonen wrote:
>
>> Considering the real Web content, it is better to pick Windows-1252
>> than a hypothetical generic encoding.
>
> No, it's not because _in validation_ you don't need to make any  
> guess on
> the meanings of octets > 127 decimal. You're not supposed to render  
> them
> (apart from echoing them along with error messages, but they're not
> markup-significant) or to process them in any way but treating them as
> data characters.

The validator outputs both parts of the original source and it's own  
error messages. So whatever it outputs, it has to do so in a fashion  
compatible with the original document.

Would outputting entities for its own messages would solve that problem?

-- 
David Dorward
http://dorward.me.uk/
http://blog.dorward.me.uk/

Received on Thursday, 24 April 2008 21:15:35 UTC