W3C home > Mailing lists > Public > www-validator@w3.org > April 2008

Re: Fallback to UTF-8

From: David Dorward <david@dorward.me.uk>
Date: Thu, 24 Apr 2008 22:14:58 +0100
Message-Id: <E1748819-33BA-43CB-BAC5-89B54FF6B2AA@dorward.me.uk>
To: www-validator@w3.org


On 24 Apr 2008, at 21:04, Jukka K. Korpela wrote:
> Henri Sivonen wrote:
>
>> Considering the real Web content, it is better to pick Windows-1252
>> than a hypothetical generic encoding.
>
> No, it's not because _in validation_ you don't need to make any  
> guess on
> the meanings of octets > 127 decimal. You're not supposed to render  
> them
> (apart from echoing them along with error messages, but they're not
> markup-significant) or to process them in any way but treating them as
> data characters.

The validator outputs both parts of the original source and it's own  
error messages. So whatever it outputs, it has to do so in a fashion  
compatible with the original document.

Would outputting entities for its own messages would solve that problem?

-- 
David Dorward
http://dorward.me.uk/
http://blog.dorward.me.uk/
Received on Thursday, 24 April 2008 21:15:35 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Wednesday, 25 April 2012 12:14:29 GMT