W3C home > Mailing lists > Public > www-validator@w3.org > August 2007

Re: non sgml characters

From: Jukka K. Korpela <jkorpela@cs.tut.fi>
Date: Mon, 6 Aug 2007 16:40:54 +0300 (EEST)
To: Cristina Fiorentini <c.fiorentini@comune.fe.it>
cc: www-validator@w3.org
Message-ID: <Pine.SOC.4.64.0708061636250.5172@hopeatilhi.cs.tut.fi>

On Mon, 6 Aug 2007, Cristina Fiorentini wrote:

> I try to validate a web page containing non sgml characters such as MS Word 
> double quotes but the new validator doesn't check this type of error.

I cannot reproduce the problem: the current validator reports e.g.
   non SGML character number 148
when the document contains octet 148 (decimal), i.e. the octet that 
represents the right double quotation mark in windows-1252, and the 
document's encoding is declared as iso-8859-1. So no change.

By the way, such octets aren't really a problem (in validation or 
otherwise) if you declare the encoding as windows-1252.

> NOTE: Whenever possible, give the address of the document you were checking.

Did you notice that note? :-)

-- 
Jukka "Yucca" Korpela, http://www.cs.tut.fi/~jkorpela/
Received on Monday, 6 August 2007 13:41:04 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Wednesday, 25 April 2012 12:14:25 GMT