W3C home > Mailing lists > Public > public-qa-dev@w3.org > April 2007

Non SGML char errors with validator HEAD

From: Ville Skyttä <ville.skytta@iki.fi>
Date: Sat, 28 Apr 2007 15:05:13 +0300
To: "QA-dev" <public-qa-dev@w3.org>
Message-Id: <200704281505.13824.ville.skytta@iki.fi>

Hello,

I'm running the HEAD validator locally (Fedora Core 6 x86_64, OpenSP 1.5.2 
from Fedora Core, SGML::Parser::OpenSP 0.99 locally built), and I'm getting 
non SGML char errors where validator-test nor qa-dev's HEAD setup shows them.

Ideas where to look for the problem?  I saw 
http://www.w3.org/Bugs/Public/show_bug.cgi?id=3164 but I gather it's 
supposedly already fixed in SPO 0.99, and I'm not sure if it's even the same 
issue.

For example, when validating http://www.w3.org/, I get these:

Error  Line 529, Column 145: non SGML character number 155.
…ml:lang="zh-hans" lang="zh-hans">中���</span>

Error  Line 535, Column 150: non SGML character number 150.
…lang="de" lang="de">Deutschland und ��sterreich</span> (Germany and Austria)<

Line 537, Column 122: non SGML character number 149.
…rect"><span xml:lang="el" lang="el">��λλάδα</span>

Error  Line 540, Column 165: non SGML character number 153.
… xml:lang="zh-hant" lang="zh-hant">��港</span> (Hong Kong)</a></li>

Line 548, Column 136: non SGML character number 153.
…rect"><span xml:lang="he" lang="he">��שראל</span>

Line 548, Column 142: non SGML character number 144.
…<span xml:lang="he" lang="he">ישר��ל</span>

Line 548, Column 144: non SGML character number 156.
…pan xml:lang="he" lang="he">ישרא��</span>

Line 554, Column 124: non SGML character number 149.
…rect"><span xml:lang="ko" lang="ko">���국</span> (Korea)</a></li>

Line 554, Column 125: non SGML character number 156.
…ect"><span xml:lang="ko" lang="ko">��국</span> (Korea)</a></li>

Line 556, Column 129: non SGML character number 132.
…ct"><span xml:lang="ar" lang="ar">ا��مغرب</span>

Line 556, Column 131: non SGML character number 133.
…"><span xml:lang="ar" lang="ar">ال��غرب</span>
Received on Saturday, 28 April 2007 12:05:16 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Thursday, 19 August 2010 18:12:48 GMT