- From: <bugzilla@wiggum.w3.org>
- Date: Thu, 19 Jul 2007 01:00:13 +0000
- To: www-validator-cvs@w3.org
- CC:
http://www.w3.org/Bugs/Public/show_bug.cgi?id=4867 Summary: non UTF-8 pages cause XML error although it doesn't have Product: Validator Version: 0.8.0b2 Platform: PC URL: http://www.mitsue.co.jp/ OS/Version: Windows XP Status: NEW Severity: normal Priority: P2 Component: Parser AssignedTo: dave.null@w3.org ReportedBy: yakura-masataka@mitsue.co.jp QAContact: www-validator-cvs@w3.org There seems to be a bug in the new XML parser. It doesn't recognize some Japanese encodings other than UTF, such as Shift_JIS, EUC-JP. Try validating http://www.mitsue.co.jp/ , you'll see some XML errors. But try saving the page in an XML format (mitsue.xml) and opening it in Firefox and Internet Explorer, I got no such errors. Rewrite the source substituting "shift_jis" for "UTF-8" and it will validate. Thus, the validator seems to have some encoding detection and handling issues. There are so many webpages with Shift_JIS or EUC-JP or whatever non-UTF. I'm afraid that launching the new validator without fixing that issue would cause serious confusion in Japanese market.
Received on Thursday, 19 July 2007 01:00:17 UTC