W3C home > Mailing lists > Public > www-validator-cvs@w3.org > July 2007

[Bug 4867] override encoding info in XML prolog to reflect transcoding

From: <bugzilla@wiggum.w3.org>
Date: Thu, 19 Jul 2007 05:06:08 +0000
To: www-validator-cvs@w3.org
Message-Id: <E1IBODI-0004P6-UY@wiggum.w3.org>


ot@w3.org changed:

           What    |Removed                     |Added
           Severity|normal                      |major
             Status|NEW                         |ASSIGNED
            Summary|non UTF-8 pages cause XML   |override encoding info in
                   |error although it doesn't   |XML prolog to reflect
                   |have                        |transcoding
   Target Milestone|---                         |0.8.0

------- Comment #1 from ot@w3.org  2007-07-19 05:06 -------
Nice catch Masataka, thanks a lot.

I found out that the problem was with 
<?xml version="1.0" encoding="Shift_JIS"?>
which causes the XML parser to read the XML content as shift-jis, even though
the validator systematically transcodes everything to UTF-8 without passing it
to the different parsers.

I'm looking at whether I can tell the XML parser to ignore the encoding="..."
or whether I should be rewriting the value to be UTF-8.
Received on Thursday, 19 July 2007 05:06:14 UTC

This archive was generated by hypermail 2.3.1 : Wednesday, 7 January 2015 15:17:29 UTC