[Bug 4867] override encoding info in XML prolog to reflect transcoding

http://www.w3.org/Bugs/Public/show_bug.cgi?id=4867


ot@w3.org changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
           Severity|normal                      |major
             Status|NEW                         |ASSIGNED
            Summary|non UTF-8 pages cause XML   |override encoding info in
                   |error although it doesn't   |XML prolog to reflect
                   |have                        |transcoding
   Target Milestone|---                         |0.8.0




------- Comment #1 from ot@w3.org  2007-07-19 05:06 -------
Nice catch Masataka, thanks a lot.

I found out that the problem was with 
<?xml version="1.0" encoding="Shift_JIS"?>
which causes the XML parser to read the XML content as shift-jis, even though
the validator systematically transcodes everything to UTF-8 without passing it
to the different parsers.

I'm looking at whether I can tell the XML parser to ignore the encoding="..."
or whether I should be rewriting the value to be UTF-8.

Received on Thursday, 19 July 2007 05:06:14 UTC