[Bug 3289] utf8 web site causes tool to break

http://www.w3.org/Bugs/Public/show_bug.cgi?id=3289


ville.skytta@iki.fi changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
         AssignedTo|ville.skytta@iki.fi         |ot@w3.org




------- Comment #2 from ville.skytta@iki.fi  2006-05-25 08:54 -------
Checking http://www.amd.com/us-en/ does indeed seem to stop halfway through on
validator.w3.org.  However, that doesn't happen on qa-dev or on my local box,
even though both display the UTF-8 garbage warning too.  Enabling UTF-8 mode
in HTML::Parser avoids the warning, but doesn't appear to fix the actual
problem.
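
For illustration only, here is a rough Python sketch of the general
byte-versus-character problem that a warning about undecoded UTF-8 points at.
It is not the link checker's Perl code and makes no claim about how
HTML::Parser works internally; the sample string is made up.  It just shows
that expanding entities in text whose UTF-8 bytes were never decoded mixes
garbled characters with correct ones.

```python
import html

# Hypothetical page fragment: UTF-8 text plus a character entity,
# as raw bytes the way they would arrive over HTTP.
raw = "caf\u00e9 &euro;".encode("utf-8")

# Undecoded: each byte is treated as one character (Latin-1),
# then entities are expanded.  The two UTF-8 bytes of the accented
# letter become two separate, wrong characters.
garbled = html.unescape(raw.decode("latin-1"))

# Decoded: the bytes are decoded as UTF-8 first, then entities are expanded.
clean = html.unescape(raw.decode("utf-8"))

print(repr(garbled))
print(repr(clean))
```

The only difference between the two results is whether the input was decoded
before the entity was expanded.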

Based on that, I'm inclined to blame Perl or HTML::Parser, and am reassigning
to Olivier for comments on upgrade possibilities.  Relevant versions:
- validator.w3.org: perl 5.8.4, HTML::Parser 3.45
- qa-dev.w3.org: perl 5.8.8, HTML::Parser 3.54
- my local box: perl 5.8.8, HTML::Parser 3.51

Anyway, I have enabled UTF-8 mode in the CVS version of the link checker.  If I
understand the docs correctly, that should be a good thing to do in any case.

Reproducers:
http://validator.w3.org/checklink?uri=http%3A%2F%2Fwww.amd.com%2Fus-en%2F&hide_type=all&depth=&check=Check
http://qa-dev.w3.org/wlc/checklink?uri=http%3A%2F%2Fwww.amd.com%2Fus-en%2F&hide_type=all&depth=&check=Check
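
To try other pages against either instance, a reproducer URL of the same shape
can be built like this.  This is a minimal Python sketch; the parameter names
and values are copied from the URLs above, and the helper name checklink_url
is made up for the example.

```python
from urllib.parse import urlencode


def checklink_url(base, target):
    """Build a checklink URL for `target`, mirroring the reproducers' query string."""
    query = urlencode({
        "uri": target,        # page to check; percent-encoded by urlencode
        "hide_type": "all",
        "depth": "",          # left empty, as in the reproducers
        "check": "Check",
    })
    return f"{base}?{query}"


url = checklink_url("http://validator.w3.org/checklink",
                    "http://www.amd.com/us-en/")
print(url)
```

Swapping the base for http://qa-dev.w3.org/wlc/checklink gives the second
reproducer.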

Received on Thursday, 25 May 2006 08:54:24 UTC