Re: checklink: Link checker unable to read working urls

From: Bjoern Hoehrmann <derhoermi@gmx.net>
Date: Sun, 19 Sep 2004 00:03:01 +0200
To: "J. Grant" <jg@jguk.org>
Cc: www-validator@w3.org
Message-ID: <414cac5f.29941493@smtp.bjoern.hoehrmann.de>

* J. Grant wrote:
>I wonder if you are aware of the present link problem with the link 
>checker?  It seems unable to read valid, working urls.  I checked these 
>in my browser just now.  Any ideas?
>=============================================================
>http://www.panasonic.co.uk/dvd-recorders/dmre85hebs/index.htm
>     What to do: The link is broken. Fix it NOW!
>     Response status code: 404
>     Response message: Object Not Found
>     Line: 130

The server is broken: it responds with 404 to the HEAD request *and* the
response contains a body, which a response to HEAD must never include:

  % http-head http://www.panasonic.co.uk/dvd-recorders/dmre85hebs/index.htm
  HTTP/1.1 404 Object Not Found
  Date: Sat, 18 Sep 2004 21:47:08 GMT
  Connection: close
  Content-Type: text/html
  Content-Length: 102
  
  <html><head><title>Error</title></head><body>The system cannot find the file specified. </body></html>
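
For anyone who wants to reproduce this kind of check without http-head, here
is a minimal Python sketch. It assumes plain HTTP on port 80 and uses a raw
socket, because higher-level clients tend to discard anything that follows
the headers of a HEAD response. The names raw_head and inspect_head_response
are made up for this illustration and are not part of checklink.

```python
import socket


def raw_head(host, path, port=80, timeout=10.0):
    """Send a bare HEAD request and return every byte the server sends back."""
    request = (
        f"HEAD {path} HTTP/1.1\r\n"
        f"Host: {host}\r\n"
        "Connection: close\r\n"
        "\r\n"
    )
    chunks = []
    with socket.create_connection((host, port), timeout=timeout) as sock:
        sock.sendall(request.encode("ascii"))
        while True:
            chunk = sock.recv(4096)
            if not chunk:  # server closed the connection
                break
            chunks.append(chunk)
    return b"".join(chunks)


def inspect_head_response(raw):
    """Return (status_code, body_bytes) for a raw HTTP response.

    A well-behaved server sends no body for HEAD, so body_bytes
    should be empty.
    """
    head, _, body = raw.partition(b"\r\n\r\n")
    status_line = head.split(b"\r\n", 1)[0].decode("latin-1")
    # Status line looks like: HTTP/1.1 404 Object Not Found
    _version, code, *_reason = status_line.split(" ", 2)
    return int(code), body
```

Calling inspect_head_response(raw_head(host, path)) for a given host and path
returns the status code together with any stray bytes after the headers; a
non-empty second value would indicate the same misbehaviour as above.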

>http://developers.slashdot.org/article.pl?sid=03/02/23/1939225&mode=thread&tid=156
>http://slashdot.org/article.pl?sid=04/08/18/2257257&tid=126&tid=1
>http://en.wikipedia.org/wiki/Vorbis

Slashdot has a robots.txt that prohibits access (though I would expect
checklink to tell you that, so there might be something else going on).
wikipedia.org blocks checklink:

  % HEAD -H"User-Agent: W3C-checklink/4.0 [4.5] libwww-perl/5.800" http://en.wikipedia.org/wiki/Vorbis
  HTTP/1.1 403 Forbidden
  Connection: close
  Date: Sat, 18 Sep 2004 22:02:24 GMT
  Server: squid/2.5.STABLE4-20040219
  Content-Length: 1120
  Content-Type: text/html
  Expires: Sat, 18 Sep 2004 22:02:24 GMT
  Client-Date: Sat, 18 Sep 2004 22:02:09 GMT
  Client-Peer: 207.142.131.245:80
  Client-Response-Num: 1
  Mime-Version: 1.0
  X-Cache: MISS from wikipedia.org
  X-Squid-Error: ERR_ACCESS_DENIED 0
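
Both causes can be checked by hand. The following Python sketch uses only the
standard library: urllib.robotparser to test a URL against robots.txt rules,
and urllib.request to send a HEAD request with a chosen User-Agent. The helper
names are hypothetical, and the rules in the usage note are an invented
example, not Slashdot's actual robots.txt.

```python
import urllib.error
import urllib.request
import urllib.robotparser

# User-Agent string taken from the HEAD command above.
AGENT = "W3C-checklink/4.0 [4.5] libwww-perl/5.800"


def allowed_by_robots(robots_txt, agent, url):
    """Return True if the given robots.txt text lets `agent` fetch `url`."""
    parser = urllib.robotparser.RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(agent, url)


def build_head_request(url, agent):
    """Build a HEAD request that identifies itself with `agent`."""
    return urllib.request.Request(
        url, method="HEAD", headers={"User-Agent": agent}
    )


def status_for_agent(url, agent, timeout=10.0):
    """Return the HTTP status the server gives this User-Agent."""
    try:
        with urllib.request.urlopen(
            build_head_request(url, agent), timeout=timeout
        ) as response:
            return response.status
    except urllib.error.HTTPError as err:
        # urllib raises for 4xx/5xx; the status code is what we want here.
        return err.code
```

With rules such as "User-agent: *" followed by "Disallow: /article.pl",
allowed_by_robots returns False for any /article.pl URL. Comparing
status_for_agent(url, AGENT) with the result for a browser-like User-Agent
shows whether a server rejects a client purely on the User-Agent header.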
Received on Saturday, 18 September 2004 22:03:49 GMT