W3C home > Mailing lists > Public > www-validator@w3.org > September 2004

Re: checklink: Link checker unable to read working urls

From: Bjoern Hoehrmann <derhoermi@gmx.net>
Date: Sun, 19 Sep 2004 00:03:01 +0200
To: "J. Grant" <jg@jguk.org>
Cc: www-validator@w3.org
Message-ID: <414cac5f.29941493@smtp.bjoern.hoehrmann.de>

* J. Grant wrote:
>I wonder if you are aware of the present link problem with the link 
>checker?  It seems unable to read valid, working urls.  I checked these 
>in my browser just now.  Any ideas?
>     What to do: The link is broken. Fix it NOW!
>     Response status code: 404
>     Response message: Object Not Found
>     Line: 130

The server is broken, it responds with 404 to the HEAD request *and* the
response contains a body,

  % http-head http://www.panasonic.co.uk/dvd-recorders/dmre85hebs/index.htm
  HTTP/1.1 404 Object Not Found
  Date: Sat, 18 Sep 2004 21:47:08 GMT
  Connection: close
  Content-Type: text/html
  Content-Length: 102
  <html><head><title>Error</title></head><body>The system cannot find the file specified. </body></html>


Slashdot has a robots.txt that prohibes access (though I would expect
checklink to tell you that so there might be something else going on),
wikipedia.org blocks checklink,

  % HEAD -H"User-Agent: W3C-checklink/4.0 [4.5] libwww-perl/5.800" http://en.wikipedia.org/wiki/Vorbis
  HTTP/1.1 403 Forbidden
  Connection: close
  Date: Sat, 18 Sep 2004 22:02:24 GMT
  Server: squid/2.5.STABLE4-20040219
  Content-Length: 1120
  Content-Type: text/html
  Expires: Sat, 18 Sep 2004 22:02:24 GMT
  Client-Date: Sat, 18 Sep 2004 22:02:09 GMT
  Client-Response-Num: 1
  Mime-Version: 1.0
  X-Cache: MISS from wikipedia.org
  X-Squid-Error: ERR_ACCESS_DENIED 0
Received on Saturday, 18 September 2004 22:03:49 UTC

This archive was generated by hypermail 2.3.1 : Wednesday, 7 January 2015 15:30:45 UTC