- From: Ville Skyttä <ville.skytta@iki.fi>
- Date: Sat, 22 Nov 2003 17:53:16 +0200
- To: www-validator@w3.org
On Thu, 2003-11-20 at 22:15, Dr. Georg Czedik-Eysenberg wrote:
> >> But the option "Check linked documents recursively"
> >> should provide the possibility, only to check links in
> >> documents on the same site as the originally entered document.
>
> > This is how the link checker should behave in CGI mode. When run from
> > the command line, one can use the --location option to configure the
> > recursion scope. Both ways behave as expected here.
>
> I am not sure, what you mean with "CGI mode" or "from the command line".
> I mean the "online version" at http://validator.w3.org/checklink.
Sorry for being unclear. "CGI mode" corresponds to the "online
version"; checklink can be also downloaded and run from the command line
locally.
> > Do you have an URL where unexpected results can be witnessed?
>
> Yes:
>
> http://validator.w3.org/checklink?uri=http%3A%2F%2Fgeorg.czedik.net%2Fungarn.htm&summary=on&hide_redirects=on&hide_type=all&recursive=on&depth=1&cookie=nochanges&check=Check
>
> does not only check the links in my documents http://georg.czedik.net/...
> but also in http://www.info-serve.de/...
Oh, indeed. Thanks for the sample URI! I believe I found out what was
causing this, it's links like these:
http://georg.czedik.net/cgi-bin/link-to.sh?http://www.info-serve2.de/[...]
Now, our recursion scope was http://georg.czednik.net/, and the above
link is in that scope. However, checklink did not check the scope again
when the URI was redirected to http://www.info-serve2.de/[...], causing
it to think that we're safely within the scope, and continuing the
recursion "off-site".
This should be now taken care of in the CVS revision 3.6.2.23 of
checklink [1], the scope is now checked both before and after ("after"
only if "before" was in scope), because I believe that's what most
people expect.
Before there is a public online version of the "fixed" checklink,
removing the link-to.sh CGI script and linking directly to the target
resources would be the only "workaround" I can come up with.
[1] http://dev.w3.org/cvsweb/validator/httpd/cgi-bin/checklink.pl
Received on Saturday, 22 November 2003 10:59:26 UTC