Re: Think before you write Semantic Web crawlers

On 21 Jun 2011, at 10:44, Martin Hepp wrote:
> PS: I will not release the IP ranges from which the trouble originated, but rest assured, there were top research institutions among them.

The right answer is: name and shame. That is the way to teach them.

Like Karl said, we should collect information about abusive crawlers so that site operators can defend themselves. It won't be *that* hard to research and collect the IP ranges of offending universities.

I started a list here:
http://www.w3.org/wiki/Bad_Crawlers

The list is currently empty. I hope it stays that way.

Thank you all,
Richard

Received on Wednesday, 22 June 2011 21:50:06 UTC