Links to the original document

From: Alexander Melkov <melkov@comptek.ru>
Date: Mon, 27 Jan 2003 02:31:53 +0300
Message-ID: <004c01c2c593$1c6216d0$03010101@uuuu>
To: <www-talk@w3.org>

Modern search engines can detect duplicate and near-duplicate documents on the web. Among the
duplicates found, which one is the primary copy? Where is the original document?

Ranking the duplicates by a computed document weight (such as link popularity) gives fairly good results.
The copy chosen this way is not always the right one, though.

The idea offered by Martijn Koster was to use the following tag:
<LINK rel=original href="some url">
(The value of the "rel" attribute is open to discussion; "original" is probably not the best choice.)
(His mail is quoted here:
http://www.searchengines.ru/forum/showthread.php?s=&threadid=3028#post25069 )
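
To make the proposal concrete, here is a rough sketch of how a crawler might read such a tag from a page it has classified as a duplicate. It is only an illustration under assumptions of mine: the names OriginalLinkParser and find_original are made up, the example URLs are placeholders, and "original" is just the provisional rel value from above. No search engine is known to work this way.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class OriginalLinkParser(HTMLParser):
    """Collects href values of <LINK rel=original> tags (hypothetical helper)."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.originals = []

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag and attribute names for us.
        if tag != "link":
            return
        attr = dict(attrs)
        # rel may hold several space-separated values; compare case-insensitively.
        rel_values = (attr.get("rel") or "").lower().split()
        href = attr.get("href")
        if "original" in rel_values and href:
            # Resolve relative hrefs against the URL of the duplicate page.
            self.originals.append(urljoin(self.base_url, href))


def find_original(html_text, page_url):
    """Return the first declared original URL, or None if the page declares none."""
    parser = OriginalLinkParser(page_url)
    parser.feed(html_text)
    return parser.originals[0] if parser.originals else None
```

A crawler could call find_original(page_html, page_url) on each duplicate and fall back to its usual weight-based choice whenever the result is None.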

Also, I don't see any contradiction between the Host field in robots.txt and this tag.

What do you think:
a) Will search engines support this tag, if offered?
b) Will webmasters support this tag?
c) What name should this tag have?
Received on Sunday, 26 January 2003 18:31:52 GMT
