Re: Search Engine CSS from Craig Francis on 2006-07-05 (www-style@w3.org from July 2006)

From: Craig Francis <craig@synergycms.com>
Date: Thu, 6 Jul 2006 00:32:23 +0100
To: www-style@w3.org
Message-Id: <23AE5D2E-D215-46A0-94E1-4B3036787D72@synergycms.com>

Hello Emrah,

I think its down to either using the robots.txt file, or the "robots"  
<meta> tag:

	<meta name="robots" content="noindex,nofollow" />


Both of these methods are only work for whole pages... they cannot  
target elements within the page (which is what I was attempting to  
improve on).

But please remember that these methods only act as a guide... some  
robots might/will ignore them.

http://www.searchtools.com/robots/robots-txt.html
http://www.robotstxt.org/wc/meta-user.html


Craig



On 5 Jul 2006, at 23:48, Emrah BASKAYA wrote:

>
> On Thu, 06 Jul 2006 01:39:43 +0300, Patrick H. Lauke  
> <redux@splintered.co.uk> wrote:
>
>>
>> Craig Francis wrote:
>>>  I have recently been thinking about how search engine spiders  
>>> index a website.
>>>  At the moment we can provide ways for the spiders to find the  
>>> page, but I do not think its possible for us to tell the spider  
>>> things like where the navigation bar is (as I think it should be  
>>> indexed differently to the main content of the page).
>>>  I have written a fairly small document explaining how a future  
>>> version of CSS could help present the documents structure and  
>>> data to search engine spiders in a better way than their current  
>>> guesswork methods.
>>>  It would be great if you can give me some feedback.
>>>  http://www.krang.org.uk/searchEngineCSS/
>>
>> You're proposing the use of CSS to impart semantic meaning (e.g.  
>> content: keywords and importance: 1) and behaviour (e.g. links:  
>> ignore)? Sorry, but this is certainly beyond the scope of what CSS  
>> should do.
>>
>> And no, the speech output CSS rules are not comparable: they  
>> define how something should be presented, aurally...not what the  
>> meaning or behaviour are.
>>
>> P
>
> I second Patrick's views. This is in HTML's scope, I believe.
>
> That said, currently, is there a way to to mark part of an html  
> document to not be spidered? For example, an author may not want  
> the comment section to be indexed at all by the search bot. Is  
> there such a way? I'd be glad if someone provided an answer..
>
> Emrah BASKAYA
> www.hesido.com
>

Received on Wednesday, 5 July 2006 23:32:34 UTC