Re: HTML and Search Engines from Joe English on 1995-08-24 (www-html@w3.org from August 1995)

From: Joe English <jenglish@crl.com>
Date: Thu, 24 Aug 1995 13:19:43 -0700
To: www-html@w3.org
Message-Id: <199508242020.AA22435@mail.crl.com>

Jay Kammerzell <jkammerz@vth1.vth.colostate.edu> wrote:

> Is there a way that standard document data could be included 
> in a referenced document for WWW search engines to glean rather than 
> taking the first few lines of that document?
> 
> It would seem that having some set of attributes that didn't display 
> but was available to a search engine could improve database searches and 
> make the returned information more meaningful. 

The canonical way to do this is to put:

	<META NAME="KEYWORDS" CONTENT="text to index here...">

in the document <HEAD>.  You can use as many <META NAME=KEYWORDS> elements
as you like.  Browsers won't display the text.

I don't know which, if any, current search engines actually
look in <META> elements.  Sophisticated engines that understand
HTML ought to check for META elements, and naive ones that just
index all text will pick up the keywords too.  

Moderately smart search engines that understand basic SGML syntax 
but not HTML semantics will most likely ignore keywords entered
in this way, since they're part of the markup.  Such engines are
also likely to ignore  too, so there's
no way that I can think of to make text invisible to browsers but
visible to these.

Again, I have no idea how any of the common search engines actually
work...

--Joe English

  jenglish@crl.com

Received on Thursday, 24 August 1995 16:58:14 UTC