Re: Embedded (inline) indexing tags

Has anyone though about how this new mechanism will interact with
indexing systems. Since spiders limit the amount of data that they
store isn't there a strong possibility that key metadata information
will not be indexed because it is too far down the document?

Orion Adrian

On Fri, 7 Jan 2005 15:00:36 +1100, trejkaz@trypticon.org
<trejkaz@trypticon.org> wrote:
> At Fri, Dec 31, 2004 at 06:00:20AM +0000, Thomas Hedden wrote:
> > I have always thought that there should
> > be some way of tagging words, phrases,
> > sentences, graphics (actually anything)
> > with an indexing tag that can be used to
> > generate a proper index. This is distinct
> > from META data, since META data is in the
> > header, and can only be used to find WEB
> > PAGES, not individual parts of web pages,
> 
> That view isn't particularly modern in light of the XHTML2 drafts.
> 
> Under XHTML2, you can add metadata to any element you want.  In fact, I imagine
> that the vast majority of metadata which would have been put in the header would
> find itself better placed elsewhere.
> 
>   Instead of:
>     <meta name="dc:title" content="My site"/>
>   You could have:
>     <h1 property="dc:title">My site</h1>
> 
>   Instead of:
>     <meta name="dc:copyright" content="Copyright 2005 ABC Corporation"/>
>   You could have:
>     <div property="dc:copyright">Copyright 2005 ABC Corporation</div>
> 
> The instant advantage you get is that you remove the redundant information.
> 
> Using the same attributes, you don't need to refer to an entire page, either.
> 
>   <h3 id="aa" about="#aa" property="dc:description" content="This section...">
>     ...
>   </h3>
> 
> Of course, the case you list with multiple properties defined on a single element
> still can't be done with attributes alone.  Still, this does give you a lot of what
> you thought was missing.  You can obviously tag keywords in this fashion too, just
> by spanning single words throughout the text.  Generating a list of keywords, then,
> becomes a trivial matter of matching all tags with the keyword property.
> 
> TX
> 
> --
>              Email: Trejkaz Xaoza <trejkaz@xaoza.net>
>           Web site: http://xaoza.net/trejkaz/
>          Jabber ID: trejkaz@jabber.xaoza.net
>    GPG Fingerprint: 9EEB 97D7 8F7B 7977 F39F  A62C B8C7 BC8B 037E EA73
> 
> 
>

Received on Friday, 7 January 2005 13:09:14 UTC