Re: Model question from Edward C. Zimmermann on 2003-07-28 (www-zig@w3.org from July 2003)

From: Edward C. Zimmermann <edz@elmyra.bsn.com>
Date: Mon, 28 Jul 2003 16:21:02 +0200 (MEST)
To: www-zig@w3.org
Cc: a.sanders@mcc.ac.uk
Message-Id: <200307281421.h6SEL2R07985@elmyra.bsn.com>

>
>Edward C. Zimmermann wrote:
>
>> A universe with a single record is just not very interesting
>> for these kinds of methods is it not? 
>
>I've seen single records that are several megabytes in size -- to
>be able to treat such a record as a mini database might be quite
>useful.

And what is it one is searching for?

It depends upon what that record is. And the kinds of S/R that might
be applicable can be quite a different beast from the kind used to
discover that record. Think here of genetic sequences, where the tools
applied to the record for "search" as a "mini-database" look
for kinds of patterns and permutations that are quite distinct from the
more conventional content search used to select these records
from the larger pool of millions of records.
I'm thinking also about encoded multimedia records where that "mini-search"
may be "playback".
Or structure discovery languages, etc.

Or just grepping for other terms? This is possible with our conventional
approach with "highlighting".

One can also predefine a kind of substructure that allows for finer
search.
For mundane things like large PDFs that can be multi-MB in size, I
define each page, for example, as a sub-record of the contents. The same
model is possible with almost all textual documents.
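
To make the page-as-sub-record idea concrete, here is a minimal sketch in
Python. It is not our actual implementation. It assumes the text has already
been extracted from the PDF with pages separated by form feed characters, and
the names SubRecord, split_into_pages and search_pages are hypothetical,
chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass
class SubRecord:
    """One page of a larger record, addressable on its own."""
    parent_id: str   # identifier of the enclosing record (hypothetical)
    page: int        # 1-based page number within the parent
    text: str        # extracted text of this page


def split_into_pages(parent_id, text):
    """Split extracted text into page sub-records.

    Assumption: pages are delimited by form feed characters ("\\f").
    """
    return [
        SubRecord(parent_id, number, page_text)
        for number, page_text in enumerate(text.split("\f"), start=1)
    ]


def search_pages(subrecords, term):
    """Return the page numbers whose text contains the term (case-insensitive)."""
    needle = term.lower()
    return [r.page for r in subrecords if needle in r.text.lower()]


if __name__ == "__main__":
    doc = "intro to Z39.50\fsearch and retrieval\fmore on search"
    pages = split_into_pages("doc1", doc)
    print(search_pages(pages, "search"))
```

A hit then points at a page inside the record rather than at the whole
multi-MB record.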

Since I have a concept of sentence, line, paragraph and page, I can also,
via the high-level search mechanisms, search for items on the same
line, sentence, paragraph etc.
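
A rough sketch of such a same-unit search, in Python. This is only an
illustration under simple assumptions, not how our engine does it: lines are
newline-delimited, paragraphs are separated by blank lines, sentences end at
".", "!" or "?" followed by whitespace, and the function names split_units
and same_unit_hits are hypothetical.

```python
import re


def split_units(text, unit):
    """Split text into lines, paragraphs or sentences (naive rules, see above)."""
    if unit == "line":
        parts = text.splitlines()
    elif unit == "paragraph":
        parts = re.split(r"\n\s*\n", text)
    elif unit == "sentence":
        parts = re.split(r"(?<=[.!?])\s+", text)
    else:
        raise ValueError("unknown unit: %r" % unit)
    # Drop empty pieces and surrounding whitespace.
    return [p.strip() for p in parts if p.strip()]


def same_unit_hits(text, terms, unit):
    """Return the units in which all terms occur together as whole words."""
    wanted = [t.lower() for t in terms]
    hits = []
    for unit_text in split_units(text, unit):
        words = set(re.findall(r"\w+", unit_text.lower()))
        if all(t in words for t in wanted):
            hits.append(unit_text)
    return hits


if __name__ == "__main__":
    sample = "The cat sat. The dog barked.\nA cat chased the dog.\n\nNo animals here."
    for unit in ("sentence", "line", "paragraph"):
        print(unit, same_unit_hits(sample, ["cat", "dog"], unit))
```

The same pair of terms can match at the line or paragraph level and still
miss at the sentence level, which is the point of having the finer units.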

>
>Ashley.

______________________
Edward C. Zimmermann, Basis Systeme netzwerk, Munich
Leopoldstrasse 53-55, D-80802 Munich, Federal Republic of Germany
Telephone:   Voice:= +49 (89) 385-47074  Fax:= +49 (89)  692-8150
          Cellular:= +49 (179) 205-0539

Received on Monday, 28 July 2003 10:21:07 UTC