- From: Edward C. Zimmermann <edz@elmyra.bsn.com>
- Date: Mon, 28 Jul 2003 16:21:02 +0200 (MEST)
- To: www-zig@w3.org
- Cc: a.sanders@mcc.ac.uk
> >Edward C. Zimmermann wrote: > >> A universe with a single record is just not very interesting >> for these kinds of methods is it not? > >I've seen single records that are several megabytes in size -- to >be able to treat such a record as a mini database might be quite >useful. And what is it one is search for? It depends upon what that record is.. And the kinds of S/R that might be applicable can be quite a different beast from the kind use to discover that record. Think here of genetic sequences where the tools applied to the record for "search" as a "mini-database" are to search for kinds of patterns and permuations that are quite distinct from the kind of more conventional search for content used to select these records from the larger pool of millions of records. I'm thinking also about encoded multimedia records where that "mini-search" may be "playback".. Or structure discovery lanauges. etc. etc. Or just grepping for other terms? This is possible with our conventional approach with "highlighting". One can also.. predefine a kind of substructure that allows for finer search.. For mundane things like large PDFs that can me multi-MB in size.. I define each page, for example, as a sub-record of a contents. Same model is possible with most all textual documents.. Since I have a concept of sentence, line, paragraph and page I can also via the high level search mechanisms also search for items on the same line, sentence, paragraph etc. > >Ashley. ______________________ Edward C. Zimmermann, Basis Systeme netzwerk, Munich <A HREF="http://www.stadtplandienst.de/query;ORT=m;PLZ=80802;STR=Leopoldstr%2E;HNR= 53;GR=2;PRINTER_FRIENDLY=TRUE">Leopoldstrasse 53-55, D-80802 Munich, Federal Republic of Germany</A> Telephone: Voice:= +49 (89) 385-47074 Fax:= +49 (89) 692-8150 Cellular:= +49 (179) 205-0539
Received on Monday, 28 July 2003 10:21:07 UTC