- From: Nick Arnett <narnett@verity.com>
- Date: Tue, 26 Dec 1995 20:59:04 -0800
- To: Earl Hood <ehood@imagine.convex.com>, "Daniel W. Connolly" <connolly@beach.w3.org>
- Cc: www-talk@w3.org, MHonArc List <mhonarc@rosat.mpe-garching.mpg.de>
At 3:53 PM 12/26/95, Earl Hood wrote: >As for search engines, those can be hooked in independently; which some >have done with mhonarc. It is a waste of my time, and probably other >developers of mail processors, to write search engines when one can >already utilize well developed ones like Lycos, Glimpse, etc. I've been messing around periodically with a HyperMail-like gizmo for a couple of years now. (You could see its output at http://www.mccmedia.com/ if the server weren't dead, behind a locked door right now, but it should be back up tomorrow at the latest. This is the same machine known also as asearch.mccmedia.com, where I've been doing about 50 Web-related lists. It's been a mess lately, because I'm doing the conversion on a machine at home and ftp'ing the files to the server at Verity, while trying to get ISDN working... Say, y'all don't care, do ya?) Anyway, ISDN is working (at last) and I've reached a deal with O'Reilly & Associates to do a series of books on Internet scripting (JavaScript, AppleScript and VB are planned), so I'll surely re-implement and improve my gizmos with Java/JavaScript. (Examples from the book & all that.) The suggestions here are great. >> 4. Allow relational queries: by date, author, subject, message-id, >> keywords, or any combination. Essentially, treat the archive as a >> relational database table with fields message-id, from, date, subject, >> keywords, and body. > >This is best done by utilizing an existing database system (eg Oracle), >and using mhonarc (or other prefered mail->html filter) to convert >retrievied messages to html on-the-fly. Our search engine allows relational queries as well as full-text queries, so I've recently prototyped a version that will generate the menus and threads on the fly, as searches. Thus, the user can choose a starting date, number of days and/or messages to see, as well as applying other filters. Those filters can be saved as "topics," in our lingo, which means that they'll be pre-indexed as new documents are added, so that search results are quite zippy. (See the Verity marketing mumbles at www.verity.com for supporting arguments.) I'm quite interested in adding a Harvest interface, as well as implementing the sort of rating system that was mentioned here. In fact, I think that's the most interesting potential improvements... other involve collaborative knowledgebases, "scout" robots that look for related Web resources... many ideas, only so many hours in the day. But now that I'm not confining myself to languages that run on computers design in Cupertino, perhaps some collaboration is in order. In particular, who else is interested in working up some of this stuff in Java? I need those examples for the books... ;-) Nick
Received on Tuesday, 26 December 1995 23:58:12 UTC