- From: Mike Taylor <mike@indexdata.com>
- Date: Wed, 25 Feb 2004 16:55:42 GMT
- To: azaroth@liverpool.ac.uk
- Cc: www-zig@w3.org
> Date: Wed, 25 Feb 2004 16:21:13 +0000 (GMT) > From: Robert Sanderson <azaroth@liverpool.ac.uk> > > > (And why would anyone want a weird.readySoundexedAuthor index? Users > > will never search against it, so its only use for searching is to > > follow up a scan; but that search is equivalent to any of > > dc.author =/stem smith > > dc.author =/stem smyth > > dc.author =/stem smythe > > anyway, so why not just use one of those?) > > It's not the same as stemming. The stem (using the porter algorithm) > of Smith is Smith. Graaaghggh! My typo, sorry. I meant "phonetic" throughout. > The results of the scan may be very odd. For example: > > smash (S520) > sanchez (S522) > smith (S530) > sanders (S536) > snail (S540) :-) > It would be good to have an otherInfo for the actual value computed > (S530 for example. That was my thought at first -- that you want something that's sort of the opposite of diplayTerm -- but then I thought, well, actually, why? What would you use it for? I can't think of anything. I really think it is nothing but dirty laundry. _/|_ _______________________________________________________________ /o ) \/ Mike Taylor <mike@indexdata.com> http://www.miketaylor.org.uk )_v__/\ "I've thought ornithopods were by far the most painfully boring dinosaurs for my entire life. If I ever develop a powerful sedative I'll name it Iguanodon" -- Matt Wedel. -- Listen to my wife's new CD of kids' music, _Child's Play_, at http://www.pipedreaming.org.uk/childsplay/
Received on Wednesday, 25 February 2004 11:56:44 UTC