Re: Terms and display terms in scan

> Date: Wed, 25 Feb 2004 16:21:13 +0000 (GMT)
> From: Robert Sanderson <azaroth@liverpool.ac.uk>
>
> > (And why would anyone want a weird.readySoundexedAuthor index?  Users
> > will never search against it, so its only use for searching is to
> > follow up a scan; but that search is equivalent to any of
> >        dc.author =/stem smith
> >        dc.author =/stem smyth
> >        dc.author =/stem smythe
> > anyway, so why not just use one of those?)
> 
> It's not the same as stemming. The stem (using the Porter algorithm)
> of Smith is Smith.

Graaaghggh!  My typo, sorry.  I meant "phonetic" throughout.

> The results of the scan may be very odd. For example:
> 
> smash      (S520)
> sanchez    (S522)
> smith      (S530)
> sanders    (S536)
> snail      (S540)

:-)
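
To make the oddness concrete, here is a minimal Python sketch of how a
scan over such an index would come out. It assumes a simplified Soundex
in which "h" and "w" are treated like vowels (that is the variant that
yields S522 for "sanchez"; stricter Soundex rules would give S520). The
names soundex() and scan_order() are made up for illustration and are
not part of any CQL or SRW toolkit.

```python
# Simplified Soundex: letter -> digit table. Vowels, "h", "w" and "y"
# have no code; in this variant they also break up runs of equal codes.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for ch in letters:
        CODES[ch] = digit


def soundex(word):
    """Return a four-character code such as S530 (simplified variant)."""
    word = word.lower()
    digits = []
    prev = CODES.get(word[0], "")
    for ch in word[1:]:
        code = CODES.get(ch, "")
        # Keep a digit only if it differs from the preceding letter's code.
        if code and code != prev:
            digits.append(code)
        prev = code
    # Pad with zeros, then truncate to one letter plus three digits.
    return (word[0].upper() + "".join(digits) + "000")[:4]


def scan_order(terms):
    """Order terms as the index stores them: by computed value, then term."""
    return sorted(terms, key=lambda t: (soundex(t), t))


if __name__ == "__main__":
    for term in scan_order(["smith", "snail", "sanders", "smash", "sanchez"]):
        print(f"{term:<10} ({soundex(term)})")
```

Under that assumption the terms come back in the order quoted above, and
"smith", "smyth" and "smythe" all collapse to S530, which is why a search
on any one of them is equivalent to a search on the others.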

> It would be good to have an otherInfo for the actual value computed
> (S530, for example).

That was my thought at first -- that you want something that's sort of
the opposite of displayTerm -- but then I thought, well, actually, why?
What would you use it for?  I can't think of anything.  I really think
it is nothing but dirty laundry.

 _/|_	 _______________________________________________________________
/o ) \/  Mike Taylor  <mike@indexdata.com>  http://www.miketaylor.org.uk
)_v__/\  "I've thought ornithopods were by far the most painfully
	 boring dinosaurs for my entire life.  If I ever develop a
	 powerful sedative I'll name it Iguanodon" -- Matt Wedel.

--
Listen to my wife's new CD of kids' music, _Child's Play_, at
	http://www.pipedreaming.org.uk/childsplay/

Received on Wednesday, 25 February 2004 11:56:44 UTC