[Bug 11885] [XQFTTS] english-stems.txt stemming dictionary

From: <bugzilla@jessica.w3.org>
Date: Wed, 23 Feb 2011 10:37:44 +0000
To: public-qt-comments@w3.org
Message-Id: <E1PsC6K-0000rj-N8@jessica.w3.org>
http://www.w3.org/Bugs/Public/show_bug.cgi?id=11885

--- Comment #4 from Tim Mills <tim@cbcl.co.uk> 2011-02-23 10:37:44 UTC ---
(In reply to comment #2)
> So how would one do case-insensitive stemming?

Assuming that

lowercase(AB) = ab
lowercase(Ab) = ab
lowercase(aB) = ab
lowercase(ab) = ab

one would ensure that, if the implementation's stemming algorithm is such that

stem(AB) = AB

then 

stem(Ab) = Ab
stem(aB) = aB
stem(ab) = ab

Thus, when the case option is case-insensitive, applying the case option to the
stem always returns 'ab' for each of AB, Ab, aB and ab.
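
As a rough illustration of the above, here is a minimal Python sketch. The
stem() function is a hypothetical toy stemmer (it only strips a trailing
's' or 'S' and otherwise preserves case); it is not the XQFTTS dictionary or
any real implementation's algorithm. str.lower() stands in for the
case-insensitive case option.

```python
def stem(word: str) -> str:
    """Hypothetical toy stemmer: drops one trailing 's'/'S', preserving case."""
    if len(word) > 1 and word[-1] in "sS":
        return word[:-1]
    return word


def case_insensitive_stem(word: str) -> str:
    """Apply the case option (assumed here to be lowercasing) to the stem."""
    return stem(word).lower()


variants = ["CATS", "Cats", "cAtS", "cats"]

# The stems differ only in case, mirroring stem(AB) = AB, stem(Ab) = Ab, ...
stems = [stem(w) for w in variants]

# After the case option is applied, every variant yields the same token.
folded = {case_insensitive_stem(w) for w in variants}
```

With this toy stemmer, the stems keep the case of their inputs, and applying
the case option afterwards collapses them to a single value.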

Received on Wednesday, 23 February 2011 10:37:47 UTC