
[Bug 6469] [FT] TestSuite issues

From: <bugzilla@wiggum.w3.org>
Date: Fri, 06 Feb 2009 23:13:02 +0000
To: public-qt-comments@w3.org
Message-Id: <E1LVZsc-0006jz-Bs@wiggum.w3.org>

http://www.w3.org/Bugs/Public/show_bug.cgi?id=6469

--- Comment #6 from Jim Melton <jim.melton@acm.org>  2009-02-06 23:13:02 ---
Christian, I think that I still disagree. The only way that "a.b" could contain
"a b" or vice versa is if the tokenizer recognized "." as a token separator and
not as a token. Because tokenization is so completely implementation-defined,
anything is possible. Maybe Pat or Mary will have some good ideas about this,
'cause I have too little real-world Full Text experience to be very certain. 
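
To make the point above concrete, here is a minimal sketch in Python of two hypothetical tokenizers, one that treats "." as a token separator and one that treats it as a token. Both functions and the `contains_phrase` helper are illustrative assumptions, not anything defined by the Full Text specification or by a particular implementation.

```python
import re

def tokenize_dot_as_separator(text):
    # Hypothetical tokenizer: every run of non-alphanumeric characters,
    # including ".", only separates tokens and is then discarded.
    return re.findall(r"[A-Za-z0-9]+", text)

def tokenize_dot_as_token(text):
    # Hypothetical tokenizer: "." is kept as a token of its own;
    # whitespace still only separates tokens.
    return re.findall(r"[A-Za-z0-9]+|\.", text)

def contains_phrase(haystack_tokens, needle_tokens):
    # True if needle_tokens occurs as a contiguous run in haystack_tokens.
    n = len(needle_tokens)
    return any(
        haystack_tokens[i:i + n] == needle_tokens
        for i in range(len(haystack_tokens) - n + 1)
    )

for tokenize in (tokenize_dot_as_separator, tokenize_dot_as_token):
    haystack = tokenize("a.b")
    needle = tokenize("a b")
    print(tokenize.__name__, haystack, contains_phrase(haystack, needle))
```

Under the first tokenizer both strings become the same two tokens, so each contains the other; under the second, the "." token sits between "a" and "b" and the phrase no longer matches.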

W.R.T. your question about whether spaces can be recognized as tokens: Again,
because tokenization is so completely implementation-defined, it's possible.
However, I do not believe that a tokenizer that did that would survive in the
marketplace, so I don't believe we need to accommodate that possibility. 

Therefore, your query:
   'A B' ftcontains ('A' ftand 'B') distance at most 0 words
would, IMHO, always return true. 
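
As a rough illustration of why the answer depends on whether spaces are tokens, the Python sketch below models "distance" simply as the number of tokens lying between two matched tokens. That model, the two tokenizers, and the `within_distance` helper are simplifying assumptions made for this example; they are not the specification's full matching semantics.

```python
import re

def tokenize_words(text):
    # Hypothetical tokenizer: whitespace separates tokens and is discarded.
    return text.split()

def tokenize_with_spaces(text):
    # Hypothetical tokenizer: runs of whitespace are kept as tokens too.
    return re.findall(r"\S+|\s+", text)

def within_distance(tokens, first, second, max_words):
    # True if some occurrence of `first` and some occurrence of `second`
    # have at most `max_words` tokens between them (in either order).
    pos_first = [i for i, t in enumerate(tokens) if t == first]
    pos_second = [i for i, t in enumerate(tokens) if t == second]
    return any(
        abs(i - j) - 1 <= max_words
        for i in pos_first
        for j in pos_second
        if i != j
    )

print(within_distance(tokenize_words("A B"), "A", "B", 0))
print(within_distance(tokenize_with_spaces("A B"), "A", "B", 0))
```

With the whitespace-discarding tokenizer there are no tokens between "A" and "B", so the check succeeds; with the tokenizer that keeps the space as a token, one token intervenes and the same check fails.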


Received on Friday, 6 February 2009 23:13:10 UTC
