- From: <bugzilla@wiggum.w3.org>
- Date: Mon, 11 Sep 2006 16:40:14 +0000
- To: public-qt-comments@w3.org
- CC:
http://www.w3.org/Bugs/Public/show_bug.cgi?id=3698
Summary: [FT] Interaction between FTDiacriticsOption and
collation unclear
Product: XPath / XQuery / XSLT
Version: Working drafts
Platform: All
OS/Version: All
Status: NEW
Severity: normal
Priority: P2
Component: Full Text
AssignedTo: jim.melton@acm.org
ReportedBy: doerre@de.ibm.com
QAContact: public-qt-comments@w3.org
Editorial
Some of the entries of the Diacritics Matrix in 3.2.2 do not clearly describe
what the intended comparison operation for the given case should be. In
particular, the entries for
- entry for UCC / "insensitive", which states "compare as if with and without"
(well, what???)
- 4 entries for UCC+CDS / "with" + "without diacritics", which use an
exemplary query.
The reader has no clue how to interpret those exemplary queries and even if
they are meant to show how to reduce the "with" and "without" options to the
other options, there are several problems with those queries.
E.g. in the entry for CDS / "with diacritics" the query stated there:
"resume diacritics insensitive" not in "resume"
(i) is syntactically not what it meant to be (probably: "resume" diacritics
insensitive not in "resume"),
(ii) depends on diacritic options higher up the query tree, or a specified
default for the diacritic option (note that the second "resume" term is matched
according to that diacritic setting);
and (iii) can never have a match in the default case where the second "resume"
is matched insensitive as well.
So maybe, this query should be:
"resume" diacritics insensitive not in "resume" diacritics sensitive
(which would indeed be an equivalent rewrite for "resume" with diacritics,
because the term "resume" is spelled deliberately without diacritics in the
second subquery), but then what would be the case for "without diacritics"?
Also the rewriting relies that we have control over whether the query term
contains diacritics itself and how it would need to be transposed in case it
did. In general, however, we cannot assume this. E.g. consider the query:
$node ftcontains $term with diacritics
/jochen
Received on Monday, 11 September 2006 16:40:24 UTC