- From: <bugzilla@wiggum.w3.org>
- Date: Mon, 15 Oct 2007 20:14:09 +0000
- To: public-qt-comments@w3.org
- CC:
http://www.w3.org/Bugs/Public/show_bug.cgi?id=4697 pcase@crs.loc.gov changed: What |Removed |Added ---------------------------------------------------------------------------- Status|ASSIGNED |RESOLVED Resolution| |FIXED ------- Comment #4 from pcase@crs.loc.gov 2007-10-15 20:14 ------- [1] The FTTF agreed. We removed the line: "The following definitions apply to full-text search:" and broke the items out of the list. Added Note to the "As XQuery and XPath evolve" paragraph. Reversed the last 2 sentences. [2b] The FTTF agreed. We removed "n-gram." [5a] Phrases are not part of the containment hierarchy. A phrase can cross sentence boundaries. No change made. [5b] The FTTF agreed. We removed these sentences: Whatever a tokenizer for a particular language chooses to do, it must preserve the containment hierarchy: paragraphs contain sentences, which contain tokens. The tokenizer must process two codepoint equal strings in the same way, i.e., it must identify the same tokens. Everything else about the behavior of the tokenizer is implementation-defined. [6c] The FTTF agreed. We consolidated the early introductions to tokenization into one place in 1.1, removing it from 2.1. We deleted some of the sentences in favor of a forward pointer to 4.1. These changes will appear in the next build of the internal Full-Text language after the October 11 build, and in the next public version. They close the last items in this bug. If you approve of the changes, please mark the bug closed.
Received on Monday, 15 October 2007 20:14:16 UTC