[Bug 4697] [FT] editorial: 1.1 Full-Text Search and XML

http://www.w3.org/Bugs/Public/show_bug.cgi?id=4697


pcase@crs.loc.gov changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|ASSIGNED                    |RESOLVED
         Resolution|                            |FIXED




------- Comment #4 from pcase@crs.loc.gov  2007-10-15 20:14 -------
[1] The FTTF agreed.  We removed the line: "The following definitions apply to
full-text search:" and broke the items out of the list. Added Note to the "As
XQuery and XPath evolve" paragraph. Reversed the last 2 sentences. 
[2b] The FTTF agreed.  We removed "n-gram."
[5a] Phrases are not part of the containment hierarchy. A phrase can cross
sentence boundaries. No change made.
[5b] The FTTF agreed. We removed these sentences: 
Whatever a tokenizer for a particular language chooses to do, it must preserve
the containment hierarchy: paragraphs contain sentences, which contain tokens.
The tokenizer must process two codepoint equal strings in the same way, i.e.,
it must identify the same tokens. Everything else about the behavior of the
tokenizer is implementation-defined.
[6c] The FTTF agreed. We consolidated the early introductions to tokenization
into one place in 1.1, removing it from 2.1. We deleted some of the sentences
in favor of a forward pointer to 4.1.

These changes will appear in the next build of the internal Full-Text language
after the October 11 build, and in the next public version. They close the last
items in this bug. If you approve of the changes, please mark the bug closed.

Received on Monday, 15 October 2007 20:14:16 UTC