W3C home > Mailing lists > Public > public-qt-comments@w3.org > September 2009

[Bug 7630] [FO] There is no formal definition of the Unicode codepoint collation

From: <bugzilla@wiggum.w3.org>
Date: Tue, 29 Sep 2009 15:41:01 +0000
To: public-qt-comments@w3.org
Message-Id: <E1Msep3-0000Os-5P@wiggum.w3.org>
http://www.w3.org/Bugs/Public/show_bug.cgi?id=7630


Michael Kay <mike@saxonica.com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|NEW                         |RESOLVED
         Resolution|                            |FIXED




--- Comment #1 from Michael Kay <mike@saxonica.com>  2009-09-29 15:41:00 ---
The following proposal was accepted by the WG on 2009-09-29

ACTION A-411-02: MK will produce a textual proposal for resolving Bugzilla
#7630 (definition of the Unicode codepoint collation).

For the 1.0/2.0 specification:

Add a new paragraph after the current fourth paragraph of F+O section 7.3.1

The Unicode codepoint collation does not perform any normalization on the
supplied strings. It is defined as follows. Each of the two strings is
converted to a sequence of integers using the fn:string-to-codepoints
function. These two sequences $A and $B are then compared as follows: 

* If both sequences are empty, the strings are equal

* If one sequence is empty and the other is not, then the string
corresponding to the empty sequence is less than the other string

* If the first integer in $A is less than the first integer in $B, then the
string corresponding to $A is less than the string corresponding to $B.

* If the first integer in $A is greater than the first integer in $B, then
the string corresponding to $A is greater than the string corresponding to
$B.

* Otherwise (the first pair of integers are equal), the result is obtained
by applying the same rules recursively to fn:subsequence($A, 2) and
fn:subsequence($B, 2)

For the 1.1/2.1 specification: Use the same rules, but create a new section
containing the definition of the Unicode codepoint collation and refer to
this section from the appropriate places; and make "Unicode codepoint
collation" a defined term, hyperlinking all references to it.


-- 
Configure bugmail: http://www.w3.org/Bugs/Public/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the QA contact for the bug.
Received on Tuesday, 29 September 2009 15:41:11 UTC

This archive was generated by hypermail 2.3.1 : Wednesday, 7 January 2015 15:45:40 UTC