W3C home > Mailing lists > Public > public-qt-comments@w3.org > June 2004

RE: PLease define 'collation'

From: Jim Melton <jim.melton@acm.org>
Date: Thu, 10 Jun 2004 11:15:12 -0600
Message-Id: <>
To: Igor Hersht <igorh@ca.ibm.com>
Cc: Jim Melton <jim.melton@acm.org>, ashokmalhotra@alum.mit.edu, "Michael Kay" <mhk@mhk.me.uk>, public-qt-comments@w3.org, Stephen.Buxton@oracle.com


I'm not completely convinced that the collation itself defines how matching 
is done.  I think that the rules for matching are heavily influenced (even 
determined) by the semantics of the collation, but I think that the act of 
matching involves string comparisons, which is covered by my proposed 
definition of "collation".

In fact, I recently analyzed that very situation and believe that my 
analysis supports my statement above.  The secret seems to be in carefully 
reading the definitions for comparisons and matching in Unicode Technical 
Report #10, especially the definition for 'minimal matching', coupled with 
an understanding of 'ignorable collation elements'.

Once those are understood, I agree with you that the definitions of 
fn:contains, fn:starts-with, fn:ends-with, fn:substring-before and 
fn:substring-after are pretty obvious.

Hope this helps,

At 10:07 AM 6/10/2004 Thursday, Igor Hersht wrote:

> >The most pithy definition of "collation" that I can devise would read
>something like this: >collation: A specification of the manner in which
>character strings are compared and, by >extension, ordered.
>This is very similar to what I was trying to say.  I think that there is
>one point
>missing here - string matching. String "a" could be equal both "a" and
>Having just comparison would not give us unambiguous matching.
>I would say
>collation: A specification of the manner  which defines character strings
>comparing  ( by extension ordered) and matching.
>(May be it could be expressed in better English).
>Defining of the collation functions is a separate issue.
>I think that the definition of the functions
>  fn:contains, fn:starts-with, fn:ends-with,
>fn:substring-before and fn:substring-after are quiet obvious
>in terms of string matching.

Jim Melton --- Editor of ISO/IEC 9075-* (SQL)     Phone: +1.801.942.0144
Oracle Corporation        Oracle Email: jim dot melton at oracle dot com
1930 Viscounti Drive      Standards email: jim dot melton at acm dot org
Sandy, UT 84093-1063              Personal email: jim at melton dot name
USA                                                Fax : +1.801.942.3345
=  Facts are facts.  However, any opinions expressed are the opinions  =
=  only of myself and may or may not reflect the opinions of anybody   =
=  else with whom I may or may not have discussed the issues at hand.  =
Received on Thursday, 10 June 2004 13:19:47 UTC

This archive was generated by hypermail 2.3.1 : Wednesday, 7 January 2015 15:45:20 UTC