Re: Unicode Normalization from Brad Kemper on 2009-02-04 (www-style@w3.org from February 2009)

From: Brad Kemper <brad.kemper@gmail.com>
Date: Wed, 4 Feb 2009 13:54:01 -0800
To: Robert J Burns <rob@robburns.com>
Cc: Anne van Kesteren <annevk@opera.com>, Aryeh Gregor <Simetrical+w3c@gmail.com>, "public-i18n-core@w3.org" <public-i18n-core@w3.org>, "jonathan@jfkew.plus.com" <jonathan@jfkew.plus.com>, W3C Style List <www-style@w3.org>
Message-Id: <D68C1478-0D8B-4B31-A120-56E3CA39B201@gmail.com>

Sent from my iPhone

On Feb 4, 2009, at 1:07 PM, Robert J Burns <rob@robburns.com> wrote:

> However Unicode has a SHOULD requirement that two canonically  
> equivalent but codepoint differing strings match. Unicode's Chapter  
> 3 (C6 norm) says:
>>>
>>
>>
>
>> A process shall not assume that the interpretations of two  
>> canonical-equivalent character sequences are distinct.

Your interpretation adds something that your quoted text does not  
include. The quoted text does not include "but code point differing".  
It seems quite clear (at least when read in isolation from the rest of  
the spec) that its simply saying that two canonical-equivalent  
character sequences MAY not be distinct. If they are are not code  
point differing then they wouldn't be distinct. Otherwise they would  
be.

Received on Wednesday, 4 February 2009 21:54:42 UTC