W3C home > Mailing lists > Public > www-style@w3.org > February 2009

Re: Unicode Normalization

From: Brad Kemper <brad.kemper@gmail.com>
Date: Wed, 4 Feb 2009 13:54:01 -0800
Message-Id: <D68C1478-0D8B-4B31-A120-56E3CA39B201@gmail.com>
To: Robert J Burns <rob@robburns.com>
Cc: Anne van Kesteren <annevk@opera.com>, Aryeh Gregor <Simetrical+w3c@gmail.com>, "public-i18n-core@w3.org" <public-i18n-core@w3.org>, "jonathan@jfkew.plus.com" <jonathan@jfkew.plus.com>, W3C Style List <www-style@w3.org>

Sent from my iPhone

On Feb 4, 2009, at 1:07 PM, Robert J Burns <rob@robburns.com> wrote:

> However Unicode has a SHOULD requirement that two canonically  
> equivalent but codepoint differing strings match. Unicode's Chapter  
> 3 (C6 norm) says:
>> A process shall not assume that the interpretations of two  
>> canonical-equivalent character sequences are distinct.

Your interpretation adds something that your quoted text does not  
include. The quoted text does not include "but code point differing".  
It seems quite clear (at least when read in isolation from the rest of  
the spec) that its simply saying that two canonical-equivalent  
character sequences MAY not be distinct. If they are are not code  
point differing then they wouldn't be distinct. Otherwise they would  
Received on Wednesday, 4 February 2009 21:54:42 UTC

This archive was generated by hypermail 2.3.1 : Monday, 2 May 2016 14:38:23 UTC