Re: Unicode Normalization

A slight correction on this that isn't really all that  germane to the  
present conversation, but I felt I should make nonetheless and this  
correction helps improve understanding of the various issues.

On Feb 4, 2009, at 3:07 PM, I wrote:
> Take for example the following three strings (NFD, NFC and non- 
> normalized):
>
> 〈this string〉
> 〈this string〉
> 〈this string〉


Actually the first form is non-normalized too. The second string is  
conforming to both NFC and NFD. The third string is non-normalized as  
well

Just to provide further clarification each line is a separate string  
where the interior "this string" is an identical code point sequence  
irrelevant for normalization purposes. The angle brackets themselves  
however, have been encoded repeatedly as different code points despite  
Unicode offering no semantically distinct interpretation between the  
two code points.

Take care,
Rob

Received on Wednesday, 4 February 2009 23:46:59 UTC