Re: A character is in the eye of the beholder
Jonathan Rosenne wrote:
>Keld J|rn Simonsen wrote:
>> I am not sure why it does not work to follow the international
>> standards in this area. I am talking also of SC22/WG20 who is working
>> on sorting on the whole of 10646. I gave you a reference earlier.
>As far as I can see the proposed international sorting algorithm for
>10646 works just as well with combining characters.
Just to clear up the confusion a little bit:
- A sorting algorithm/standard of course should consider combining
characters, and I assume that the proposed standards do so.
- An algorithm that checks for equivalence (in the sense that the
user will see various encodings as equivalent in all cases)
can use sorting of codepoint as one step. This is elementary
binary sorting, and should not be confused with higher-level
sorting of characters.