[Prev][Next][Index][Thread]

Re: A character is in the eye of the beholder



Jonathan Rosenne writes:

> The simplest transformation is to decompose all composites and sort all
> combining characters that attach to a single base character in binary
> order. This guarantees a unique and permanent canonical representation.
> An alternative is to replace all those combinations that are defined
> with the composite. This transformation is dependent on the version of
> the standard, since new composite characters are being discovered from
> time to time, but is still satisfactory.

The problem with this is that the standard sorting specifications are
done on the whole characters, not the "decomposed" composite
sequences. Also for that reason it would be advantegous to code
the information in the 10646 characters so you have support for
sorting.  Building on the 10646 standard allows you to draw on
all other ISO standardized work building on the standard, and thus
to have an aligned set of standard conforming specifications.

Keld