Re: Library of Congress comments on W3C Draft Character Model

Martin Duerst wrote:

> First, do you have any pointers to ANSEL? (web-based ones are preferred)

http://lcweb.loc.gov/marc/specifications/speccharlatin.html

> In particular, in this ansel system, are the diacritics stored before or
> after the base letter?

Seemingly, before-the-base is the usual convention.  But LOC is already
coping with that in its MARC systems, which use direct transliteration
but reorder to after-the-base.  Direct transliteration of ASCII/ANSEL
followed by reordering almost gives Unicode NFD, except that ANSEL
encodes CAPITAL and SMALL LETTERs O and U WITH HORN using a single code.

 
-- 
There is / one art             || John Cowan <jcowan@reutershealth.com>
no more / no less              || http://www.reutershealth.com
to do / all things             || http://www.ccil.org/~cowan
with art- / lessness           \\ -- Piet Hein

Received on Friday, 23 February 2001 12:38:53 UTC