
Re: Library of Congress comments on W3C Draft Character Model

From: John Cowan <jcowan@reutershealth.com>
Date: Fri, 23 Feb 2001 12:38:59 -0500
Message-ID: <3A96A033.20102@reutershealth.com>
To: duerst@w3.org
CC: rden@loc.gov, www-i18n-comments@w3.org, w3c-i18n-ig@w3.org

Martin Duerst wrote:

> First, do you have any pointers to ANSEL? (web-based ones are preferred)

http://lcweb.loc.gov/marc/specifications/speccharlatin.html

> In particular, in this ansel system, are the diacritics stored before or
> after the base letter?

Before-the-base appears to be the usual convention.  But LOC is already
coping with that in its MARC systems, which use direct transliteration
but reorder the diacritics to after-the-base.  Direct transliteration
of ASCII/ANSEL followed by reordering almost gives Unicode NFD.  The
exception is that ANSEL encodes each of the CAPITAL and SMALL LETTERs
O and U WITH HORN as a single code, whereas NFD decomposes each into a
base letter followed by COMBINING HORN.
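
To make the transliterate-then-reorder step concrete, here is a minimal
sketch in Python.  It is not LOC's implementation: the function names are
hypothetical, and the byte values cover only a small illustrative subset
of ANSEL that should be checked against the LOC table cited above.

```python
import unicodedata

# Illustrative subset only; byte values are assumptions to verify
# against the LOC ANSEL table.  Combining marks precede their base letter.
ANSEL_COMBINING = {
    0xE1: "\u0300",  # grave
    0xE2: "\u0301",  # acute
    0xE3: "\u0302",  # circumflex
    0xE4: "\u0303",  # tilde
    0xE8: "\u0308",  # diaeresis
    0xF0: "\u0327",  # cedilla
}

# The horn letters are single spacing codes in ANSEL.
ANSEL_SPACING = {
    0xAC: "\u01A0",  # CAPITAL O WITH HORN
    0xAD: "\u01AF",  # CAPITAL U WITH HORN
    0xBC: "\u01A1",  # SMALL O WITH HORN
    0xBD: "\u01B0",  # SMALL U WITH HORN
}


def transcode_and_reorder(data: bytes) -> str:
    """Map each byte directly, moving diacritics to after their base."""
    out = []
    pending = []  # marks seen since the last base character
    for byte in data:
        if byte in ANSEL_COMBINING:
            pending.append(ANSEL_COMBINING[byte])
            continue
        if byte < 0x80:
            base = chr(byte)  # ASCII maps to itself
        else:
            # Raises KeyError for bytes outside this small subset.
            base = ANSEL_SPACING[byte]
        out.append(base)
        out.extend(pending)
        pending.clear()
    out.extend(pending)  # keep any trailing marks rather than drop them
    return "".join(out)


def ansel_to_nfd(data: bytes) -> str:
    """Reorder, then normalize so the horn letters are decomposed too."""
    return unicodedata.normalize("NFD", transcode_and_reorder(data))
```

With this sketch, transcode_and_reorder(b"caf\xe2e") already yields NFD
("cafe" followed by U+0301), while a horn letter such as b"\xbc" comes
out as the single character U+01A1 and only reaches NFD after the final
normalization step in ansel_to_nfd.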

 
-- 
There is / one art             || John Cowan <jcowan@reutershealth.com>
no more / no less              || http://www.reutershealth.com
to do / all things             || http://www.ccil.org/~cowan
with art- / lessness           \\ -- Piet Hein
Received on Friday, 23 February 2001 12:38:53 GMT