Martin Duerst wrote:

> First, do you have any pointers to ANSEL? (web-based ones are preferred)

http://lcweb.loc.gov/marc/specifications/speccharlatin.html

> In particular, in this ansel system, are the diacritics stored before or
> after the base letter?

Seemingly, before-the-base is the usual convention. But LOC is already
coping with that in its MARC systems, which use direct transliteration
but reorder to after-the-base.

Direct transliteration of ASCII/ANSEL followed by reordering almost gives
Unicode NFD, except that ANSEL encodes CAPITAL and SMALL LETTERs O and U
WITH HORN using a single code.

-- 
There is / one art       || John Cowan <jcowan@reutershealth.com>
no more / no less        || http://www.reutershealth.com
to do / all things       || http://www.ccil.org/~cowan
with art- / lessness     \\ -- Piet Hein

Received on Friday, 23 February 2001 12:38:53 UTC
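
The transliterate-then-reorder step described above can be sketched roughly as follows. This is only an illustration, not LOC's actual MARC code: the function names are made up, and the byte-to-code-point table is a small subset whose values are assumptions that should be checked against the LOC table linked in the message. It also shows why the result is only "almost" NFD: the horn letters come out as single precomposed code points, so a final normalization pass is still needed.

```python
import unicodedata

# Illustrative subset of ANSEL byte -> Unicode mappings.
# ASSUMPTION: these byte values should be verified against the LOC table.
ANSEL_TO_UNICODE = {
    0xE1: "\u0300",  # combining grave accent
    0xE2: "\u0301",  # combining acute accent
    0xE3: "\u0302",  # combining circumflex accent
    0xE8: "\u0308",  # combining diaeresis
    0xAC: "\u01A0",  # LATIN CAPITAL LETTER O WITH HORN (single code)
    0xBC: "\u01A1",  # LATIN SMALL LETTER O WITH HORN (single code)
    0xAD: "\u01AF",  # LATIN CAPITAL LETTER U WITH HORN (single code)
    0xBD: "\u01B0",  # LATIN SMALL LETTER U WITH HORN (single code)
}


def transliterate(data: bytes) -> str:
    """Map each byte to one code point, keeping the original order."""
    out = []
    for b in data:
        if b < 0x80:
            out.append(chr(b))  # ASCII range passes through unchanged
        elif b in ANSEL_TO_UNICODE:
            out.append(ANSEL_TO_UNICODE[b])
        else:
            raise ValueError(f"byte 0x{b:02X} is not in this illustrative table")
    return "".join(out)


def reorder(text: str) -> str:
    """Move combining marks from before their base letter to after it."""
    out = []
    pending = []  # marks seen since the last base character
    for ch in text:
        if unicodedata.combining(ch):
            pending.append(ch)
        else:
            out.append(ch)
            out.extend(pending)
            pending.clear()
    out.extend(pending)  # trailing marks with no base are kept as they are
    return "".join(out)


def ansel_to_nfd(data: bytes) -> str:
    """Transliterate, reorder, then normalize to decompose the horn letters."""
    return unicodedata.normalize("NFD", reorder(transliterate(data)))
```

For an input such as `b"caf\xe2e"` the reordering alone already yields a decomposed string; for the horn letters, only the final `unicodedata.normalize("NFD", ...)` call splits the single code into a base letter plus U+031B COMBINING HORN.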