Re[2]: Lists Module Comments (Technical Note)

Hello again,

>> What about Cyrillic numbering styles all the necessary information is
>> provided actually. The numbering algorithm for, say, 'lower-russian'
>> is the same as for 'lower-alpha', i. e. all the list items are listed
>> in the note. I really see no problem here.
>> Please let us know what details exactly are needed.

> Well, there is the suggestion at one point in the specification that
> 'cyrillic' repeats the last character ('many') which is different from
> 'lower-latin'.
Please read the description for 'many' carefully. It *may* apply
only to _Old Slavonic_ language and not to any modern Cyrillic based
one.

> Also, your notes do not say which systems have a
> significant zero value, or what the suffix character should be
> (Ethiopic systems use U+1366, most others use U+002E).
I don't really understand what you meant by "significant zero value".
I believe alphabetical systems cannot have zero value, thus as I
already have said Cyrillic systems are much like "-alpha" and the
expected default behavior for them is the same as for "-alpha". The
suffix character is U+002e for "upper-" values and U+0029 RIGHT
PARANTHESIS for "lower-" values.

> In particular, you give upper-hexadecimal in the same table as
> upper-alpha, even though they use different algorithms.
Well, we though they use the same algorithm.

*alpha*  *hex*
 a ---.  00 ---.
 b    |  01    |
 ...  |   ...  |
 z    |  0f    |
aa <--.  10 <--.
ab       11

Anyway, that's the question of terminology.

>>> I have added lower- and upper- armenian, however I have conflicting
>>> information for the character(s) to use for 7000. Are you sure that
>>> the 7000 digit (U+0582/U+0552) should be combined with the 600 digit
>>> (U+0578/U+0548)?
>>
>> In fact the 7000 digit represents one of the Armenian letters and this
> 
> Due to ambiguities with words like "character" and "letter", I always try
> to be explicit with codepoints. Is 7000 represented by two codepoints or
> one? I know nothing about the armenian language.
The 34th letter of the Armenian alphabet = the symbol for 7000 are
both represented by two characters. I'm not sure but it seems that
Unicode is wrong in this respect as this letter is one of the oldest
and it's unclear why it should be encoded by two characters.
The suffix for "-armenian" is U+002e. See further details in the note.

>> has been checked for a number of times. Can you tell us where did you
>> get different information and what is this information?
> 
> It was probably based on information from Frank Tang's research.
Sorry, I've no idea who he is. However please note that at the moment
there are at least two errors related to 'armenian' value: 1) no
distinction between upper and lower values; and 2) incorrect
implementation in Mozilla. Who's fault is this?

Regards,
---
  Alexander "Croll" Savenkov                  http://www.thecroll.com/
  w3@hotbox.ru                                     http://croll.da.ru/


------------------
Get free mailbox 20 Mb at http://www.hotbox.ru

Received on Saturday, 19 October 2002 11:55:15 UTC