Sync languages - HTML

On syncronizing multilingual parallel texts marked in HTML - more...

1) If possible, avoid new tag/attribute.
2) The mark must be clearly identify as a sync.
3) It must safely ignored by unconcerned programs.
4) Only one language per document (except for minor bits).
5) Some linguistic versions could be imcompleted (translated 40%).

6) Identification of the language in the header of the doc.  Usable also 
by robots (Altavista like).

7) Forward compatible with XML (other such as TEI ?).

8) At least two types of functionalities (could be the same tag):
   sync  .- This is the same point in the document.
   align .- This is the same "linguistic unit".

9) Capable to aling down to a word of part of a word level

   <TAG ID=1>im</TAG><TAG ID=2>possible</TAG>

10) NAME (in A)
11) ID
12) A new attribute in SPAN

13) More general solutions (elaborate).

Tomas

Received on Tuesday, 14 January 1997 17:00:27 UTC