W3C home > Mailing lists > Public > public-i18n-its@w3.org > October to December 2005

[ESW Wiki] Update of "its0908LinguisticMarkup" by GoutamSaha

From: <w3t-archive+esw-wiki@w3.org>
Date: Fri, 21 Oct 2005 00:57:52 -0000
To: w3t-archive+esw-wiki@w3.org
Message-ID: <20051021005752.855.64919@localhost.localdomain>
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "ESW Wiki" for change notification.

The following page has been changed by GoutamSaha:
http://esw.w3.org/topic/its0908LinguisticMarkup


------------------------------------------------------------------------------
  
  '''[Felix Sasaki'''- Maybe it was not clear in the minutes of the ITS f2f at ERCIM in September, but that was what we decided to do. Goutam, I hope that I understood you correct that you would agree on what Martin formulated - that we solve the current simple (they are hard enough) problems of ITS first and come back to linguistic markup later.''']]'''
  
- '''[[Goutam Saha- Excellent remarks/comments in deed. It will be nice if AZ, MJD, FS, RI,YS or anyone add and share their knowledge on this recently proposed scheme.]]'''
+ '''[[Goutam Saha- Excellent remarks/comments in deed. It will be nice if AZ, MJD, FS, RI,YS or anyone add and share their knowledge on this recently proposed scheme at an appropriate time.]]'''
  
  '''[[Felix Sasaki- Goutam, as I said before, and - sharing AZ's comment - I think this is a tremendous piece of work. Nevertheless, I share MJD view that currently we are very busy with the preparation of the first version of the ITS tag set. We do not want to stop your work on the topic of linguistic markup, but we can not take it into account in the near future. I tried to make that clear before, I hope that you understand that.''']]
  
@@ -147, +147 @@

  '''Metadata information about the domain, sentence type or specific words
  will help translators to do better quality work or to do the work quickly'''. 
  If translators know that a word belongs to a specific domain then they can go to a 
- terminoloy data base and check the word,  thus, even for human translators this 
+ terminology database and check the word,  thus, even for human translators this 
  3-Tier or 3-layer schema will be helpful. One cannot do an accurate translation
  without such information. 
   
@@ -322, +322 @@

  
  }}} 
  
- The same sentence "This boy is strong." can also be marked up in the following way without using finer parts-of-speech categories (depending on the requirements of a translation parser for a specifc language-pair).
+ The same sentence "This boy is strong." can also be marked up in the following way without using finer parts-of-speech categories (depending on the requirements of a translation parser for a specific language-pair).
  
  {{{
  <!-- Example for Adding Word-Level Markup -->
@@ -450, +450 @@

  
  <!-- 28 minutes to 9 (in Bangla) --> 
  
- <pos_cat name="time" type="poune hh" meaning="15 to hh"> Poune 9 </pos_cat>
+ <pos_cat name="time" type="poune hh" meaning="15 to hh"> poune 9 </pos_cat>
  
- <!-- 15 minutes to 9 (in Bangla) --> 
+ <!-- 15 minutes to 9 (in Bangla, Hindi etc) --> 
  
+ <pos_cat name="time" type="soaa hh" meaning="15 past hh"> soaa 9 </pos_cat>
  
+ <!-- 15 minutes past 9 (in Bangla, Hindi etc) -->
+ 
+ <pos_cat name="time" type="saare hh" meaning="30 past hh"> saare 9 </pos_cat>
+ 
+ <!-- 30 minutes past 9 (in Bangla, Hindi etc) --> 
+  
  
  }}}
  
@@ -565, +572 @@

  
     The proposed 3-Tier XML Schema aims to markup both syntactic and semantic metadata 
     information in the structure of an XML document. This approach is an excellent
-    solution to yield meningful translation. Such embedded information are very important
+    solution to yield meaningful translation. Such embedded information are very important
     to both the internationalization and localization processes. 
  
     We need to follow the following three basic steps of the
Received on Friday, 21 October 2005 11:20:47 UTC

This archive was generated by hypermail 2.3.1 : Tuesday, 6 January 2015 20:43:06 UTC