W3C home > Mailing lists > Public > public-i18n-its@w3.org > April to June 2006

[ESW Wiki] Update of "its0908LinguisticMarkup" by GoutamSaha

From: <nobody@w3.org>
Date: Tue, 18 Apr 2006 15:13:31 -0000
To: nobody@w3.org
Message-ID: <20060418151331.10531.90717@localhost.localdomain>

Dear Wiki user,

You have subscribed to a wiki page or wiki category on "ESW Wiki" for change notification.

The following page has been changed by GoutamSaha:
http://esw.w3.org/topic/its0908LinguisticMarkup

------------------------------------------------------------------------------
  
  }}}
    
-   
+ We often observe "hesitation phenomenon" during our daily conversation for example, in 
+ 
+ English we often use "well" during our daily conversation with the intention of taking some 
+ 
+ time before talking about something (e.g., during question answer session). Hesitation 
+ 
+ Phenomenon for example, the word we generate during our talk in hesitation: "Aya.. Aya .." in direct speech also needs to be handled properly for I18N and L10N. 
+ 
+ In Bengali we often use  ধর  (to assume)  গিয়ে  (on going)  বা  হোল  (is)  গিয়ে  (on going)
+  
+ 
+ during our conversation. Actually,  the word " ধর গিয়ে "  is intended to mean "assume" and
+  
+ 
+ the word " হোল গিয়ে " is intended to mean "is or are" only. The second part গিয়ে is to be 
+ 
+ 
+ ommitted in such occurrences of "hesitation phenomenon."  Similarly, the English word 
+ 
+ "well" in such "hesitation phenomenon" needs to be neglected or handled correctly during 
+ 
+ our conversation for localization process. All human languages might have such issues.
+ The markup for "Hesitation Phenomenon" is stated below:   
+ 
+ {{{
+ <!-- Example for Word-level Hesitation Markup -->
+  
+ <pos_cat name="verb" type="hesitation" meaning="to assume">  ধর গিয়ে  </pos_cat> 
+ 
+ <pos_cat name="verb" type="compound" meaning="go and catch" >  ধর  গিয়ে   </pos_cat> 
+ 
+ <pos_cat name="verb" type="hesitation" meaning="assume">  হোল  গিয়ে  </pos_cat> 
+ 
+ <pos_cat name="adverb" meaning="in good manner"> well </pos_cat> 
+ 
+ <pos_cat name="adverb" type="hesitation" meaning=" "> well </pos_cat>
+ 
+ <pos_cat name="adjective" meaning="in good health"> well </pos_cat> 
+ 
+ }}}
+  
  '''These markups are also useful for a content author to add ''disambiguation'' related metadata information in order to disambiguate a text / PCDATA in between "<" and ">"  from element tags''' . For an example, for the text say: ''Readers may refer to work in <Saha2005> for more information''. 
  Please note that though '''<Saha2005>''' looks identical to an element tag but it is not intended to mean it as an element tag. Rather, it is meant for readers' references only. How to convey such disambiguation information to an XML Parser ?  Solution to this problem is to markup the text in the following way in order to denote that '''<Saha2005>''' is not meant for an element tag.
  
Received on Wednesday, 19 April 2006 07:34:07 UTC

This archive was generated by hypermail 2.3.1 : Tuesday, 6 January 2015 20:43:07 UTC