W3C home > Mailing lists > Public > public-i18n-its@w3.org > October to December 2005

[ESW Wiki] Update of "its0908LinguisticMarkup" by GoutamSaha

From: <w3t-archive+esw-wiki@w3.org>
Date: Wed, 21 Dec 2005 11:19:54 -0000
To: w3t-archive+esw-wiki@w3.org
Message-ID: <20051221111954.21851.1217@localhost.localdomain>
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "ESW Wiki" for change notification.

The following page has been changed by GoutamSaha:
http://esw.w3.org/topic/its0908LinguisticMarkup


------------------------------------------------------------------------------
  }}}
  
  Again, In Bangla and Hindi etc, languages, there is usage of '''Compounding (or Samas) ''' two or more words into a single short unified word where the suffix (Vibhakti) of the first word is removed while compounding. Meaning of the compounded word may not keep intact the meanings of all the constituent words.  In Bahubrihi Samas, the compounded word may mean even a different third word only. '''COMPOUNDING Markups''' are shown as below.   
- '''Both the Euphonic and Compounding Markups are very useful for understanding both syntactic and semantic meaning of various much used unified words in Bangla and Hindi languages etc.'''
+ '''Both the Euphonic and Compounding Markups are very useful for understanding both syntactic and semantic (or meaning) of various much used unified words in Bangla and Hindi languages etc.'''
   
  {{{
  
@@ -823, +823 @@

  <!-- Sapto (seven) Aher (days') Samahar (collection) -->
  
  </pos_cat>
+ 
+ <pos_cat name="noun" meaning="compoundDwigu_Chotuh_Aksharer_Samahar a word with four 
+ 
+ letters">Choturakshar
+ 
+ </pos_cat>
+ 
  
  <pos_cat name="noun" meaning="compoundBahubrihi_Su_Hridoi_Jahar good hearted person">Suhrid
  
@@ -996, +1003 @@

  
  }}}
  
- In many '''sentences''' we often use '''multilingual words'''. For example, in the Hindi sentence,  "Kaam joldi start Kijiye" (i.e., in English:- "Start the work immediately." Lexicons:- Kaam/ Work, Joldi/ immediately, Kijiye/ Do). Please note that there is an English word "start" in the source language Hindi sentence. Such usage of multilingual wordings are very common in any urban area. As we are providing the meaning of a foreign language word (e.g., start) in a sentence of some other source language, say, Hindi, so there won't be any problem for a translation parserfor understanding a sentence that contains multilingual words.
+ In many '''sentences''' we often use '''multilingual words'''. For example, in the Hindi sentence,  "Kaam Jaldi Start Kijiye" (i.e., in English:- "Start the work immediately." Lexicons:- Kaam/ Work, Jaldi/ immediately, Kijiye/ Do). Please note that there is an English word "start" in the source language Hindi sentence. Such usage of multilingual wordings are very common in any urban area. As we are providing the meaning of a foreign language word (e.g., start) in a sentence of some other source language, say, Hindi, so there won't be any problem for a translation parserfor understanding a sentence that contains multilingual words.
  
  {{{
+ 
  <!-- Markup for a Sentence having Multilingual Words -->
  
- <!-- Markup for the Hindi sentence "Kaam Joldi Start Kijiye" -->
+ <!-- Markup for the Hindi sentence "Kaam Jaldi Start Kijiye" -->
  
  <text xml:lang="hin">
  
  <sentence_cat name="imperative">
  
- Kam Joldi 
+ Kam Jaldi 
  
  <pos_cat name="verb" type="compound" meaning="start"> start kijiye </pos_cat>
  
@@ -1188, +1196 @@

  var time = d.getHours()
  
  if (time<12)
+ 
  {
  document.write("<b>Good morning</b>")
  }
+ 
  </script>
  
  </sentence_cat>
@@ -1309, +1319 @@

  <text xml:lang="ben">
  
  <!-- Sentence- Level Markups for SOV or SVO or OVS for a Bangla sentence- Paul Apple Kheyechhe
+ 
  (i.e., Paul has eaten apple - in English) -->
  
  <sentence_cat name="assertive" type="sov">
@@ -1328, +1339 @@

   Apple Kheyechhe Paul
  
  <!-- such OVS markup prevents the above sentence to be translated to-
+ 
       Apple has eaten Paul (a bad translation output) -->
  
  </sentence>
@@ -1357, +1369 @@

  Sisa Rahit Petrol</pos_cat>
  
  <!-- here, Rahit (as preceeded by a material noun) is not to mean a person but to
+ 
  mean- less or without  -->
     
  <pos_cat name="verb" type="phrase" meaning="is available">
@@ -1367, +1380 @@

  
  <!-- translated output in English- Leadless Petrol is available -->
  
- <!-- the above markups along with meaning attribute help translator a lot even if the source language specific resource is not rich with the localizers--> 
+ <!-- the above markups along with meaning attribute help translator a lot even if the source 
+ 
+ language specific resource is not rich with the localizers--> 
  
  </text>  
  
@@ -1388, +1403 @@

  </sentence_cat>
  
  <!-- here, Rahit (as preceeded by a material noun) is not to mean a person but to
+ 
  mean- less or without  -->
     
  <sentence_cat name="verb_phrase" meaning="is available">
@@ -1398, +1414 @@

  
  <!-- translated output in English- Leadless Petrol is available -->
  
- <!-- the above markups along with meaning attribute help translator a lot even if the source language specific resource is not rich with the localizers--> 
+ <!-- the above markups along with meaning attribute help translator a lot even if the source 
+ 
+ language specific resource is not rich with the localizers--> 
  
  </text>  
  
Received on Wednesday, 21 December 2005 12:31:59 UTC

This archive was generated by hypermail 2.3.1 : Tuesday, 6 January 2015 20:43:06 UTC