W3C home > Mailing lists > Public > public-i18n-its@w3.org > October to December 2005

[ESW Wiki] Update of "its0908LinguisticMarkup" by GoutamSaha

From: <w3t-archive+esw-wiki@w3.org>
Date: Mon, 19 Dec 2005 05:01:25 -0000
To: w3t-archive+esw-wiki@w3.org
Message-ID: <20051219050125.6470.37910@localhost.localdomain>
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "ESW Wiki" for change notification.

The following page has been changed by GoutamSaha:
http://esw.w3.org/topic/its0908LinguisticMarkup


------------------------------------------------------------------------------
  
  }}}
  
- In many languages like Hindi, Bangla etc, there are many '''Euphonic rules''' that unify words to form a new unified word. Spelling of the unified word is also get altered to some extent at the joining part. Often meaning of a unified word consists the meanings of its ingredient words to some extent. For example, in Bangla,Hindi etc, the unified word "Bidyalay" (school) = Bidya (education) + Alay (house), Rabeendra = Rabi + Indra,  and Debarshi = Deb + Rishi, etc. 
+ In many languages like Hindi, Bangla etc, there are many '''Euphonic (Sandhi) rules''' that unify words to form a new unified word. Spelling of the unified word is also get altered to some extent at the joining part. Often meaning of a unified word consists the meanings of its ingredient words. For example, in Bangla,Hindi etc, the unified word "Bidyalay" (school) = Bidya (education) + Alay (house), Rabeendra = Rabi + Indra,  and Debarshi = Deb + Rishi, etc. 
  Markups for '''Euphonic rules''' are stated below.
  
  {{{
@@ -782, +782 @@

  </pos_cat>
  
  
- }}} 
+ }}}
+ 
+ Again, In Bangla and Hindi etc, languages, there is usage of '''Compounding (or Samas) ''' two or more words into a single short unified word where the suffix (Vibhakti) of the first word is removed while compounding. Meaning of the compounded word may not keep intact the meanings of all the constituent words.  In Bahubrihi Samas, the compounded word may mean even a different third word only. '''COMPOUNDING Markups''' are shown as below.   
+ 
+ {{{
+ 
+ <!-- Compounding Markups -->
+ 
+ <pos_cat name="noun" meaning="compoundDwandwa_Bhai_and_Bon Brother and
+ 
+ Sister">Bhaibon
+ 
+ </pos_cat>
+ 
+ <pos_cat name="noun" meaning="compoundTatpurush_Raja_Theke_Bhoi Afraid of King">Rajbhoi
+ 
+ <!-- Raja (king) Theke (from) Bhoi (fear) -->
+ 
+ </pos_cat>
+ 
+ <pos_cat name="noun" meaning="compoundTatpurush_Raja_Theke_Bhoi Afraid of King">Rajbhoi
+ 
+ <!-- Raja (king) Theke (from) Bhoi (fear) -->
+ 
+ </pos_cat>
+ 
+ <pos_cat name="noun" meaning="compoundDwigu_Sapto_Aher_Samahar week">Saptaho
+ 
+ <!-- Sapto (seven) Aher (days') Samahar (collection) -->
+ 
+ </pos_cat>
+ 
+ <pos_cat name="noun" meaning="compoundBahubrihi_Su_Hridoi_Jahar good hearted">Suhrid
+ 
+ <!-- Su (good) Hridoi (heart) Jahar (whose); this indicates a man who is good-hearted -->
+ 
+ </pos_cat>
+ 
+ <pos_cat name="noun" meaning="compoundBahubrihi_Su_Arno_Jahar gold">Swarno
+ 
+ <!-- Su (good) Arno (colour) Jahar (whose); this indicates to gold only -->
+ 
+ </pos_cat>
+ 
+ }}}
+ 
+ 
+ 
  
  We can also provide metadata on a '''class''' of an object (e.g. for nounian words), which helps in meaningful translation and in preparing synset etc. This is to provide semantic (or some preliminary '''ontology''') metadata. For an example, autorickshaw is one kind of vehicle only.
  
Received on Monday, 19 December 2005 05:45:24 UTC

This archive was generated by hypermail 2.3.1 : Tuesday, 6 January 2015 20:43:06 UTC