W3C home > Mailing lists > Public > public-i18n-its@w3.org > October to December 2005

[ESW Wiki] Update of "its0908LinguisticMarkup" by GoutamSaha

From: <w3t-archive+esw-wiki@w3.org>
Date: Sun, 11 Dec 2005 13:36:18 -0000
To: w3t-archive+esw-wiki@w3.org
Message-ID: <20051211133618.20553.98235@localhost.localdomain>
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "ESW Wiki" for change notification.

The following page has been changed by GoutamSaha:
http://esw.w3.org/topic/its0908LinguisticMarkup


------------------------------------------------------------------------------
  
  {{{
  
- In this 3-Tier or 3-Layer XML Schema, the 1st Layer Schema for the Content Domain is intended to add metadata regarding the context of a paragraph or of whole text of a web content. This content_domain metadata helps the translation process to understand the context easily and as a result, an appropriate terminology can be sought out from the lexicon/dictionary at the localizer's end. Again,  '''Content Domain Metadata''' guides also the localization process on content writing style. For an exaple, a sport content should be translated or localized as a sport content only using sport terminologies; it should not be translated or localized using funeral terminology that might result in a funeral content.
+ In this 3-Tier or 3-Layer XML Schema, the 1st Layer Schema for the Content Domain is intended to add metadata regarding the context of a paragraph or of whole text of a web content. This content_domain metadata helps the translation process to understand the context easily and as a result, an appropriate terminology (e.g., out of the synonyms for a word in source language) can be sought out from the lexicon/dictionary at the localizer's end. Again,  '''Content Domain Metadata''' guides also the localization process on content writing style. For an exaple, a sport content should be translated or localized as a sport content only using sport terminologies; it should not be translated or localized using funeral terminology that might result in a funeral content.
  
  The 2nd Layer Schema or the Sentence Level Schema takes care of the sentences in a paragraph or in a full text for a particular content domain of web content. Such markup on sentence level is intended to provide much useful syntactic and semantic metadata regarding a sentence.   Metadata on Sentence Syntax / Formation (for example, simple, complex and compound etc) and the metadata on Sentence Semantics (for example, demonstrative, interrogative or taunting etc) are of great importance  to a translator. A sentence's Syntactic metadata eases and quickens both lexical and systax analysis during the parsing of a source language sentence. A sentence's Semantic Metadata is very much useful in both semantic analysis and in generation of more meaningful sentence in a target (human) language. 
  
@@ -665, +665 @@

  
  For example:
  
- (a) "Paul goes" takes the form "Paul Jata Hai" (Hindi); here, the subject- Paul is masculine gender, third person and singular number, verb-form is "Jata Hai. " 
+ (a) "Paul goes" takes the form "Paul Jata Hai" (Hindi); here, the subject- Paul is of masculine gender, third person and singular number, verb-form is "Jata Hai. " 
  
  (b) "Ami goes" takes the form "Ami Jatee Hai" (Hindi); here, the subject- Ami is feminine gender, third person and singular number, verb-form is "Jatee Hai." 
  
@@ -682, +682 @@

  (g) "You go" will be "Tum Jate Ho" (Hindi); here, "you" is meant to a male (second 
  person, singular number).
  
- (h) "You go" will be "Tum Jatee Ho" (Hindi); here, "you" is meant to a female (second
+ (h) "You go" will be "Tum Jatee Ho" (Hindi); here, "you" refers to a female (second
  person, singular number).   
  
  So, we need to add such '''Morphological''' metadata for better internationalization & localization (e.g., localizing English content to Hindi).
@@ -709, +709 @@

  
  <pos_cat name="verb">go</pos_cat>
  
- <!-- Masculine Gender 3rd Person Plural Number -->
+ <!-- Here, the word "they" refers to boys, Masculine Gender 3rd Person Plural Number -->
  
  
  <pos_cat name="pronoun" meaning="female_per3_numN"> They</pos_cat>
  
  <pos_cat name="verb">go</pos_cat>
  
- <!-- Feminine Gender 3rd Person Plural Number -->
+ <!-- Here, the word "they" refers to gorls, Feminine Gender 3rd Person Plural Number -->
  
  
  <pos_cat name="pronoun" meaning="male_per2_num1">You</pos_cat>
@@ -2042, +2042 @@

  }}}
  
  ====================================
+ {{{
  
  References:
+ 
  1. (Book) R.P. Sinha, Current English Grammar and Usage, Oxford University Press, 2003.
+ 
  2. (Book)Bamandeb Chakraborty, Uchchotoro Bangla Byakaran.
+ 
  3.  http://www.w3.org/TR/its 
+ 
  4. Goutam Kumar Saha, "A Novel 3-Tier XML Schematic Approach for Web Page Translation," ACM Ubiquity, Vol 6(43), November 2005, ACM Press, USA.
+ 
  5. Goutam Kumar Saha, "The EB-ANUBAD Translator: A Hybrid Scheme," International Journal of Zhejiang University Science, Vol 6A(10), October, 2005, RPC.
+ 
  6. Goutam Kumar Saha, "The E2B Machine Translation: A New Approach to HLT," ACM Ubiquity, Vol 6(32), August, 2005, ACM Press, USA.
+ 
+ }}}
+ 
  7. http://www.w3.org/TR/itsreq/ 
  
Received on Sunday, 11 December 2005 15:51:24 UTC

This archive was generated by hypermail 2.3.1 : Tuesday, 6 January 2015 20:43:06 UTC