- From: <w3t-archive+esw-wiki@w3.org>
- Date: Sun, 11 Dec 2005 13:36:18 -0000
- To: w3t-archive+esw-wiki@w3.org
Dear Wiki user, You have subscribed to a wiki page or wiki category on "ESW Wiki" for change notification. The following page has been changed by GoutamSaha: http://esw.w3.org/topic/its0908LinguisticMarkup ------------------------------------------------------------------------------ {{{ - In this 3-Tier or 3-Layer XML Schema, the 1st Layer Schema for the Content Domain is intended to add metadata regarding the context of a paragraph or of whole text of a web content. This content_domain metadata helps the translation process to understand the context easily and as a result, an appropriate terminology can be sought out from the lexicon/dictionary at the localizer's end. Again, '''Content Domain Metadata''' guides also the localization process on content writing style. For an exaple, a sport content should be translated or localized as a sport content only using sport terminologies; it should not be translated or localized using funeral terminology that might result in a funeral content. + In this 3-Tier or 3-Layer XML Schema, the 1st Layer Schema for the Content Domain is intended to add metadata regarding the context of a paragraph or of whole text of a web content. This content_domain metadata helps the translation process to understand the context easily and as a result, an appropriate terminology (e.g., out of the synonyms for a word in source language) can be sought out from the lexicon/dictionary at the localizer's end. Again, '''Content Domain Metadata''' guides also the localization process on content writing style. For an exaple, a sport content should be translated or localized as a sport content only using sport terminologies; it should not be translated or localized using funeral terminology that might result in a funeral content. The 2nd Layer Schema or the Sentence Level Schema takes care of the sentences in a paragraph or in a full text for a particular content domain of web content. Such markup on sentence level is intended to provide much useful syntactic and semantic metadata regarding a sentence. Metadata on Sentence Syntax / Formation (for example, simple, complex and compound etc) and the metadata on Sentence Semantics (for example, demonstrative, interrogative or taunting etc) are of great importance to a translator. A sentence's Syntactic metadata eases and quickens both lexical and systax analysis during the parsing of a source language sentence. A sentence's Semantic Metadata is very much useful in both semantic analysis and in generation of more meaningful sentence in a target (human) language. @@ -665, +665 @@ For example: - (a) "Paul goes" takes the form "Paul Jata Hai" (Hindi); here, the subject- Paul is masculine gender, third person and singular number, verb-form is "Jata Hai. " + (a) "Paul goes" takes the form "Paul Jata Hai" (Hindi); here, the subject- Paul is of masculine gender, third person and singular number, verb-form is "Jata Hai. " (b) "Ami goes" takes the form "Ami Jatee Hai" (Hindi); here, the subject- Ami is feminine gender, third person and singular number, verb-form is "Jatee Hai." @@ -682, +682 @@ (g) "You go" will be "Tum Jate Ho" (Hindi); here, "you" is meant to a male (second person, singular number). - (h) "You go" will be "Tum Jatee Ho" (Hindi); here, "you" is meant to a female (second + (h) "You go" will be "Tum Jatee Ho" (Hindi); here, "you" refers to a female (second person, singular number). So, we need to add such '''Morphological''' metadata for better internationalization & localization (e.g., localizing English content to Hindi). @@ -709, +709 @@ <pos_cat name="verb">go</pos_cat> - <!-- Masculine Gender 3rd Person Plural Number --> + <!-- Here, the word "they" refers to boys, Masculine Gender 3rd Person Plural Number --> <pos_cat name="pronoun" meaning="female_per3_numN"> They</pos_cat> <pos_cat name="verb">go</pos_cat> - <!-- Feminine Gender 3rd Person Plural Number --> + <!-- Here, the word "they" refers to gorls, Feminine Gender 3rd Person Plural Number --> <pos_cat name="pronoun" meaning="male_per2_num1">You</pos_cat> @@ -2042, +2042 @@ }}} ==================================== + {{{ References: + 1. (Book) R.P. Sinha, Current English Grammar and Usage, Oxford University Press, 2003. + 2. (Book)Bamandeb Chakraborty, Uchchotoro Bangla Byakaran. + 3. http://www.w3.org/TR/its + 4. Goutam Kumar Saha, "A Novel 3-Tier XML Schematic Approach for Web Page Translation," ACM Ubiquity, Vol 6(43), November 2005, ACM Press, USA. + 5. Goutam Kumar Saha, "The EB-ANUBAD Translator: A Hybrid Scheme," International Journal of Zhejiang University Science, Vol 6A(10), October, 2005, RPC. + 6. Goutam Kumar Saha, "The E2B Machine Translation: A New Approach to HLT," ACM Ubiquity, Vol 6(32), August, 2005, ACM Press, USA. + + }}} + 7. http://www.w3.org/TR/itsreq/
Received on Sunday, 11 December 2005 15:51:24 UTC