W3C home > Mailing lists > Public > public-i18n-its@w3.org > October to December 2005

[ESW Wiki] Update of "its0908LinguisticMarkup" by GoutamSaha

From: <w3t-archive+esw-wiki@w3.org>
Date: Mon, 03 Oct 2005 20:15:08 -0000
To: w3t-archive+esw-wiki@w3.org
Message-ID: <20051003201508.31452.89527@localhost.localdomain>
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "ESW Wiki" for change notification.

The following page has been changed by GoutamSaha:
http://esw.w3.org/topic/its0908LinguisticMarkup


------------------------------------------------------------------------------
  [R023] To improve the translation process, it should be easy to take advantage of the capability of XML to embed linguistic-related metadata information in the structure of a document.
  
  '''[YS- This is just a first try to get things rolling. Goutam, Andrzej, anyone, please, to comments/add/etc.]]'''
+ 
+          {{{
+          The proposed scheme is to demonstrate how to embed syntactic, 
+          semantic and computational linguistic related metadata information
+          in the structure of an XML document towards better translation through 
+          useful markups meant for both the internationalization & 
+          localization processes.
+          }}}
      
     [[GS- The 3-Tier XML Schema approach is useful for an XML content
           author to embed a source human language specific metadata
@@ -43, +51 @@

  
            For example: for the '''Phrases_Idioms "cats and dogs" in english''' we can markup as
  
-           <sentence_cat name="phrases_idioms" meaning="heavily"> cats and dogs
+          {{{ <sentence_cat name="phrases_idioms" meaning="heavily"> cats and dogs
-           </sentence_cat> 
+           </sentence_cat> }}}
  
            Such metadata will be of an immense help to localazation process (in order
            to find an appropriate phrases & idioms in a target language) without
@@ -54, +62 @@

            " Dumurer (english meaning is Fig's)  Fool (english meaning is Flower)"
            we can markup as
  
-           <sentence_cat name="phrases_idioms" meaning="rarely visible">
+           {{{ <sentence_cat name="phrases_idioms" meaning="rarely visible">
             Dumurer Fool
-           </sentence_cat> 
+           </sentence_cat> }}} 
            
            '''Such metadata is very useful as a semantic markup to a localization process,
            irrespective of a target language.''' 
@@ -66, +74 @@

               for an example,
               ""Click Here for Sign Up",
               
-              '''<sentence_cat name="link"> Click Here for Sign Up
+             {{{ '''<sentence_cat name="link"> Click Here for Sign Up
-              </sentence_cat>'''
+              </sentence_cat>'''}}}
  
             or, for the link-word say, "Here", we might markup in the following way:
            
-            '''<pos_cat name="link"> Here </pos_cat>'''  ]] 
+            {{{'''<pos_cat name="link"> Here </pos_cat>''' }}} ]] 
  
  '''For the following Bengali or Bangla dialect sentence'''  
  "Kaam (Kaaj in Bangla or Work in english)  Saira Falo (Shesh Koro in
  Bangla or Complete in english)," 
  we should markup the text with the '''three-layer metadata information''' in the following way:
- 
+ {{{
  <text xml:lang="ben">
  <content_domain name="dialect">
  <!-- content domain metadata -->
@@ -92, +100 @@

  ......
  </content_domain>
  </text>
- 
+ }}}
     
  '''Metadata information about the domain, sentence type or specific words
  will help translators to do better quality work or to do the work quickly.'''
@@ -102, +110 @@

  without such information. 
   
  For example: 
+ {{{
  <content_domain name="factory">
  Paul works in a factory. 
  There are many electromechanical machines in this factory.
@@ -118, +127 @@

        specific transformation constructs to a translator-->
  ...sentence_m
  </content_domain>
- 
+ }}}
  
  '''Background Knowledge for Word-Level Parts-of-Speech Markups:-'''
  
@@ -129, +138 @@

  A verb that has an object is called transitive verb. Intransitive verb has no object. Noun
  verb is derived from a noun. Imperative verb is to denote an order, request or command.
  
- '''A noun is a naming word'''. Proper Noun names a specific people or place or 
+ ''A noun is a naming word.'' Proper Noun names a specific people or place or 
  thing (e.g. Goutam, Kolkata, India).  Common Noun is refers to a class of objects 
  or a concept as opposed to a particular individual (e.g. boy, cow). Collective 
  Noun is a  noun that denotes a group of individuals (e.g. army, assembly, family).
Received on Tuesday, 4 October 2005 05:12:38 UTC

This archive was generated by hypermail 2.3.1 : Tuesday, 6 January 2015 20:43:06 UTC