W3C home > Mailing lists > Public > public-i18n-its@w3.org > October to December 2005

[ESW Wiki] Update of "its0908LinguisticMarkup" by GoutamSaha

From: <w3t-archive+esw-wiki@w3.org>
Date: Sat, 15 Oct 2005 01:26:01 -0000
To: w3t-archive+esw-wiki@w3.org
Message-ID: <20051015012601.5297.24397@localhost.localdomain>
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "ESW Wiki" for change notification.

The following page has been changed by GoutamSaha:
http://esw.w3.org/topic/its0908LinguisticMarkup


------------------------------------------------------------------------------
            " Dumurer (english meaning is Fig's)  Fool (english meaning is Flower)"
            we can markup as
  
+ {{{
+ <!-- Markup to handle phrases and idioma -->
+ 
- {{{ <sentence_cat name="phrases_idioms" meaning="rarely visible">
+ <sentence_cat name="phrases_idioms" meaning="rarely visible">
+ 
             Dumurer Fool
+ 
- </sentence_cat> }}} 
+ </sentence_cat> 
+ 
+ }}} 
            
            '''Such metadata is very useful as a semantic markup to a localization process,
            irrespective of a target language.''' 
  
            '''Again, for a link inside an XML PCDATA/ text content, we might differentiate
               links from the text by the following markup to treat them separately, 
+              for an example,        __Click Here for Sign Up__
-              for an example,
-              ""Click Here for Sign Up",
               
+ {{{
+ <!-- Markup to handle a Link -->
+ 
- {{{<sentence_cat name="link"> __Click Here for Sign Up__
+ <sentence_cat name="link"> __Click Here for Sign Up__
+ 
- </sentence_cat>}}}
+ </sentence_cat>
+ }}}
  
-            or, for the link-word say, "Here", we might markup in the following way:
+ or, for the link-word say, __Here__, we might markup in the following way:
            
  {{{
+ <!-- Markup for a Link Word -->
+ 
  <pos_cat name="link"> __Here__ 
+ 
- </pos_cat> }}} 
+ </pos_cat>
+ 
+ }}} 
  
  For the following Bengali or Bangla ''dialect'' sentence "Kaam (Kaaj in Bangla or Work in english)  Saira Falo (Shesh Koro in Bangla or Complete in english)," we should markup the text with the ''three-layer metadata information'' in the following way:
  
@@ -120, +136 @@

  For example: 
  {{{
  <content_domain name="factory">
+ 
  Paul works in a factory. 
+ 
  There are many electromechanical machines in this factory.
+ 
  ...sentence_i
+ 
     Give  <pos_cat name="noun" type="material"> oil </pos_cat> .
+ 
  ...sentence_k
+ 
  </content_domain>
+ 
  =====
+ 
  <content_domain name="office">
+ 
     Paul works in a government office.  He advises a new employee. 
+ 
    <pos_cat name="verb" type="joining" meaning="please"> Give oil </pos_cat> 
+ 
      to your senior.
+ 
-   <!--such information via attributes aim to provide semantic as well as language
+ <!--such information via attributes aim to provide semantic as well as language
-       specific transformation constructs to a translator-->
+ specific transformation constructs to a translator-->
+ 
  ...sentence_m
+ 
  </content_domain>
+ 
  }}}
  
  =='''Background Knowledge for Word-Level Parts-of-Speech Markups''':-==
@@ -164, +195 @@

  {{{
  
  <sentence_cat name="assertive"> 
+ 
-  <!-- sentence level metadata is optional here -->
+ <!-- sentence level metadata is optional here -->
+ 
+ Would you like to dance with her ? 
+ 
  Yes, I would love <pos_cat name="verb" type="anaphoric"> to </pos_cat>
+ 
  </sentence_cat> 
  
  }}}
@@ -346, +382 @@

  
  }}}
  
- An example is given below how to add metadata information to skip translation process for a sentence in a cultural content's chanting part. 
+ An example is given below how to add metadata information to skip translation process for a sentence in a '''Cultural''' content's chanting part. 
  
  {{{
  
@@ -359, +395 @@

  
  <!-- above sentences need to be translated-->
  
- <sentence_cat name="chant" type="no_translate">
+ <sentence_cat name="chant" type="no_translation">
  
  Om Ganeshaioh Namoh.
  
Received on Saturday, 15 October 2005 11:22:34 UTC

This archive was generated by hypermail 2.3.1 : Tuesday, 6 January 2015 20:43:06 UTC