W3C home > Mailing lists > Public > public-i18n-its@w3.org > October to December 2005

[ESW Wiki] Update of "its0908LinguisticMarkup" by GoutamSaha

From: <w3t-archive+esw-wiki@w3.org>
Date: Fri, 14 Oct 2005 00:17:24 -0000
To: w3t-archive+esw-wiki@w3.org
Message-ID: <20051014001724.10247.59696@localhost.localdomain>
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "ESW Wiki" for change notification.

The following page has been changed by GoutamSaha:
http://esw.w3.org/topic/its0908LinguisticMarkup


------------------------------------------------------------------------------
  '''[AZ- This is a tremendous piece of work by Goutam. It is also an area that I am very interested in
  and would like to assist Goutam in developing this topic when we are ready to deliberate it in detail.
  Well done!.]]'''
+ 
+ [GS- It will be nice if AZ, MJD, FS or anyone share their knowledge on this recently proposed scheme.]]
  
  '''[FS'''- Maybe it was not clear in the minutes of the ITS f2f at ERCIM in September, but that was what we decided to do. Goutam, I hope that I understood you correct that you would agree on what Martin formulated - that we solve the current simple (they are hard enough) problems of ITS first and come back to linguistic markup later.''']]'''
  
@@ -293, +295 @@

  '''Punctuation:'''
  (a) ''Comma:'' ,  (b) ''Sentence Final:'' . ! ? |  (c) ''Quote:'' ' "
  (d) ''Left Parenthesis:'' ( [ { <  (e) ''Right Parenthesis:'' ) ] } >
- (f) Mid-Sentence Pinctuation:'' : ; -  (g) ''Others:'' + - % ^ & * / \
+ (f) Mid-Sentence Pinctuation:'' : ; -  (g) ''Others:'' + - % ^ & * / \ @ $
+ 
+ 
+ '''These markups are also useful for a content author to add ''disambiguation'' related metadata information in order to disambiguate a text / PCDATA in between "<" and ">"  from element tags''' . For an example, for the text say: ''Readers may refer to work in <Saha2005> for more information''. 
+ Please note that though '''<Saha2005>''' looks identical to an element tag but it is not intended to mean it as an element tag. Rather, it is meant for readers' references only. How to convey such disambiguation information to an XML Parser ?  Solution to this problem is to markup the text in the following way in order to denote that '''<Saha2005>''' is not meant for an element tag.
+ 
+ {{{
+ 
+ <!-- Markup to Disambiguate between an element-tag and a text/PCDATA in between "<" and ">" -->
+   
+ Readers may refer to work in 
+ <pos_cat name="punctuation" type="left_parenthesis"> < </pos_cat> Saha2005 
+ <pos_cat name="punctuation" type="right_parenthesis"> > </pos_cat> > 
+ for more information.
+ 
+ }}}
+   
  
  
  =='''Understanding Sentence-Level Markups:-'''==
Received on Friday, 14 October 2005 10:09:47 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Tuesday, 8 January 2008 14:12:45 GMT