W3C home > Mailing lists > Public > public-i18n-its@w3.org > July to September 2005

[ESW Wiki] Update of "its0505WordCount" by AndrzejZydron

From: <w3t-archive+esw-wiki@w3.org>
Date: Wed, 06 Jul 2005 06:44:48 -0000
To: w3t-archive+esw-wiki@w3.org
Message-ID: <20050706064448.32513.49049@localhost.localdomain>
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "ESW Wiki" for change notification.

The following page has been changed by AndrzejZydron:
http://esw.w3.org/topic/its0505WordCount


------------------------------------------------------------------------------
  
  Author: Andrzej Zydron
  
- = Requirements for Metrics =
+ = I18N Metrics =
+ 
+ === Summary ===
+ 
+ We tend to use term 'word counts' to encompass all metrics to do with the i18n process. This is not an accurate definition though. A better term is ''metrics'' as for some scripts character counts are the only currently available metrics and the term 'word counts' ignores all other metrics such as page, file, line, screen counts.
+ 
+ Metrics are an important aspect of sizing the i18n task for a given XML document. Metrics encompass word and character counts as well as other relevant counts (pages, lines, screens). There are two aspects here. The first is to decide what will be counted (e.g. ''Localization Directives''). The second is the actual ''syntax'' of the count - The format of the metrics information (e.g. ''namespace syntax'').
+ 
+ === Challenge/issue ===
+ 
+ From the W3C ITS perspective metrics need to be governed by an indication of what aspects of a document (translatable attributes, PCDATA) need to be counted, as well as translated. This presupposes the existence of some form of ''Localization Directives'' definition document that would detail this information, plus additional ''translation/count'' directives within the document itself:
+ 
+ {{{
+ <para>This is a section of java code :
+ <programlisting>
+   // this string is never used
+   String sqlConnect = "connect / as sysdba";
+   String sqlSelect = "select name from mytable where name=\"Tim\"";
+   System.out.println("<message translate="yes">But of course, you should translate this string !</message>");
+ </programlisting>
+ }}}
+ 
+ 
+ == Notes ==
+ One of the most enduring features of the GILT (Globalization, Internationalization, Localization and Translation) industry has been the inconsistency of word counts, not only between rival products, but even between different versions of the same product. 
+ Word Counts are the traditional metric for i18n tasks, but they do not provide in themselves a valid unit
+ of measure. For Logographic scripts such as Chinese, Japanese and Korean word counts are very difficult if
+ not impossible to establish. A better term is Globalization Industry Metrics, or GILT Metrics. Volume is also
+ not a absolute indicator of the size of an i18n task - complexity and quality are other important ingredients.
+ 
+ The issue of GILT Metrics is currently being addressed by the proposed LISA OSCAR tripartite GILT Metrics standards which encompass volume (GMX/V), complexity (GMX/C) and quality (GMX/Q) - http://www.lisa.org/oscar/gmx/.

+ 
+ == Previous comments ==
+ 
  Metrics are an important aspect of sizing the i18n task for a given XML document. Metrics encompass word and character counts as well as other relevant counts (pages, lines, screens). There are two aspects here. The first is to decide what will be counted (e.g. ''Localization Directives''). The second is the actual ''syntax'' of the count - The fomat of the metrics information (e.g. ''namespace syntax'').
  
  '''[AZ-''' Just a quick note. We tend to use term 'word counts' to encompass all metrics to do with the i18n process. This is not an accurate definition though. A better term is ''metrics'' as for some scripts character counts are the only currently available metrics and the term 'word counts' ignores all other metrics such as page, file, line, screen counts.
@@ -63, +96 @@

  
  == Challenge/Issue ==
  
- One of the most enduring features of the GILT (Globalization, Internationalization, Localization and Translation) industry has been the inconsistency of word counts, not only between rival products, but even between different versions of the same product.
+ One of the most enduring features of the GILT (Globalization, Internationalization, Localization and Translation) industry has been the inconsistency of word counts, not only between rival products, but even between different versions of the same product. 
  
  Word Counts are the traditional metric for i18n tasks, but they do not provide in themselves a valid unit
  of measure. For Logographic scripts such as Chinese, Japanese and Korean word counts are very difficult if
  not impossible to establish. A better term is Globalization Industry Metrics, or GILT Metrics. Volume is also
  not a absolute indicator of the size of an i18n task - complexity and quality are other important ingredients.
  
- == Notes ==
- 
- The issue of GILT Metrics is currently being addressed by the proposed LISA OSCAR tripartite GILT Metrics standards which encompass volume (GMX/V), complexity (GMX/C) and quality (GMX/Q) - http://www.lisa.org/oscar/gmx/.

- 
Received on Wednesday, 6 July 2005 08:45:52 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Tuesday, 8 January 2008 14:12:45 GMT