- From: CVS User fsasaki <cvsmail@w3.org>
- Date: Sun, 22 Sep 2013 17:28:04 +0000
- To: public-multilingualweb-lt-commits@w3.org
Update of /w3ccvs/WWW/International/multilingualweb/lt/drafts/its20/TR-version In directory gil:/tmp/cvs-serv21615/TR-version Modified Files: Overview.html Log Message: publication date now 25 September - doing pub prep fixes --- /w3ccvs/WWW/International/multilingualweb/lt/drafts/its20/TR-version/Overview.html 2013/09/17 17:16:57 1.122 +++ /w3ccvs/WWW/International/multilingualweb/lt/drafts/its20/TR-version/Overview.html 2013/09/22 17:28:04 1.123 @@ -5,13 +5,13 @@ </style><link rel="stylesheet" href="local.css" type="text/css"/><link rel="stylesheet" type="text/css" href="http://www.w3.org/StyleSheets/TR/W3C-PR.css"/></head><body><div class="head"><p><a href="http://www.w3.org/"><img src="http://www.w3.org/Icons/w3c_home" alt="W3C" height="48" width="72"/></a></p> <h1><a name="title" id="title"></a>Internationalization Tag Set (ITS) Version 2.0</h1> -<h2><a name="w3c-doctype" id="w3c-doctype"></a>W3C Proposed Recommendation 19 September 2013</h2><dl><dt>This version:</dt><dd> - <a href="http://www.w3.org/TR/2013/PR-its20-20130919/"> - http://www.w3.org/TR/2013/PR-its20-20130919/</a> +<h2><a name="w3c-doctype" id="w3c-doctype"></a>W3C Proposed Recommendation 24 September 2013</h2><dl><dt>This version:</dt><dd> + <a href="http://www.w3.org/TR/2013/PR-its20-20130924/"> + http://www.w3.org/TR/2013/PR-its20-20130924/</a> </dd><dt>Latest version:</dt><dd> <a href="http://www.w3.org/TR/its20/">http://www.w3.org/TR/its20/</a> </dd><dt>Previous version:</dt><dd><a href="http://www.w3.org/TR/2013/WD-its20-20130820/"> - http://www.w3.org/TR/2013/WD-its20-20130820/</a></dd><dt>Editors:</dt><dd>David Filip, University of Limerick</dd><dd>Shaun McCance, Invited Expert</dd><dd>Dave Lewis, TCD</dd><dd>Christian Lieske, SAP AG</dd><dd>Arle Lommel, DFKI</dd><dd>Jirka Kosek, UEP</dd><dd>Felix Sasaki, DFKI / W3C Fellow</dd><dd>Yves Savourel, ENLASO</dd></dl><p>This document is also available in these non-normative formats: <a href="its20.odd">ODD/XML document</a>, <a href="itstagset20.zip">self-contained zipped archive</a>, and <a href="diffs/diff-wd20130919-wd20130820.html">XHTML Diff markup to previous publication + http://www.w3.org/TR/2013/WD-its20-20130820/</a></dd><dt>Editors:</dt><dd>David Filip, University of Limerick</dd><dd>Shaun McCance, Invited Expert</dd><dd>Dave Lewis, TCD</dd><dd>Christian Lieske, SAP AG</dd><dd>Arle Lommel, DFKI</dd><dd>Jirka Kosek, UEP</dd><dd>Felix Sasaki, DFKI / W3C Fellow</dd><dd>Yves Savourel, ENLASO</dd></dl><p>This document is also available in these non-normative formats: <a href="its20.odd">ODD/XML document</a>, <a href="itstagset20.zip">self-contained zipped archive</a>, and <a href="diffs/diff-wd20130924-wd20130820.html">XHTML Diff markup to previous publication 2013-08-20</a>.</p><p class="copyright"><a href="http://www.w3.org/Consortium/Legal/ipr-notice#Copyright">Copyright</a> © 2013 <a href="http://www.w3.org/"><acronym title="World Wide Web Consortium">W3C</acronym></a><sup>®</sup> (<a href="http://www.csail.mit.edu/"><acronym title="Massachusetts Institute of Technology">MIT</acronym></a>, <a href="http://www.ercim.eu/"><acronym title="European Research Consortium for Informatics and Mathematics">ERCIM</acronym></a>, <a href="http://www.keio.ac.jp/">Keio</a>, <a href="http://ev.buaa.edu.cn/">Beihang</a>), All Rights Reserved. W3C <a href="http://www.w3.org/Consortium/Legal/ipr-notice#Legal_Disclaimer">liability</a>, <a href="http://www.w3.org/Consortium/Legal/ipr-notice#W3C_Trademarks">trademark</a> and <a href="http://www.w3.org/Consortium/Legal/copyright-documents">document use</a> rules apply.</p></div><hr/><div> <h2><a name="abstract" id="abstract"></a>Abstract</h2><p>The technology described in this document “<em>Internationalization Tag Set (ITS) 2.0</em>“ enhances the foundation to integrate automated processing of human language @@ -36,7 +36,7 @@ document to Recommendation status (see <a href="http://www.w3.org/2004/02/Process-20040205/tr.html#maturity-levels">W3C document maturity levels</a>).</p><p>The ITS 2.0 specification has a normative dependency on the HTML5 specification: it relies on the <a href="http://www.w3.org/TR/2013/CR-html5-20130806/dom.html#the-translate-attribute">HTML5 Translate attribute</a>. By publishing this ITS 2.0 Proposed Recommendation, W3C expects that the functionality specified in ITS 2.0 will not be affected by changes to HTML5 as HTML5 proceeds to Recommendation.</p><p> The W3C Membership and other interested parties are invited to review the document and send comments to <a href="mailto:public-multilingualweb-lt-comments@w3.org">public-multilingualweb-lt-comments@w3.org</a>. Use "Comment on ITS 2.0 specification WD" in the subject line of your email. The <a href="http://lists.w3.org/Archives/Public/public-multilingualweb-lt-comments/">archives - for this list</a> are publicly available. Advisory Committee Representatives should consult their <a href="https://www.w3.org/2002/09/wbs/myQuestionnaires">WBS questionnaires</a>. The deadline for review and comments is 15 October 2013. See also <a href="https://www.w3.org/International/multilingualweb/lt/track/issues/">issues discussed + for this list</a> are publicly available. Advisory Committee Representatives should consult their <a href="https://www.w3.org/2002/09/wbs/myQuestionnaires">WBS questionnaires</a>. The deadline for review and comments is 22 October 2013. See also <a href="https://www.w3.org/International/multilingualweb/lt/track/issues/">issues discussed within the Working Group</a> and the <a href="#changelog-since-20130820">list of changes since the previous publication</a>.</p><p>Publication as a Proposed Recommendation does not imply endorsement by the W3C Membership. This is a draft document and may be updated, replaced or obsoleted by other documents at any time. It is inappropriate to cite this document as other than work in progress.</p><p>This document was produced by a group operating under the <a href="http://www.w3.org/Consortium/Patent-Policy-20040205/">5 February 2004 W3C Patent Policy</a>. W3C maintains a <a href="http://www.w3.org/2004/01/pp-impl/53116/status">public list of any patent disclosures</a> made in connection with the deliverables of the group; that page also includes instructions for disclosing a patent. An individual who has actual knowledge of a patent which the individual believes contains <a href="http://www.w3.org/Consortium/Patent-Policy-20040205/#def-essential">Essential Claim(s)</a> must disclose the information in accordance with <a href="http://www.w3.org/Consortium/Patent-Policy-2004205/#sec-Disclosure">section 6 of the W3C Patent Policy</a>. </p></div><div class="toc"> <h2><a name="contents" id="contents"></a>Table of Contents</h2><div class="toc"><div class="toc1">1 <a href="#introduction">Introduction</a><div class="toc2">1.1 <a href="#overview">Overview</a></div> @@ -1784,7 +1784,7 @@ </p></li></ul></div><div class="div3"> <h4><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents."/></a><a name="within-text-implementation" id="within-text-implementation"></a>8.7.2 Implementation</h4><p>The <a href="#elements-within-text">Elements Within Text</a> data category can be expressed with global rules, or locally on an individual element. There is no - inheritance.</p><p>For XML: The default is that elements are not within text.</p><p id="html5-withintext-handling">For HTML: The default is that elements are not within text, with the following exceptions:</p><ul><li><p>For the elements that are part of the <a href="http://www.w3.org/TR/2012/CR-html5-20130806/dom.html#phrasing-content-1">HTML5 phrasing content</a> the + inheritance.</p><p>For XML: The default is that elements are not within text.</p><p id="html5-withintext-handling">For HTML: The default is that elements are not within text, with the following exceptions:</p><ul><li><p>For the elements that are part of the <a href="http://www.w3.org/TR/2013/CR-html5-20130806/dom.html#phrasing-content-1">HTML5 phrasing content</a> the default is <code>withinText="yes"</code>, with the following exceptions:</p><ul><li><p>For the elements <code class="its-elem-markup">iframe</code>, <code class="its-elem-markup">noscript</code>, <code class="its-elem-markup">script</code> and <code class="its-elem-markup">textarea</code> the default is <code>withinText="nested"</code>.</p></li></ul></li></ul><div class="exampleOuter"><div class="exampleHeader"><a name="EX-within-text-defaults-html5-1" id="EX-within-text-defaults-html5-1"></a>Example 46: Illustrates the defaults for the <a href="#elements-within-text">Elements Within Text</a> data category in HTML.</div><p>In this document the different flows of text are the following (brackets indicating inline or nested elements):<br/><code><br/> - "Elements within Text defaults for HTML5"<br/> - "The element p is not within text. But [the element em is]."<br/> @@ -2454,7 +2454,7 @@ <strong class="hl-tag" style="color: #000096"><body></strong> <strong class="hl-tag" style="color: #000096"><video</strong> <span class="hl-attribute" style="color: #F5844C">height</span>=<span class="hl-value" style="color: #993300">360</span> - <span class="hl-attribute" style="color: #F5844C">poster</span>=<span class="hl-value" style="color: #993300">video-image.png</span> + <span class="hl-attribute" style="color: #F5844C">poster</span>=<span class="hl-value" style="color: #993300">http://www.example.com/video-image.png</span> <span class="hl-attribute" style="color: #F5844C">src</span>=<span class="hl-value" style="color: #993300">http://www.example.com/video/v2.mp</span> <span class="hl-attribute" style="color: #F5844C">width</span>=<span class="hl-value" style="color: #993300">640</span><strong class="hl-tag" style="color: #000096">></strong> <strong class="hl-tag" style="color: #000096"><p></strong>If your browser doesn't support @@ -2779,7 +2779,7 @@ <span class="hl-attribute" style="color: #F5844C">its-loc-quality-issue-comment</span>=<span class="hl-value" style="color: #993300">"should be 'quality'"</span> <span class="hl-attribute" style="color: #F5844C">its-loc-quality-issue-profile-ref</span>=<span class="hl-value" style="color: #993300">grammar</span> <span class="hl-attribute" style="color: #F5844C">its-loc-quality-issue-severity</span>=<span class="hl-value" style="color: #993300">50</span> - <span class="hl-attribute" style="color: #F5844C">its-loc-quality-issue-type</span>=<span class="hl-value" style="color: #993300">spelling</span><strong class="hl-tag" style="color: #000096">></strong>qulaity<strong class="hl-tag" style="color: #000096"></span></strong> with his instrument, + <span class="hl-attribute" style="color: #F5844C">its-loc-quality-issue-type</span>=<span class="hl-value" style="color: #993300">misspelling</span><strong class="hl-tag" style="color: #000096">></strong>qulaity<strong class="hl-tag" style="color: #000096"></span></strong> with his instrument, a standard that would not only satisfy listeners but that would overcome all the flaws of traditional instruments.<strong class="hl-tag" style="color: #000096"></p></strong> <strong class="hl-tag" style="color: #000096"></body></strong> @@ -2935,9 +2935,11 @@ <strong class="hl-tag" style="color: #000096"><body</strong> <span class="hl-attribute" style="color: #F5844C">its-annotators-ref</span>=<span class="hl-value" style="color: #993300">"mt-confidence|file:///tools.xml#T1"</span><strong class="hl-tag" style="color: #000096">></strong> <strong class="hl-tag" style="color: #000096"><p></strong> <strong class="hl-tag" style="color: #000096"><img</strong> <span class="hl-attribute" style="color: #F5844C">src</span>=<span class="hl-value" style="color: #993300">"http://upload.wikimedia.org/wikipedia/commons/9/93/Trinity_College.jpg"</span> - <span class="hl-attribute" style="color: #F5844C">title</span>=<span class="hl-value" style="color: #993300">"Front gate of Trinity College Dublin"</span><strong class="hl-tag" style="color: #000096">/></strong> + <span class="hl-attribute" style="color: #F5844C">title</span>=<span class="hl-value" style="color: #993300">"Front gate of Trinity College Dublin"</span> + <span class="hl-attribute" style="color: #F5844C">alt</span>=<span class="hl-value" style="color: #993300">"alternative description"</span><strong class="hl-tag" style="color: #000096">/></strong> <strong class="hl-tag" style="color: #000096"><img</strong> <span class="hl-attribute" style="color: #F5844C">src</span>=<span class="hl-value" style="color: #993300">"http://upload.wikimedia.org/wikipedia/commons/c/cc/Molly_alone.jpg"</span> - <span class="hl-attribute" style="color: #F5844C">title</span>=<span class="hl-value" style="color: #993300">"A tart with a cart"</span><strong class="hl-tag" style="color: #000096">/></strong> + <span class="hl-attribute" style="color: #F5844C">title</span>=<span class="hl-value" style="color: #993300">"A tart with a cart"</span> + <span class="hl-attribute" style="color: #F5844C">alt</span>=<span class="hl-value" style="color: #993300">"alternative description"</span><strong class="hl-tag" style="color: #000096">/></strong> <strong class="hl-tag" style="color: #000096"></p></strong> <strong class="hl-tag" style="color: #000096"></body></strong> <strong class="hl-tag" style="color: #000096"></html></strong></pre></div><p>[Source file: <a href="examples/html5/EX-mtConfidence-global-html5-1.html">examples/html5/EX-mtConfidence-global-html5-1.html</a>]</p></div><p>Where the external ITS rules file is as shown:</p><div class="exampleOuter"><div class="exampleHeader"><a name="EX-mtconfidence-global-html5-1-external-rules" id="EX-mtconfidence-global-html5-1-external-rules"></a>Example 80: XML file with external rules references from an HTML file.</div><div class="exampleInner"><pre><span class="hl-directive" style="color: maroon"><?xml version="1.0" encoding="UTF-8"?></span> @@ -5616,8 +5618,7 @@ Version 1.0</cite></a>. W3C Recommendation 16 November 1999. Available at <a href="http://www.w3.org/TR/1999/REC-xslt-19991116"> http://www.w3.org/TR/1999/REC-xslt-19991116</a>. The latest version of <a href="http://www.w3.org/TR/xslt">XSLT 1.0</a> is available at http://www.w3.org/TR/xslt.</dd><dt class="label"><a name="xul" id="xul"/>XUL</dt><dd> - <a href="http://www.xulplanet.com/"><cite>exTensible User Interface Language</cite></a>. Available at <a href="http://www.xulplanet.com/"> - http://www.xulplanet.com/</a>.</dd></dl></div><div class="div1"> + <a href="https://developer.mozilla.org/en-US/docs/XUL"><cite>exTensible User Interface Language</cite></a>. Available at <a href="https://developer.mozilla.org/en-US/docs/XUL">https://developer.mozilla.org/en-US/docs/XUL</a>.</dd></dl></div><div class="div1"> <h2><a href="#contents"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents."/></a><a name="conversion-to-nif" id="conversion-to-nif"></a>F Conversion to NIF</h2><p> <em>This section is informative.</em> </p><p>This section provides an informative algorithm to convert XML or HTML documents (or their DOM @@ -5629,9 +5630,13 @@ predicates. A normalized example is given below. The whitespace normalization algorithm itself is format dependent (for example, it differs for HTML compared to general XML).</p></div><div class="note"><p class="prefix"><b>Note:</b></p><p id="its-rdf-ontology-status">The output of the algorithm shown below uses the ITS RDF ontology <a title="ITS RDF Ontology" href="#its-rdf-ontology">[ITS RDF]</a> and its namespace<br/><a href="http://www.w3.org/2005/11/its/rdf#">http://www.w3.org/2005/11/its/rdf#</a> - <br/>Like the algorithm, this ontology is not a normative part of the ITS 2.0 specification and is being discussed in the <a href="http://www.w3.org/International/its/wiki/ITS-RDF_mapping">ITS Interest Group</a>.</p></div><div class="exampleOuter"><div class="exampleHeader"><a name="EX-HTML-whitespace-normalization" id="EX-HTML-whitespace-normalization"></a>Example 97: Example (see <a href="examples/html5/EX-HTML-whitespace-normalization.html">source code</a>) of an HTML document with whitespace character normalization as preparation for the conversion to NIF</div><div class="exampleInner"><pre><strong class="hl-tag" style="color: #000096"><html></strong><strong class="hl-tag" style="color: #000096"><body></strong><strong class="hl-tag" style="color: #000096"><h2</strong> <span class="hl-attribute" style="color: #F5844C">translate</span>=<span class="hl-value" style="color: #993300">"yes"</span><strong class="hl-tag" style="color: #000096">></strong>Welcome to <strong clas="hl-tag" style="color: #000096"><span</strong> - <span class="hl-attribute" style="color: #F5844C">its-ta-ident-ref</span>=<span class="hl-value" style="color: #993300">"http://dbpedia.org/resource/Dublin"</span> <span class="hl-attribute" style="color: #F5844C">its-within-text</span>=<span class="hl-value" style="color: #993300">"yes"</span> - <span class="hl-attribute" style="color: #F5844C">translate</span>=<span class="hl-value" style="color: #993300">"no"</span><strong class="hl-tag" style="color: #000096">></strong>Dublin<strong class="hl-tag" style="color: #000096"></span></strong> in <strong class="hl-tag" style="color: #000096"><b</strong> <span class="hl-attribute" style="color: #F5844C">translate</span>=<span class="hl-value" style="color: #993300">"no"</span> <span class="hl-attribute" style="color: #F5844C">its-within-text</span>=<span class="hl-value" style="color: #993300">"yes"</span><strong class="hl-tag" style="color: #000096">></strong>Ireland<strong class="hl-tag" style="color: #000096"></b></strong>!<strong class="hl-tag" style="color: #000096"></h2></strong><strong class="hl-tag" style="color: #000096"></body></strong><strong class="hl-tag" style="color: #000096"></html></strong></pre></div></div><p id="its2nif-algorithm">The conversion algorithm to generate NIF consists of seven + <br/>Like the algorithm, this ontology is not a normative part of the ITS 2.0 specification and is being discussed in the <a href="http://www.w3.org/International/its/wiki/ITS-RDF_mapping">ITS Interest Group</a>.</p></div><div class="exampleOuter"><div class="exampleHeader"><a name="EX-HTML-whitespace-normalization" id="EX-HTML-whitespace-normalization"></a>Example 97: Example (see <a href="examples/html5/EX-HTML-whitespace-normalization.html">source code</a>) of an HTML document with whitespace character normalization as preparation for the conversion to NIF. Note that text nodes in the <code>head</code> element are not taken into account.</div><div class="exampleInner"><pre><strong class="hl-tag" style="color: blue"><!DOCTYPE html></strong><strong class="hl-tag" style="color: #000096"><html</strong> <span class="hl-attribute" style="color: #F5844C">xmlns</span>=<span class="hl-value" style="color: #993300">"http://www.w3.org/1999/xhtml"</span><strong class="hl-tag" style="color: 000096">></strong> +<strong class="hl-tag" style="color: #000096"><head></strong><strong class="hl-tag" style="color: #000096"><meta</strong> <span class="hl-attribute" style="color: #F5844C">http-equiv</span>=<span class="hl-value" style="color: #993300">"Content-Type"</span> <span class="hl-attribute" style="color: #F5844C">content</span>=<span class="hl-value" style="color: #993300">"text/html;charset=utf-8"</span><strong class="hl-tag" style="color: #000096"> ></strong> +<strong class="hl-tag" style="color: #000096"><title></strong>NIF conversion example<strong class="hl-tag" style="color: #000096"></title></strong><strong class="hl-tag" style="color: #000096"></head></strong> +<strong class="hl-tag" style="color: #000096"><body></strong><strong class="hl-tag" style="color: #000096"><h2</strong> <span class="hl-attribute" style="color: #F5844C">translate</span>=<span class="hl-value" style="color: #993300">"yes"</span><strong class="hl-tag" style="color: #000096">></strong>Welcome to <strong class="hl-tag" style="color: #000096"><span</strong> + <span class="hl-attribute" style="color: #F5844C">its-ta-ident-ref</span>=<span class="hl-value" style="color: #993300">"http://dbpedia.org/resource/Dublin"</span> <span class="hl-attribute" style="color: #F5844C">its-within-text</span>=<span class="hl-value" style="color: #993300">"yes"</span> + <span class="hl-attribute" style="color: #F5844C">translate</span>=<span class="hl-value" style="color: #993300">"no"</span><strong class="hl-tag" style="color: #000096">></strong>Dublin<strong class="hl-tag" style="color: #000096"></span></strong> in <strong class="hl-tag" style="color: #000096"><b</strong> <span class="hl-attribute" style="color: #F5844C">translate</span>=<span class="hl-value" style="color: #993300">"no"</span> <span class="hl-attribute" style="color: #F5844C">its-within-text</span>=<span class="hl-value" style="color: #993300">"yes"</span><strong class="hl-tag" style="color: #000096">></strong>Ireland<strong class="hl-tag" style="color: #000096"></b></strong>!<strong class="hl-tag" style="color: #000096"></h2></strong><strong class="hl-tag" style="color: #000096"></body></strong><strong class="hl-tag" style="color: #000096"></html></strong> +</pre></div></div><p id="its2nif-algorithm">The conversion algorithm to generate NIF consists of seven steps:</p><ul><li><p id="its2nif-algorithm-step1">STEP 1: Get an ordered list of all text nodes of the document.</p></li><li><p id="its2nif-algorithm-step2">STEP 2: Generate an XPath expression for each non-empty text node of all leaf elements and memorize them.</p></li><li><p id="its2nif-algorithm-step3">STEP 3: Get the text for each text node and make a tuple with the corresponding XPath expression (X,T). Since the text nodes have a certain order we now have a list of ordered tuples ((x0,t0), (x1,t1), ..., (xn,tn)).</p></li><li><p id="its2nif-algorithm-step4">STEP 4 (optional): Serialize as XML or as RDF.
Received on Sunday, 22 September 2013 17:28:06 UTC