W3C home > Mailing lists > Public > public-multilingualweb-lt-commits@w3.org > April 2013

CVS WWW/International/multilingualweb/lt/drafts/its20

From: CVS User fsasaki <cvsmail@w3.org>
Date: Wed, 10 Apr 2013 15:13:58 +0000
Message-Id: <E1UPwik-00005i-5M@gil.w3.org>
To: public-multilingualweb-lt-commits@w3.org
Update of /w3ccvs/WWW/International/multilingualweb/lt/drafts/its20
In directory gil:/tmp/cvs-serv32710

Modified Files:
	its20.html its20.odd 
Log Message:
publication preparation

--- /w3ccvs/WWW/International/multilingualweb/lt/drafts/its20/its20.html	2013/04/09 20:57:08	1.385
+++ /w3ccvs/WWW/International/multilingualweb/lt/drafts/its20/its20.html	2013/04/10 15:13:57	1.386
@@ -1140,20 +1140,18 @@
               process and its corresponding external rules file.</p></li></ul></div><div class="div2">
 <h3><a href="#contents" shape="rect"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents."/></a><a name="conversion-to-nif" id="conversion-to-nif" shape="rect"/>5.7 Conversion to NIF</h3><p>This section defines an algorithm to convert XML or HTML documents (or their DOM
             representations) that contain ITS metadata to the RDF-based format based on <a title="" href="#NIF" shape="rect">[NIF]</a>. The conversion results in RDF triples.</p><div class="note"><p class="prefix"><b>Note:</b></p><p>The algorithm is intended to extract the text from the XML/HTML/DOM for an NLP
-              tool and can produce a lot of "<span class="quote">phantom</span>" predicates from excessive
+              tool. It can produce a lot of "<span class="quote">phantom</span>" predicates from excessive
               whitespace, which 1) increases the size of the intermediate mapping and 2) extracts
-              this whitespace as text. This might decrease NLP performance. It is recommended to
+              this whitespace as text, and therefore might decrease NLP performance. It is strongly recommended to
               normalize whitespace in the input XML/HTML/DOM in order to minimize such phantom
-              predicates. A normalized example is given below. The whitespace normalization
-              algorithm itself is format dependent, e.g. it differs for HTML compared to general
-              XML. Hence no normative algorithm for whitespace normalization is given as part of
-              this specification.</p></div><div class="exampleOuter"><div class="exampleHeader"><a name="EX-HTML-whitespace-normalization" id="EX-HTML-whitespace-normalization" shape="rect"/>Example 25: Example of an HTML document with whitespace normalized as preparation for
-              conversion to NIF</div><div class="exampleInner"><pre xml:space="preserve"><strong class="hl-tag" style="color: #000096">&lt;html&gt;</strong><strong class="hl-tag" style="color: #000096">&lt;body&gt;</strong><strong class="hl-tag" style="color: #000096">&lt;h2</strong> <span class="hl-attribute" style="color: #F5844C">translate</span>=<span class="hl-value" style="color: #993300">"yes"</span><strong class="hl-tag" style="color: #000096">&gt;</strong>Welcome to <strong class="hl-tag" style="color: #000096">&lt;span</strong> 
+              predicates. A normalized example is given below. Since the whitespace normalization
+              algorithm itself is format dependent, for example, it differs for HTML compared to general
+              XML, no normative algorithm for whitespace normalization is given as part of
+              this specification.</p></div><div class="exampleOuter"><div class="exampleHeader"><a name="EX-HTML-whitespace-normalization" id="EX-HTML-whitespace-normalization" shape="rect"/>Example 25: Example of an HTML document with whitespace character normalization as preparation for the conversion to NIF</div><div class="exampleInner"><pre xml:space="preserve"><strong class="hl-tag" style="color: #000096">&lt;html&gt;</strong><strong class="hl-tag" style="color: #000096">&lt;body&gt;</strong><strong class="hl-tag" style="color: #000096">&lt;h2</strong> <span class="hl-attribute" style="color: #F5844C">translate</span>=<span class="hl-value" style="color: #993300">"yes"</span><strong class="hl-tag" style="color: #000096">&gt;</strong>Welcome to <strong class="hl-tag" style="color: #000096">&lt;span</strong> 
    <span class="hl-attribute" style="color: #F5844C">its-ta-ident-ref</span>=<span class="hl-value" style="color: #993300">"http://dbpedia.org/resource/Dublin"</span> <span class="hl-attribute" style="color: #F5844C">its-within-text</span>=<span class="hl-value" style="color: #993300">"yes"</span>
    <span class="hl-attribute" style="color: #F5844C">translate</span>=<span class="hl-value" style="color: #993300">"no"</span><strong class="hl-tag" style="color: #000096">&gt;</strong>Dublin<strong class="hl-tag" style="color: #000096">&lt;/span&gt;</strong> in <strong class="hl-tag" style="color: #000096">&lt;b</strong> <span class="hl-attribute" style="color: #F5844C">translate</span>=<span class="hl-value" style="color: #993300">"no"</span> <span class="hl-attribute" style="color: #F5844C">its-within-text</span>=<span class="hl-value" style="color: #993300">"yes"</span><strong class="hl-tag" style="color: #000096">&gt;</strong>Ireland<strong class="hl-tag" style="color: #000096">&lt;/b&gt;</strong>!<strong class="hl-tag" style="color: #000096">&lt;/h2&gt;</strong><strong class="hl-tag" style="color: #000096">&lt;/body&gt;</strong><strong class="hl-tag" style="color: #000096">&lt;/html&gt;</strong></pre></div></div><p id="its2nif-algorithm">The conversion algorithm to generate NIF consists of seven
             steps.</p><ul><li><p id="its2nif-algorithm-step1">STEP 1: Get an ordered list of all text nodes
-                of the document.</p></li><li><p id="its2nif-algorithm-step2">STEP 2: Generate an XPath expression for each
-                non-empty text node of all leaf elements and remember them.</p></li><li><p id="its2nif-algorithm-step3">STEP 3: Get the text for each node and make a
+                of the document.</p></li><li><p id="its2nif-algorithm-step2">STEP 2: Generate an XPath expression for each non-empty text node of all leaf elements and memorize them.</p></li><li><p id="its2nif-algorithm-step3">STEP 3: Get the text for each node and make a
                 tuple with the XPath expressions (X,T). Since the text nodes have a certain order we
                 now have a list of ordered tuples ((x0,t0), (x1,t1), ..., (xn,tn)).</p></li><li><p id="its2nif-algorithm-step4">STEP 4 (optional): Serialize as XML or as RDF.
                 The list with the XPath-to-text mapping can also be kept in memory. Part of a
@@ -1204,31 +1202,27 @@
   &lt;mapping x="xpath(/html/body[1]/h2[1]/text()[3])" b="28" e="29" /&gt;
   &lt;mapping x="xpath(/html/body[1]/h2[1])" b="0" e="29" /&gt;
 &lt;/mappings&gt;</pre></div></div><ul><li><p id="its2nif-algorithm-step5">STEP 5: Create a context URI and attach the
-                whole concatenated text of the document as reference.</p></li><li><p id="its2nif-algorithm-step6">STEP 6: Now attach any ITS metadata items from
-                the XML/HTML/DOM input to respective NIF URIs.</p></li><li><p id="its2nif-algorithm-step7">STEP 7: Omit all irrelevant URIs (those that
-                do not carry annotations, they will just bloat the data).</p></li></ul><div class="exampleInner"><div class="exampleOuter"><pre xml:space="preserve">@prefix itsrdf: &lt;http://www.w3.org/2005/11/its/rdf#&gt; .
+                whole concatenated text of the document as reference.</p></li><li><p id="its2nif-algorithm-step6">STEP 6: Attach any ITS metadata annotations from the XML/HTML/DOM input to the respective NIF URIs.</p></li><li><p id="its2nif-algorithm-step7">STEP 7: Omit all URIs that do not carry annotations (they will just bloat the data).</p></li></ul><div class="exampleInner"><div class="exampleOuter"><pre xml:space="preserve">@prefix itsrdf: &lt;http://www.w3.org/2005/11/its/rdf#&gt; .
+@prefix nif: &lt;http://persistence.uni-leipzig.org/nlp2rdf/ontologies/nif-core#&gt;
 &lt;http://example.com/exampledoc.html#char=0,29&gt;
-    rdf:type             str:Context ;
-    rdf:type             str:OffsetBasedString ;
+    rdf:type             nif:Context ;
+    rdf:type             nif:RFC5147String ;
 # concatenate the whole text
-    str:isString         "$(t0+t1+t2+...+tn)" ; 
+    nif:isString         "$(t0+t1+t2+...+tn)" ; 
     itsrdf:translate     "yes";
-    str:occursIn      &lt;http://example.com/exampledoc.html&gt; .
+    nif:occursIn      &lt;http://example.com/exampledoc.html&gt; .
 &lt;http://example.com/exampledoc.html#char=11,17&gt; 
-    rdf:type              str:String ;
-    rdf:type              str:OffsetBasedString ;
+    rdf:type              nif:RFC5147String;
+    rdf:type              nif:RFC5147String ;
     itsrdf:translate     "no";
     itsrdf:taIdentRef  &lt;http://dbpedia.org/resource/Dublin&gt; ;
-    str:referenceContext &lt;http://example.com/exampledoc.html#char=0,29&gt; .
+    nif:referenceContext &lt;http://example.com/exampledoc.html#char=0,29&gt; .
 &lt;http://example.com/exampledoc.html#char=21,28&gt; 
-    rdf:type              str:String ;
-    rdf:type              str:OffsetBasedString ;
+    rdf:type              nif:RFC5147String;
+    rdf:type              nif:RFC5147String ;
     itsrdf:translate     "no";
-    str:referenceContext &lt;http://example.com/exampledoc.html#char=0,29&gt; .
-</pre></div></div><p>A complete sample output in RDF/XML format after step 7, given the input document <a href="#EX-HTML-whitespace-normalization" shape="rect">Example 25</a>, is available at <a href="examples/nif/EX-nif-conversion-output.xml" shape="rect">examples/nif/EX-nif-conversion-output.xml</a>.</p><span class="editor-note">[Ed. note: above example needs to be updated with regards to "text analysis" output.]</span><div class="note"><p class="prefix"><b>Note:</b></p><p>The conversion to NIF is the basis for natural language processing (NLP)
-              applications, creating for example named entity annotations. A non-normative algorithm
-              to integrate these annotations into the original input document is given in <a class="section-ref" href="#nif-backconversion" shape="rect">Appendix F: Conversion NIF2ITS</a>. The algorithm in that appendix is
-              non-normative since many choices depend on the actual NLP application.</p></div></div><div class="div2">
+    nif:referenceContext &lt;http://example.com/exampledoc.html#char=0,29&gt; .
+</pre></div></div><p>A complete sample output in RDF/XML format after step 7, given the input document <a href="#EX-HTML-whitespace-normalization" shape="rect">Example 25</a>, is available at <a href="examples/nif/EX-nif-conversion-output.xml" shape="rect">examples/nif/EX-nif-conversion-output.xml</a>.</p><div class="note"><p class="prefix"><b>Note:</b></p><p>The conversion to NIF is a possible basis for a natural language processing (NLP) application that creates, for example, named entity annotations. A non-normative algorithm to integrate these annotations into the original input document is given in <a class="section-ref" href="#nif-backconversion" shape="rect">Appendix F: Conversion NIF2ITS</a>. This algorithm is non-normative because many decisions depend on the actually employed NLP application.</p></div></div><div class="div2">
 <h3><a href="#contents" shape="rect"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents."/></a><a name="its-tool-annotation" id="its-tool-annotation" shape="rect"/>5.8 ITS Tools Annotation</h3><p>In some cases, it may be important for instances of data categories to be associated
             with information about the processor that generated them. For example, the score of the
               <a href="#mtconfidence" shape="rect">MT Confidence</a> data category (provided via the
@@ -1296,7 +1290,7 @@
                 <code>p</code> element is found in the element with <code>id="T2"</code> in the same
               document. In addition, <code class="its-attr-markup">annotatorsRef</code> is used to identify a Web resource
               with information about the QA tool used to generate the <a href="#lqissue" shape="rect">Localization Quality Issue</a> annotation in the document.</p><div class="exampleInner"><pre xml:space="preserve"><strong class="hl-tag" style="color: #000096">&lt;doc</strong> <span class="hl-attribute" style="color: #F5844C">its:version</span>=<span class="hl-value" style="color: #993300">"2.0"</span>
-    <span class="hl-attribute" style="color: #F5844C">its:annotatorsRef</span>=<span class="hl-value" style="color: #993300">"mt-confidence|file:///tools.xml#T1 lq-issue|http://www.qalsp-ex.com/qatools/transcheckv1.3"</span>
+    <span class="hl-attribute" style="color: #F5844C">its:annotatorsRef</span>=<span class="hl-value" style="color: #993300">"mt-confidence|file:///tools.xml#T1 localization-quality-issue|http://www.qalsp-ex.com/qatools/transcheckv1.3"</span>
     <span class="hl-attribute" style="color: #F5844C">xmlns:its</span>=<span class="hl-value" style="color: #993300">"http://www.w3.org/2005/11/its"</span><strong class="hl-tag" style="color: #000096">&gt;</strong>
     <strong class="hl-tag" style="color: #000096">&lt;p</strong> <span class="hl-attribute" style="color: #F5844C">its:mtConfidence</span>=<span class="hl-value" style="color: #993300">"0.78"</span><strong class="hl-tag" style="color: #000096">&gt;</strong>Text translated with tool T1<strong class="hl-tag" style="color: #000096">&lt;/p&gt;</strong>
     <strong class="hl-tag" style="color: #000096">&lt;p</strong> <span class="hl-attribute" style="color: #F5844C">its:mtConfidence</span>=<span class="hl-value" style="color: #993300">"0.55"</span> <span class="hl-attribute" style="color: #F5844C">its:locQualityIssueType</span>=<span class="hl-value" style="color: #993300">"typographical"</span>
@@ -3375,8 +3369,7 @@
             http://www.w3.org/TR/html401.</dd><dt class="label"><a name="html5" id="html5" shape="rect"/>HTML5</dt><dd>Robin Berjon et al. <a href="http://www.w3.org/TR/2012/CR-html5-20121217/" shape="rect"><cite>HTML5</cite></a>. W3C Candidate Recommendation 17 December 2012. Available at <a href="http://www.w3.org/TR/2012/CR-html5-20121217/" shape="rect">
               http://www.w3.org/TR/1999/REC-html401-19991224</a>. The latest version of <a href="http://www.w3.org/TR/html5/" shape="rect">HTML5</a> is available at
             http://www.w3.org/TR/html5/.</dd><dt class="label"><a name="ianacharsets" id="ianacharsets" shape="rect"/>IANA Character Sets</dt><dd>
-               <a href="http://www.iana.org/assignments/character-sets" shape="rect"><cite>Character Sets</cite></a> Available at <a href="http://www.iana.org/assignments/character-sets" shape="rect">http://www.iana.org/assignments/character-sets</a>.</dd><dt class="label"><a name="NIF" id="NIF" shape="rect"/>NIF</dt><dd>NIF Placeholder at
-            http://persistence.uni-leipzig.org/nlp2rdf/</dd><dt class="label"><a name="qa-framework" id="qa-framework" shape="rect"/>QAFRAMEWORK</dt><dd>Karl Dubost, Lynne Rosental, Dominique
+               <a href="http://www.iana.org/assignments/character-sets" shape="rect"><cite>Character Sets</cite></a> Available at <a href="http://www.iana.org/assignments/character-sets" shape="rect">http://www.iana.org/assignments/character-sets</a>.</dd><dt class="label"><a name="NIF" id="NIF" shape="rect"/>NIF</dt><dd>Hellmann, S.  (ed.). <a href="http://persistence.uni-leipzig.org/nlp2rdf/ontologies/nif-core" shape="rect">NIF 2.0 Core Ontology Version 1.0</a>, version April 2013. Available at http://persistence.uni-leipzig.org/nlp2rdf/ontologies/nif-core under CC-BY 2.0  license. </dd><dt class="label"><a name="qa-framework" id="qa-framework" shape="rect"/>QAFRAMEWORK</dt><dd>Karl Dubost, Lynne Rosental, Dominique
             Hazaël-Massieux, Lofton Henderson. <a href="http://www.w3.org/TR/2005/REC-qaframe-spec-20050817/" shape="rect"><cite>QA Framework:
                 Specification Guidelines</cite></a>. W3C Recommendation 17 August 2005. Available at <a href="http://www.w3.org/TR/2005/REC-qaframe-spec-20050817/" shape="rect">
               http://www.w3.org/TR/2005/REC-qaframe-spec-20050817/</a>. The latest version of <a href="http://www.w3.org/TR/qaframe-spec/" shape="rect">QAFRAMEWORK</a> is available at
@@ -4347,19 +4340,23 @@
         <strong class="hl-tag" style="color: #000096">&lt;ref</strong> <span class="hl-attribute" style="color: #F5844C">name</span>=<span class="hl-value" style="color: #993300">"its-attribute.withinText"</span><strong class="hl-tag" style="color: #000096">/&gt;</strong>
       <strong class="hl-tag" style="color: #000096">&lt;/optional&gt;</strong>
       <strong class="hl-tag" style="color: #000096">&lt;optional&gt;</strong>
-        <strong class="hl-tag" style="color: #000096">&lt;ref</strong> <span class="hl-attribute" style="color: #F5844C">name</span>=<span class="hl-value" style="color: #993300">"its-attribute.taConfidence"</span><strong class="hl-tag" style="color: #000096">/&gt;</strong>
-      <strong class="hl-tag" style="color: #000096">&lt;/optional&gt;</strong>
-      <strong class="hl-tag" style="color: #000096">&lt;optional&gt;</strong>
-        <strong class="hl-tag" style="color: #000096">&lt;ref</strong> <span class="hl-attribute" style="color: #F5844C">name</span>=<span class="hl-value" style="color: #993300">"its-attribute.taClassRef"</span><strong class="hl-tag" style="color: #000096">/&gt;</strong>
-      <strong class="hl-tag" style="color: #000096">&lt;/optional&gt;</strong>
-      <strong class="hl-tag" style="color: #000096">&lt;optional&gt;</strong>
-        <strong class="hl-tag" style="color: #000096">&lt;choice&gt;</strong>
-          <strong class="hl-tag" style="color: #000096">&lt;group&gt;</strong>
-            <strong class="hl-tag" style="color: #000096">&lt;ref</strong> <span class="hl-attribute" style="color: #F5844C">name</span>=<span class="hl-value" style="color: #993300">"its-attribute.taSource"</span><strong class="hl-tag" style="color: #000096">/&gt;</strong>
-            <strong class="hl-tag" style="color: #000096">&lt;ref</strong> <span class="hl-attribute" style="color: #F5844C">name</span>=<span class="hl-value" style="color: #993300">"its-attribute.taIdent"</span><strong class="hl-tag" style="color: #000096">/&gt;</strong>
-          <strong class="hl-tag" style="color: #000096">&lt;/group&gt;</strong>
-          <strong class="hl-tag" style="color: #000096">&lt;ref</strong> <span class="hl-attribute" style="color: #F5844C">name</span>=<span class="hl-value" style="color: #993300">"its-attribute.taIdentRef"</span><strong class="hl-tag" style="color: #000096">/&gt;</strong>
-        <strong class="hl-tag" style="color: #000096">&lt;/choice&gt;</strong>
+        <strong class="hl-tag" style="color: #000096">&lt;interleave&gt;</strong>
+          <strong class="hl-tag" style="color: #000096">&lt;optional&gt;</strong>
+            <strong class="hl-tag" style="color: #000096">&lt;ref</strong> <span class="hl-attribute" style="color: #F5844C">name</span>=<span class="hl-value" style="color: #993300">"its-attribute.taConfidence"</span><strong class="hl-tag" style="color: #000096">/&gt;</strong>
+          <strong class="hl-tag" style="color: #000096">&lt;/optional&gt;</strong>
+          <strong class="hl-tag" style="color: #000096">&lt;oneOrMore&gt;</strong>
+            <strong class="hl-tag" style="color: #000096">&lt;choice&gt;</strong>
+              <strong class="hl-tag" style="color: #000096">&lt;ref</strong> <span class="hl-attribute" style="color: #F5844C">name</span>=<span class="hl-value" style="color: #993300">"its-attribute.taClassRef"</span><strong class="hl-tag" style="color: #000096">/&gt;</strong>
+              <strong class="hl-tag" style="color: #000096">&lt;choice&gt;</strong>
+                <strong class="hl-tag" style="color: #000096">&lt;group&gt;</strong>
+                  <strong class="hl-tag" style="color: #000096">&lt;ref</strong> <span class="hl-attribute" style="color: #F5844C">name</span>=<span class="hl-value" style="color: #993300">"its-attribute.taSource"</span><strong class="hl-tag" style="color: #000096">/&gt;</strong>
+                  <strong class="hl-tag" style="color: #000096">&lt;ref</strong> <span class="hl-attribute" style="color: #F5844C">name</span>=<span class="hl-value" style="color: #993300">"its-attribute.taIdent"</span><strong class="hl-tag" style="color: #000096">/&gt;</strong>
+                <strong class="hl-tag" style="color: #000096">&lt;/group&gt;</strong>
+                <strong class="hl-tag" style="color: #000096">&lt;ref</strong> <span class="hl-attribute" style="color: #F5844C">name</span>=<span class="hl-value" style="color: #993300">"its-attribute.taIdentRef"</span><strong class="hl-tag" style="color: #000096">/&gt;</strong>
+              <strong class="hl-tag" style="color: #000096">&lt;/choice&gt;</strong>
+            <strong class="hl-tag" style="color: #000096">&lt;/choice&gt;</strong>
+          <strong class="hl-tag" style="color: #000096">&lt;/oneOrMore&gt;</strong>
+        <strong class="hl-tag" style="color: #000096">&lt;/interleave&gt;</strong>
       <strong class="hl-tag" style="color: #000096">&lt;/optional&gt;</strong>
       <strong class="hl-tag" style="color: #000096">&lt;optional&gt;</strong>
         <strong class="hl-tag" style="color: #000096">&lt;ref</strong> <span class="hl-attribute" style="color: #F5844C">name</span>=<span class="hl-value" style="color: #993300">"its-attribute.localeFilterList"</span><strong class="hl-tag" style="color: #000096">/&gt;</strong>
@@ -4417,12 +4414,12 @@
         <strong class="hl-tag" style="color: #000096">&lt;choice&gt;</strong>
           <strong class="hl-tag" style="color: #000096">&lt;ref</strong> <span class="hl-attribute" style="color: #F5844C">name</span>=<span class="hl-value" style="color: #993300">"its-attribute.locQualityIssuesRef"</span><strong class="hl-tag" style="color: #000096">/&gt;</strong>
           <strong class="hl-tag" style="color: #000096">&lt;interleave&gt;</strong>
-            <strong class="hl-tag" style="color: #000096">&lt;optional&gt;</strong>
-              <strong class="hl-tag" style="color: #000096">&lt;ref</strong> <span class="hl-attribute" style="color: #F5844C">name</span>=<span class="hl-value" style="color: #993300">"its-attribute.locQualityIssueType"</span><strong class="hl-tag" style="color: #000096">/&gt;</strong>
-            <strong class="hl-tag" style="color: #000096">&lt;/optional&gt;</strong>
-            <strong class="hl-tag" style="color: #000096">&lt;optional&gt;</strong>
-              <strong class="hl-tag" style="color: #000096">&lt;ref</strong> <span class="hl-attribute" style="color: #F5844C">name</span>=<span class="hl-value" style="color: #993300">"its-attribute.locQualityIssueComment"</span><strong class="hl-tag" style="color: #000096">/&gt;</strong>
-            <strong class="hl-tag" style="color: #000096">&lt;/optional&gt;</strong>
+            <strong class="hl-tag" style="color: #000096">&lt;oneOrMore&gt;</strong>
+              <strong class="hl-tag" style="color: #000096">&lt;choice&gt;</strong>
+                <strong class="hl-tag" style="color: #000096">&lt;ref</strong> <span class="hl-attribute" style="color: #F5844C">name</span>=<span class="hl-value" style="color: #993300">"its-attribute.locQualityIssueType"</span><strong class="hl-tag" style="color: #000096">/&gt;</strong>
+                <strong class="hl-tag" style="color: #000096">&lt;ref</strong> <span class="hl-attribute" style="color: #F5844C">name</span>=<span class="hl-value" style="color: #993300">"its-attribute.locQualityIssueComment"</span><strong class="hl-tag" style="color: #000096">/&gt;</strong>
+              <strong class="hl-tag" style="color: #000096">&lt;/choice&gt;</strong>
+            <strong class="hl-tag" style="color: #000096">&lt;/oneOrMore&gt;</strong>
             <strong class="hl-tag" style="color: #000096">&lt;optional&gt;</strong>
               <strong class="hl-tag" style="color: #000096">&lt;ref</strong> <span class="hl-attribute" style="color: #F5844C">name</span>=<span class="hl-value" style="color: #993300">"its-attribute.locQualityIssueSeverity"</span><strong class="hl-tag" style="color: #000096">/&gt;</strong>
             <strong class="hl-tag" style="color: #000096">&lt;/optional&gt;</strong>
@@ -4504,19 +4501,23 @@
         <strong class="hl-tag" style="color: #000096">&lt;ref</strong> <span class="hl-attribute" style="color: #F5844C">name</span>=<span class="hl-value" style="color: #993300">"its-attribute.withinText.nons"</span><strong class="hl-tag" style="color: #000096">/&gt;</strong>
       <strong class="hl-tag" style="color: #000096">&lt;/optional&gt;</strong>
       <strong class="hl-tag" style="color: #000096">&lt;optional&gt;</strong>
-        <strong class="hl-tag" style="color: #000096">&lt;ref</strong> <span class="hl-attribute" style="color: #F5844C">name</span>=<span class="hl-value" style="color: #993300">"its-attribute.taConfidence.nons"</span><strong class="hl-tag" style="color: #000096">/&gt;</strong>
-      <strong class="hl-tag" style="color: #000096">&lt;/optional&gt;</strong>
-      <strong class="hl-tag" style="color: #000096">&lt;optional&gt;</strong>
-        <strong class="hl-tag" style="color: #000096">&lt;ref</strong> <span class="hl-attribute" style="color: #F5844C">name</span>=<span class="hl-value" style="color: #993300">"its-attribute.taClassRef.nons"</span><strong class="hl-tag" style="color: #000096">/&gt;</strong>
-      <strong class="hl-tag" style="color: #000096">&lt;/optional&gt;</strong>
-      <strong class="hl-tag" style="color: #000096">&lt;optional&gt;</strong>
-        <strong class="hl-tag" style="color: #000096">&lt;choice&gt;</strong>
-          <strong class="hl-tag" style="color: #000096">&lt;group&gt;</strong>
-            <strong class="hl-tag" style="color: #000096">&lt;ref</strong> <span class="hl-attribute" style="color: #F5844C">name</span>=<span class="hl-value" style="color: #993300">"its-attribute.taSource.nons"</span><strong class="hl-tag" style="color: #000096">/&gt;</strong>
-            <strong class="hl-tag" style="color: #000096">&lt;ref</strong> <span class="hl-attribute" style="color: #F5844C">name</span>=<span class="hl-value" style="color: #993300">"its-attribute.taIdent.nons"</span><strong class="hl-tag" style="color: #000096">/&gt;</strong>
-          <strong class="hl-tag" style="color: #000096">&lt;/group&gt;</strong>
-          <strong class="hl-tag" style="color: #000096">&lt;ref</strong> <span class="hl-attribute" style="color: #F5844C">name</span>=<span class="hl-value" style="color: #993300">"its-attribute.taIdentRef.nons"</span><strong class="hl-tag" style="color: #000096">/&gt;</strong>
-        <strong class="hl-tag" style="color: #000096">&lt;/choice&gt;</strong>
+        <strong class="hl-tag" style="color: #000096">&lt;interleave&gt;</strong>
+          <strong class="hl-tag" style="color: #000096">&lt;optional&gt;</strong>
+            <strong class="hl-tag" style="color: #000096">&lt;ref</strong> <span class="hl-attribute" style="color: #F5844C">name</span>=<span class="hl-value" style="color: #993300">"its-attribute.taConfidence.nons"</span><strong class="hl-tag" style="color: #000096">/&gt;</strong>
+          <strong class="hl-tag" style="color: #000096">&lt;/optional&gt;</strong>
+          <strong class="hl-tag" style="color: #000096">&lt;oneOrMore&gt;</strong>
+            <strong class="hl-tag" style="color: #000096">&lt;choice&gt;</strong>
+              <strong class="hl-tag" style="color: #000096">&lt;ref</strong> <span class="hl-attribute" style="color: #F5844C">name</span>=<span class="hl-value" style="color: #993300">"its-attribute.taClassRef.nons"</span><strong class="hl-tag" style="color: #000096">/&gt;</strong>
+              <strong class="hl-tag" style="color: #000096">&lt;choice&gt;</strong>
+                <strong class="hl-tag" style="color: #000096">&lt;group&gt;</strong>
+                  <strong class="hl-tag" style="color: #000096">&lt;ref</strong> <span class="hl-attribute" style="color: #F5844C">name</span>=<span class="hl-value" style="color: #993300">"its-attribute.taSource.nons"</span><strong class="hl-tag" style="color: #000096">/&gt;</strong>
+                  <strong class="hl-tag" style="color: #000096">&lt;ref</strong> <span class="hl-attribute" style="color: #F5844C">name</span>=<span class="hl-value" style="color: #993300">"its-attribute.taIdent.nons"</span><strong class="hl-tag" style="color: #000096">/&gt;</strong>
+                <strong class="hl-tag" style="color: #000096">&lt;/group&gt;</strong>
+                <strong class="hl-tag" style="color: #000096">&lt;ref</strong> <span class="hl-attribute" style="color: #F5844C">name</span>=<span class="hl-value" style="color: #993300">"its-attribute.taIdentRef.nons"</span><strong class="hl-tag" style="color: #000096">/&gt;</strong>
+              <strong class="hl-tag" style="color: #000096">&lt;/choice&gt;</strong>
+            <strong class="hl-tag" style="color: #000096">&lt;/choice&gt;</strong>
+          <strong class="hl-tag" style="color: #000096">&lt;/oneOrMore&gt;</strong>
+        <strong class="hl-tag" style="color: #000096">&lt;/interleave&gt;</strong>
       <strong class="hl-tag" style="color: #000096">&lt;/optional&gt;</strong>
       <strong class="hl-tag" style="color: #000096">&lt;optional&gt;</strong>
         <strong class="hl-tag" style="color: #000096">&lt;ref</strong> <span class="hl-attribute" style="color: #F5844C">name</span>=<span class="hl-value" style="color: #993300">"its-attribute.localeFilterList.nons"</span><strong class="hl-tag" style="color: #000096">/&gt;</strong>
@@ -4574,12 +4575,12 @@
         <strong class="hl-tag" style="color: #000096">&lt;choice&gt;</strong>
           <strong class="hl-tag" style="color: #000096">&lt;ref</strong> <span class="hl-attribute" style="color: #F5844C">name</span>=<span class="hl-value" style="color: #993300">"its-attribute.locQualityIssuesRef.nons"</span><strong class="hl-tag" style="color: #000096">/&gt;</strong>
           <strong class="hl-tag" style="color: #000096">&lt;interleave&gt;</strong>
-            <strong class="hl-tag" style="color: #000096">&lt;optional&gt;</strong>
-              <strong class="hl-tag" style="color: #000096">&lt;ref</strong> <span class="hl-attribute" style="color: #F5844C">name</span>=<span class="hl-value" style="color: #993300">"its-attribute.locQualityIssueType.nons"</span><strong class="hl-tag" style="color: #000096">/&gt;</strong>
-            <strong class="hl-tag" style="color: #000096">&lt;/optional&gt;</strong>
-            <strong class="hl-tag" style="color: #000096">&lt;optional&gt;</strong>
-              <strong class="hl-tag" style="color: #000096">&lt;ref</strong> <span class="hl-attribute" style="color: #F5844C">name</span>=<span class="hl-value" style="color: #993300">"its-attribute.locQualityIssueComment.nons"</span><strong class="hl-tag" style="color: #000096">/&gt;</strong>
-            <strong class="hl-tag" style="color: #000096">&lt;/optional&gt;</strong>
+            <strong class="hl-tag" style="color: #000096">&lt;oneOrMore&gt;</strong>
+              <strong class="hl-tag" style="color: #000096">&lt;choice&gt;</strong>
+                <strong class="hl-tag" style="color: #000096">&lt;ref</strong> <span class="hl-attribute" style="color: #F5844C">name</span>=<span class="hl-value" style="color: #993300">"its-attribute.locQualityIssueType.nons"</span><strong class="hl-tag" style="color: #000096">/&gt;</strong>
+                <strong class="hl-tag" style="color: #000096">&lt;ref</strong> <span class="hl-attribute" style="color: #F5844C">name</span>=<span class="hl-value" style="color: #993300">"its-attribute.locQualityIssueComment.nons"</span><strong class="hl-tag" style="color: #000096">/&gt;</strong>
+              <strong class="hl-tag" style="color: #000096">&lt;/choice&gt;</strong>
+            <strong class="hl-tag" style="color: #000096">&lt;/oneOrMore&gt;</strong>
             <strong class="hl-tag" style="color: #000096">&lt;optional&gt;</strong>
               <strong class="hl-tag" style="color: #000096">&lt;ref</strong> <span class="hl-attribute" style="color: #F5844C">name</span>=<span class="hl-value" style="color: #993300">"its-attribute.locQualityIssueSeverity.nons"</span><strong class="hl-tag" style="color: #000096">/&gt;</strong>
             <strong class="hl-tag" style="color: #000096">&lt;/optional&gt;</strong>
@@ -5099,14 +5100,12 @@
         <strong class="hl-tag" style="color: #000096">&lt;ref</strong> <span class="hl-attribute" style="color: #F5844C">name</span>=<span class="hl-value" style="color: #993300">"its-attribute.locQualityIssuesRefPointer.nons"</span><strong class="hl-tag" style="color: #000096">/&gt;</strong>
       <strong class="hl-tag" style="color: #000096">&lt;/choice&gt;</strong>
       <strong class="hl-tag" style="color: #000096">&lt;group&gt;</strong>
-        <strong class="hl-tag" style="color: #000096">&lt;interleave&gt;</strong>
-          <strong class="hl-tag" style="color: #000096">&lt;optional&gt;</strong>
+        <strong class="hl-tag" style="color: #000096">&lt;oneOrMore&gt;</strong>
+          <strong class="hl-tag" style="color: #000096">&lt;choice&gt;</strong>
             <strong class="hl-tag" style="color: #000096">&lt;ref</strong> <span class="hl-attribute" style="color: #F5844C">name</span>=<span class="hl-value" style="color: #993300">"its-attribute.locQualityIssueType.nons"</span><strong class="hl-tag" style="color: #000096">/&gt;</strong>
-          <strong class="hl-tag" style="color: #000096">&lt;/optional&gt;</strong>
-          <strong class="hl-tag" style="color: #000096">&lt;optional&gt;</strong>
             <strong class="hl-tag" style="color: #000096">&lt;ref</strong> <span class="hl-attribute" style="color: #F5844C">name</span>=<span class="hl-value" style="color: #993300">"its-attribute.locQualityIssueComment.nons"</span><strong class="hl-tag" style="color: #000096">/&gt;</strong>
-          <strong class="hl-tag" style="color: #000096">&lt;/optional&gt;</strong>
-        <strong class="hl-tag" style="color: #000096">&lt;/interleave&gt;</strong>
+          <strong class="hl-tag" style="color: #000096">&lt;/choice&gt;</strong>
+        <strong class="hl-tag" style="color: #000096">&lt;/oneOrMore&gt;</strong>
         <strong class="hl-tag" style="color: #000096">&lt;optional&gt;</strong>
           <strong class="hl-tag" style="color: #000096">&lt;ref</strong> <span class="hl-attribute" style="color: #F5844C">name</span>=<span class="hl-value" style="color: #993300">"its-attribute.locQualityIssueSeverity.nons"</span><strong class="hl-tag" style="color: #000096">/&gt;</strong>
         <strong class="hl-tag" style="color: #000096">&lt;/optional&gt;</strong>
@@ -5645,10 +5644,17 @@
     <strong class="hl-tag" style="color: #000096">&lt;rule</strong> <span class="hl-attribute" style="color: #F5844C">context</span>=<span class="hl-value" style="color: #993300">"*[@its:*]"</span><strong class="hl-tag" style="color: #000096">&gt;</strong>
       <strong class="hl-tag" style="color: #000096">&lt;assert</strong> <span class="hl-attribute" style="color: #F5844C">test</span>=<span class="hl-value" style="color: #993300">"ancestor-or-self::*/@its:version | //its:rules/@version"</span><strong class="hl-tag" style="color: #000096">&gt;</strong>The version is indicated by the ITS version attribute. This attribute is mandatory for the rules element, where it MUST be in no namespace. If there is no rules element in an XML document, a prefixed ITS version attribute (e.g. its:version) MUST on the element where the ITS markup is used, or on one of its ancestors.<strong class="hl-tag" style="color: #000096">&lt;/assert&gt;</strong>
     <strong class="hl-tag" style="color: #000096">&lt;/rule&gt;</strong>
+    <strong class="hl-tag" style="color: #000096">&lt;rule</strong> <span class="hl-attribute" style="color: #F5844C">context</span>=<span class="hl-value" style="color: #993300">"its:provenanceRecords | its:locQualityIssues"</span><strong class="hl-tag" style="color: #000096">&gt;</strong>
+      <strong class="hl-tag" style="color: #000096">&lt;assert</strong> <span class="hl-attribute" style="color: #F5844C">test</span>=<span class="hl-value" style="color: #993300">"self::*/@version | ancestor::*/@its:version | //its:rules/@version"</span><strong class="hl-tag" style="color: #000096">&gt;</strong>The version is indicated by the ITS version attribute. This attribute is mandatory for the rules element, where it MUST be in no namespace. If there is no rules element in an XML document, a prefixed ITS version attribute (e.g. its:version) MUST on the element where the ITS markup is used, or on one of its ancestors. For standoff markup unprefixed version attribute is used.<strong class="hl-tag" style="color: #000096">&lt;/assert&gt;</strong>
+    <strong class="hl-tag" style="color: #000096">&lt;/rule&gt;</strong>
     <strong class="hl-tag" style="color: #000096">&lt;rule</strong> <span class="hl-attribute" style="color: #F5844C">context</span>=<span class="hl-value" style="color: #993300">"*[@its:version]"</span><strong class="hl-tag" style="color: #000096">&gt;</strong>
       <strong class="hl-tag" style="color: #000096">&lt;assert</strong> <span class="hl-attribute" style="color: #F5844C">test</span>=<span class="hl-value" style="color: #993300">"if (@its:version and //its:rules/@version) then //its:rules/@version = @its:version else true()"</span><strong class="hl-tag" style="color: #000096">&gt;</strong>There MUST NOT be two different versions of ITS in the same document.<strong class="hl-tag" style="color: #000096">&lt;/assert&gt;</strong>
       <strong class="hl-tag" style="color: #000096">&lt;assert</strong> <span class="hl-attribute" style="color: #F5844C">test</span>=<span class="hl-value" style="color: #993300">"every $v in //*/@its:version satisfies $v = @its:version"</span><strong class="hl-tag" style="color: #000096">&gt;</strong>There MUST NOT be two different versions of ITS in the same document.<strong class="hl-tag" style="color: #000096">&lt;/assert&gt;</strong>
     <strong class="hl-tag" style="color: #000096">&lt;/rule&gt;</strong>
+    <strong class="hl-tag" style="color: #000096">&lt;rule</strong> <span class="hl-attribute" style="color: #F5844C">context</span>=<span class="hl-value" style="color: #993300">"its:provenanceRecords | its:locQualityIssues"</span><strong class="hl-tag" style="color: #000096">&gt;</strong>
+      <strong class="hl-tag" style="color: #000096">&lt;assert</strong> <span class="hl-attribute" style="color: #F5844C">test</span>=<span class="hl-value" style="color: #993300">"if (@version and //its:rules/@version) then //its:rules/@version = @version else true()"</span><strong class="hl-tag" style="color: #000096">&gt;</strong>There MUST NOT be two different versions of ITS in the same document.<strong class="hl-tag" style="color: #000096">&lt;/assert&gt;</strong>
+      <strong class="hl-tag" style="color: #000096">&lt;assert</strong> <span class="hl-attribute" style="color: #F5844C">test</span>=<span class="hl-value" style="color: #993300">"every $v in //*/@its:version satisfies $v = @version"</span><strong class="hl-tag" style="color: #000096">&gt;</strong>There MUST NOT be two different versions of ITS in the same document.<strong class="hl-tag" style="color: #000096">&lt;/assert&gt;</strong>
+    <strong class="hl-tag" style="color: #000096">&lt;/rule&gt;</strong>
   <strong class="hl-tag" style="color: #000096">&lt;/pattern&gt;</strong>
   <strong class="hl-tag" style="color: #000096">&lt;pattern&gt;</strong>
     <strong class="hl-tag" style="color: #000096">&lt;title&gt;</strong>Global, Rule-based Selection<strong class="hl-tag" style="color: #000096">&lt;/title&gt;</strong>
@@ -5665,13 +5671,25 @@
   <strong class="hl-tag" style="color: #000096">&lt;pattern&gt;</strong>
     <strong class="hl-tag" style="color: #000096">&lt;title&gt;</strong>ITS Tools Annotation<strong class="hl-tag" style="color: #000096">&lt;/title&gt;</strong>
     <strong class="hl-tag" style="color: #000096">&lt;rule</strong> <span class="hl-attribute" style="color: #F5844C">context</span>=<span class="hl-value" style="color: #993300">"*[@its:annotatorsRef]"</span><strong class="hl-tag" style="color: #000096">&gt;</strong>
-      <strong class="hl-tag" style="color: #000096">&lt;assert</strong> <span class="hl-attribute" style="color: #F5844C">test</span>=<span class="hl-value" style="color: #993300">"every $ref in tokenize(@annotatorsRef, '\s+') satisfies matches($ref, '(translate|localization-note|terminology|directionality|ruby|language-information|elements-within-text|domain|disambiguation|locale-filter|provenance|external-resource|target-pointer|id-value|preserve-space|localization-quality-issue|localization-quality-rating|mt-confidence|allowed-characters|storage-size)\|.+')"</span><strong class="hl-tag" style="color: #000096">&gt;</strong>The value of annotatorsRef is a space-separated list of references where each reference is composed of two parts: a data category identifier and an IRI. These two parts are separated by a character | VERTICAL LINE (U+007C).<strong class="hl-tag" style="color: #000096">&lt;/assert&gt;</strong>
+      <strong class="hl-tag" style="color: #000096">&lt;assert</strong> <span class="hl-attribute" style="color: #F5844C">test</span>=<span class="hl-value" style="color: #993300">"every $ref in tokenize(@its:annotatorsRef, '\s+') satisfies matches($ref, '(translate|localization-note|terminology|directionality|ruby|language-information|elements-within-text|domain|text-analysis|locale-filter|provenance|external-resource|target-pointer|id-value|preserve-space|localization-quality-issue|localization-quality-rating|mt-confidence|allowed-characters|storage-size)\|.+')"</span><strong class="hl-tag" style="color: #000096">&gt;</strong>The value of annotatorsRef is a space-separated list of references where each reference is composed of two parts: a data category identifier and an IRI. These two parts are separated by a character | VERTICAL LINE (U+007C).<strong class="hl-tag" style="color: #000096">&lt;/assert&gt;</strong>
+    <strong class="hl-tag" style="color: #000096">&lt;/rule&gt;</strong>
+  <strong class="hl-tag" style="color: #000096">&lt;/pattern&gt;</strong>
+  <strong class="hl-tag" style="color: #000096">&lt;pattern&gt;</strong>
+    <strong class="hl-tag" style="color: #000096">&lt;title&gt;</strong>Source of confidence<strong class="hl-tag" style="color: #000096">&lt;/title&gt;</strong>
+    <strong class="hl-tag" style="color: #000096">&lt;rule</strong> <span class="hl-attribute" style="color: #F5844C">context</span>=<span class="hl-value" style="color: #993300">"*[@its:termConfidence]"</span><strong class="hl-tag" style="color: #000096">&gt;</strong>
+      <strong class="hl-tag" style="color: #000096">&lt;assert</strong> <span class="hl-attribute" style="color: #F5844C">test</span>=<span class="hl-value" style="color: #993300">"ancestor-or-self::*[@its:annotatorsRef][matches(@its:annotatorsRef, '.*\s*terminology\|.+')]"</span><strong class="hl-tag" style="color: #000096">&gt;</strong>Any node selected by the terminology data category with the termConfidence attribute specified MUST be contained in an element with the annotatorsRef attribute specified for the Terminology data category.<strong class="hl-tag" style="color: #000096">&lt;/assert&gt;</strong>
+    <strong class="hl-tag" style="color: #000096">&lt;/rule&gt;</strong>
+    <strong class="hl-tag" style="color: #000096">&lt;rule</strong> <span class="hl-attribute" style="color: #F5844C">context</span>=<span class="hl-value" style="color: #993300">"*[@its:taConfidence]"</span><strong class="hl-tag" style="color: #000096">&gt;</strong>
+      <strong class="hl-tag" style="color: #000096">&lt;assert</strong> <span class="hl-attribute" style="color: #F5844C">test</span>=<span class="hl-value" style="color: #993300">"ancestor-or-self::*[@its:annotatorsRef][matches(@its:annotatorsRef, '.*\s*text-analysis\|.+')]"</span><strong class="hl-tag" style="color: #000096">&gt;</strong>Any node selected by the Text Analysis data category with the taConfidence attribute specified MUST be contained in an element with the annotatorsRef attribute specified for the Text Analysis data category.<strong class="hl-tag" style="color: #000096">&lt;/assert&gt;</strong>
+    <strong class="hl-tag" style="color: #000096">&lt;/rule&gt;</strong>
+    <strong class="hl-tag" style="color: #000096">&lt;rule</strong> <span class="hl-attribute" style="color: #F5844C">context</span>=<span class="hl-value" style="color: #993300">"*[@its:mtConfidence]"</span><strong class="hl-tag" style="color: #000096">&gt;</strong>
+      <strong class="hl-tag" style="color: #000096">&lt;assert</strong> <span class="hl-attribute" style="color: #F5844C">test</span>=<span class="hl-value" style="color: #993300">"ancestor-or-self::*[@its:annotatorsRef][matches(@its:annotatorsRef, '.*\s*mt-confidence\|.+')]"</span><strong class="hl-tag" style="color: #000096">&gt;</strong>Any node selected by the MT Confidence data category MUST be contained in an element with the annotatorsRef attribute specified for the MT Confidence data category.<strong class="hl-tag" style="color: #000096">&lt;/assert&gt;</strong>
     <strong class="hl-tag" style="color: #000096">&lt;/rule&gt;</strong>
   <strong class="hl-tag" style="color: #000096">&lt;/pattern&gt;</strong>
   <strong class="hl-tag" style="color: #000096">&lt;pattern&gt;</strong>
-    <strong class="hl-tag" style="color: #000096">&lt;title&gt;</strong>Disambiguation<strong class="hl-tag" style="color: #000096">&lt;/title&gt;</strong>
-    <strong class="hl-tag" style="color: #000096">&lt;rule</strong> <span class="hl-attribute" style="color: #F5844C">context</span>=<span class="hl-value" style="color: #993300">"its:disambiguationRule"</span><strong class="hl-tag" style="color: #000096">&gt;</strong>
-      <strong class="hl-tag" style="color: #000096">&lt;assert</strong> <span class="hl-attribute" style="color: #F5844C">test</span>=<span class="hl-value" style="color: #993300">"@disambigClassPointer | @disambigClassRefPointer | @disambigSourcePointer | @disambigIdentPointer | @disambigIdentRefPointer"</span><strong class="hl-tag" style="color: #000096">&gt;</strong>Disambiguation rule must specify at least target type class or target identity.<strong class="hl-tag" style="color: #000096">&lt;/assert&gt;</strong>
+    <strong class="hl-tag" style="color: #000096">&lt;title&gt;</strong>Text analysis<strong class="hl-tag" style="color: #000096">&lt;/title&gt;</strong>
+    <strong class="hl-tag" style="color: #000096">&lt;rule</strong> <span class="hl-attribute" style="color: #F5844C">context</span>=<span class="hl-value" style="color: #993300">"its:textAnalysisRule"</span><strong class="hl-tag" style="color: #000096">&gt;</strong>
+      <strong class="hl-tag" style="color: #000096">&lt;assert</strong> <span class="hl-attribute" style="color: #F5844C">test</span>=<span class="hl-value" style="color: #993300">"@taClassRefPointer | @taSourcePointer | @taIdentPointer | @taIdentRefPointer"</span><strong class="hl-tag" style="color: #000096">&gt;</strong>Text analysis rule must specify at least target type class or target identity.<strong class="hl-tag" style="color: #000096">&lt;/assert&gt;</strong>
     <strong class="hl-tag" style="color: #000096">&lt;/rule&gt;</strong>
   <strong class="hl-tag" style="color: #000096">&lt;/pattern&gt;</strong>
   <strong class="hl-tag" style="color: #000096">&lt;pattern&gt;</strong>
@@ -5745,7 +5763,7 @@
             algorithm</a>.</p><p>As a natural language processing (NLP) tool, we choose <a href="https://github.com/dbpedia-spotlight/dbpedia-spotlight#readme" shape="rect">DBpedia
             Spotlight</a>. For this example let's assume DBpedia Spotlight linked "Ireland" to
           DBpedia:</p><div class="exampleInner"><div class="exampleOuter"><pre xml:space="preserve">&lt;http://example.com/exampledoc.html#char=21,28&gt; 
-    rdf:type                 str:String ;
+    rdf:type                 nif:RFC5147String;
     itsrdf:taIdentRef  &lt;http://dbpedia.org/resource/Ireland&gt; .
 &lt;http://dbpedia.org/resource/Ireland&gt; 
     rdf:type                 &lt;http:/nerd.eurecom.fr/ontology#Country&gt; .
--- /w3ccvs/WWW/International/multilingualweb/lt/drafts/its20/its20.odd	2013/04/09 20:57:09	1.378
+++ /w3ccvs/WWW/International/multilingualweb/lt/drafts/its20/its20.odd	2013/04/10 15:13:58	1.379
@@ -96,11 +96,11 @@
           href="http://www.w3.org/2004/02/Process-20040205/tr.html#maturity-levels">W3C document
           maturity levels</loc>).</p>
 
-      <p xml:id="open-issues">All <loc href="http://tinyurl.com/its20-comments-handling">last call issues</loc> in the normative sections (from <specref ref="notation-terminology"/> to
+      <p xml:id="open-issues">All <loc href="http://services.w3.org/xslt?xslfile=http%3A%2F%2Fwww.w3.org%2FInternational%2Fmultilingualweb%2Flt%2Fdrafts%2Fits20%2Ftools%2Fxslt%2Fdisco-for-its20.xsl&amp;xmlfile=https%3A%2F%2Fwww.w3.org%2FInternational%2Fmultilingualweb%2Flt%2Ftrack%2Fapi%2Fdump%3Ffull&amp;content-type=&amp;submit=transform">last call issues</loc> in the normative sections (from <specref ref="notation-terminology"/> to
           <specref ref="datacategory-description"/> and <specref ref="normative-references"/> to
-          <specref ref="its-schemas"/>) have been resolved, with 4 exceptions: @@@@. The other, non-normative sections contain only
+          <specref ref="its-schemas"/>) have been resolved, with <loc href="#unresolved-issues">four exceptions</loc>. The other, non-normative sections contain only
         explanatory material and will be updated in a later working draft. The Working Group
-         encourages feedback until end of April 2013 on the @@@ open issues: @@@@ sections.</p>
+         encourages feedback until end of April 2013 on the <loc href="#unresolved-issues">unresolved issues</loc>.</p>
 
       <p>To give feedback send your comments to <loc
           href="mailto:public-multilingualweb-lt-comments@w3.org"
@@ -1612,17 +1612,16 @@
             representations) that contain ITS metadata to the RDF-based format based on <ptr
               target="#NIF" type="bibref"/>. The conversion results in RDF triples.</p>
           <note><p>The algorithm is intended to extract the text from the XML/HTML/DOM for an NLP
-              tool and can produce a lot of <quote>phantom</quote> predicates from excessive
+              tool. It can produce a lot of <quote>phantom</quote> predicates from excessive
               whitespace, which 1) increases the size of the intermediate mapping and 2) extracts
-              this whitespace as text. This might decrease NLP performance. It is recommended to
+              this whitespace as text, and therefore might decrease NLP performance. It is strongly recommended to
               normalize whitespace in the input XML/HTML/DOM in order to minimize such phantom
-              predicates. A normalized example is given below. The whitespace normalization
-              algorithm itself is format dependent, e.g. it differs for HTML compared to general
-              XML. Hence no normative algorithm for whitespace normalization is given as part of
+              predicates. A normalized example is given below. Since the whitespace normalization
+              algorithm itself is format dependent, for example, it differs for HTML compared to general
+              XML, no normative algorithm for whitespace normalization is given as part of
               this specification.</p></note>
           <exemplum xml:id="EX-HTML-whitespace-normalization">
-            <head>Example of an HTML document with whitespace normalized as preparation for
-              conversion to NIF</head>
+            <head>Example of an HTML document with whitespace character normalization as preparation for the conversion to NIF</head>
             <eg><![CDATA[<html><body><h2 translate="yes">Welcome to <span 
    its-ta-ident-ref="http://dbpedia.org/resource/Dublin" its-within-text="yes"
    translate="no">Dublin</span> in <b translate="no" its-within-text="yes">Ireland</b>!</h2></body></html>]]></eg>
@@ -1632,8 +1631,7 @@
           <list type="unordered">
             <item><p xml:id="its2nif-algorithm-step1">STEP 1: Get an ordered list of all text nodes
                 of the document.</p></item>
-            <item><p xml:id="its2nif-algorithm-step2">STEP 2: Generate an XPath expression for each
-                non-empty text node of all leaf elements and remember them.</p></item>
+            <item><p xml:id="its2nif-algorithm-step2">STEP 2: Generate an XPath expression for each non-empty text node of all leaf elements and memorize them.</p></item>
             <item><p xml:id="its2nif-algorithm-step3">STEP 3: Get the text for each node and make a
                 tuple with the XPath expressions (X,T). Since the text nodes have a certain order we
                 now have a list of ordered tuples ((x0,t0), (x1,t1), ..., (xn,tn)).</p></item>
@@ -1695,41 +1693,35 @@
           <list type="unordered">
             <item><p xml:id="its2nif-algorithm-step5">STEP 5: Create a context URI and attach the
                 whole concatenated text of the document as reference.</p></item>
-            <item><p xml:id="its2nif-algorithm-step6">STEP 6: Now attach any ITS metadata items from
-                the XML/HTML/DOM input to respective NIF URIs.</p></item>
-            <item><p xml:id="its2nif-algorithm-step7">STEP 7: Omit all irrelevant URIs (those that
-                do not carry annotations, they will just bloat the data).</p></item>
+            <item><p xml:id="its2nif-algorithm-step6">STEP 6: Attach any ITS metadata annotations from the XML/HTML/DOM input to the respective NIF URIs.</p></item>
+            <item><p xml:id="its2nif-algorithm-step7">STEP 7: Omit all URIs that do not carry annotations (they will just bloat the data).</p></item>
           </list>
           <eg rend="text"><![CDATA[@prefix itsrdf: <http://www.w3.org/2005/11/its/rdf#> .
+@prefix nif: <http://persistence.uni-leipzig.org/nlp2rdf/ontologies/nif-core#>
 <http://example.com/exampledoc.html#char=0,29>
-    rdf:type             str:Context ;
-    rdf:type             str:OffsetBasedString ;
+    rdf:type             nif:Context ;
+    rdf:type             nif:RFC5147String ;
 # concatenate the whole text
-    str:isString         "$(t0+t1+t2+...+tn)" ; 
+    nif:isString         "$(t0+t1+t2+...+tn)" ; 
     itsrdf:translate     "yes";
-    str:occursIn      <http://example.com/exampledoc.html> .
+    nif:occursIn      <http://example.com/exampledoc.html> .
 <http://example.com/exampledoc.html#char=11,17> 
-    rdf:type              str:String ;
-    rdf:type              str:OffsetBasedString ;
+    rdf:type              nif:RFC5147String;
+    rdf:type              nif:RFC5147String ;
     itsrdf:translate     "no";
     itsrdf:taIdentRef  <http://dbpedia.org/resource/Dublin> ;
-    str:referenceContext <http://example.com/exampledoc.html#char=0,29> .
+    nif:referenceContext <http://example.com/exampledoc.html#char=0,29> .
 <http://example.com/exampledoc.html#char=21,28> 
-    rdf:type              str:String ;
-    rdf:type              str:OffsetBasedString ;
+    rdf:type              nif:RFC5147String;
+    rdf:type              nif:RFC5147String ;
     itsrdf:translate     "no";
-    str:referenceContext <http://example.com/exampledoc.html#char=0,29> .
+    nif:referenceContext <http://example.com/exampledoc.html#char=0,29> .
 ]]></eg>
           <p>A complete sample output in RDF/XML format after step 7, given the input document <ptr
               target="#EX-HTML-whitespace-normalization" type="exref"/>, is available at <ref
               target="examples/nif/EX-nif-conversion-output.xml"
               >examples/nif/EX-nif-conversion-output.xml</ref>.</p>
-          <note type="ed">above example needs to be updated with regards to "text analysis" output.</note>
-          <note><p>The conversion to NIF is the basis for natural language processing (NLP)
-              applications, creating for example named entity annotations. A non-normative algorithm
-              to integrate these annotations into the original input document is given in <ptr
-                target="#nif-backconversion" type="specref"/>. The algorithm in that appendix is
-              non-normative since many choices depend on the actual NLP application.</p></note>
+          <note><p>The conversion to NIF is a possible basis for a natural language processing (NLP) application that creates, for example, named entity annotations. A non-normative algorithm to integrate these annotations into the original input document is given in <ptr target="#nif-backconversion" type="specref"/>. This algorithm is non-normative because many decisions depend on the actually employed NLP application.</p></note>
         </div>
         <div xml:id="its-tool-annotation">
           <head>ITS Tools Annotation</head>
@@ -4877,8 +4869,7 @@
               <ref target="http://www.iana.org/assignments/character-sets">Character Sets</ref>
             </title> Available at <ref target="http://www.iana.org/assignments/character-sets"
               >http://www.iana.org/assignments/character-sets</ref>.</bibl>
-          <bibl xml:id="NIF" n="NIF">NIF Placeholder at
-            http://persistence.uni-leipzig.org/nlp2rdf/</bibl>
+          <bibl xml:id="NIF" n="NIF">Hellmann, S.  (ed.). <ref target="http://persistence.uni-leipzig.org/nlp2rdf/ontologies/nif-core">NIF 2.0 Core Ontology Version 1.0</ref>, version April 2013. Available at http://persistence.uni-leipzig.org/nlp2rdf/ontologies/nif-core under CC-BY 2.0  license. </bibl>
           <bibl xml:id="qa-framework" n="QAFRAMEWORK">Karl Dubost, Lynne Rosental, Dominique
             Hazaël-Massieux, Lofton Henderson. <title>
               <ref target="http://www.w3.org/TR/2005/REC-qaframe-spec-20050817/">QA Framework:
@@ -5715,7 +5706,7 @@
             Spotlight</ref>. For this example let's assume DBpedia Spotlight linked "Ireland" to
           DBpedia:</p>
         <eg rend="text"><![CDATA[<http://example.com/exampledoc.html#char=21,28> 
-    rdf:type                 str:String ;
+    rdf:type                 nif:RFC5147String;
     itsrdf:taIdentRef  <http://dbpedia.org/resource/Ireland> .
 <http://dbpedia.org/resource/Ireland> 
     rdf:type                 <http:/nerd.eurecom.fr/ontology#Country> .
@@ -5959,6 +5950,15 @@
       </div>
       <div xml:id="revisionlog" type="inform">
         <head>Revision Log</head>
+        <p xml:id="unresolved-issues">The following four issues have not been resolved since the publication of the last call <ref target="http://www.w3.org/TR/2012/WD-its20-20121206/"
+          >ITS 2.0 Working Draft 6 December 2012</ref>:</p>
+        <list type="unordered">
+          <item><ref target="https://www.w3.org/International/multilingualweb/lt/track/issues/67">issue-67: Change definition of regular expression for allowed characters</ref></item>
+          <item><ref target="https://www.w3.org/International/multilingualweb/lt/track/issues/91">issue-91: Parts of Ruby section should be removed</ref>. The Working Group decided to remove the Ruby section but seeks feedback before finalizing this decision.</item>
+          <item><ref target="https://www.w3.org/International/multilingualweb/lt/track/issues/97">issue-97: HTML translate</ref></item>
+          <item><ref target="https://www.w3.org/International/multilingualweb/lt/track/issues/118">issue-118: HTML ITS default behaviour</ref></item>
+        </list>
+        <p>The Working Group encourages feedback on these issues by end of April 2013.</p>
         <p xml:id="changelog-since-20121206">The following log records major changes that have been
           made to this document since the <ref target="http://www.w3.org/TR/2012/WD-its20-20121206/"
             >ITS 2.0 Working Draft 6 December 2012</ref>.</p>
Received on Wednesday, 10 April 2013 15:14:01 UTC

This archive was generated by hypermail 2.4.0 : Friday, 17 January 2020 16:32:41 UTC