- From: CVS User fsasaki <cvsmail@w3.org>
- Date: Sat, 06 Apr 2013 10:55:38 +0000
- To: public-multilingualweb-lt-commits@w3.org
Update of /w3ccvs/WWW/International/multilingualweb/lt/drafts/its20 In directory gil:/tmp/cvs-serv11940 Modified Files: its20.html its20.odd Log Message: changes related to NIF - action-458 and action-480 --- /w3ccvs/WWW/International/multilingualweb/lt/drafts/its20/its20.html 2013/04/06 09:13:54 1.378 +++ /w3ccvs/WWW/International/multilingualweb/lt/drafts/its20/its20.html 2013/04/06 10:55:37 1.379 @@ -54,9 +54,9 @@ <div class="toc3">5.2.2 <a href="#selection-local" shape="rect">Local Selection in an XML Document</a></div> </div> <div class="toc2">5.3 <a href="#selectors" shape="rect">Query Language of Selectors</a><div class="toc3">5.3.1 <a href="#queryLanguage" shape="rect">Choosing Query Language</a></div> -<div class="toc3">5.3.2 <a href="#d0e2054" shape="rect">XPath 1.0</a></div> +<div class="toc3">5.3.2 <a href="#d0e2059" shape="rect">XPath 1.0</a></div> <div class="toc3">5.3.3 <a href="#css-selectors" shape="rect">CSS Selectors</a></div> -<div class="toc3">5.3.4 <a href="#d0e2299" shape="rect">Additional query languages</a></div> +<div class="toc3">5.3.4 <a href="#d0e2304" shape="rect">Additional query languages</a></div> <div class="toc3">5.3.5 <a href="#its-param" shape="rect">Variables in selectors</a></div> </div> <div class="toc2">5.4 <a href="#link-external-rules" shape="rect">Link to External Rules</a></div> @@ -643,7 +643,7 @@ and localization of XML schemas and documents.] The concept of a data category is independent of its implementation in an XML and HTML environment (e.g. using an element or attribute).</p><p>For each data category, ITS distinguishes between the following:</p><ul><li><p>the prose description, see <a class="section-ref" href="#datacategory-description" shape="rect">Section 8: Description of Data Categories</a></p></li><li><p>schema language independent formalization, see the "implementation" subsections in - <a class="section-ref" href="#datacategory-description" shape="rect">Section 8: Description of Data Categories</a></p></li><li><p>schema language specific implementations, see <a class="section-ref" href="#its-schemas" shape="rect">Appendix D: Schemas for ITS</a></p></li></ul><div class="exampleOuter"><div class="exampleHeader"><a name="d0e1132" id="d0e1132" shape="rect"/>Example 13: A data category and its implementation</div><p>The <a href="#trans-datacat" shape="rect">Translate</a> data category conveys information as + <a class="section-ref" href="#datacategory-description" shape="rect">Section 8: Description of Data Categories</a></p></li><li><p>schema language specific implementations, see <a class="section-ref" href="#its-schemas" shape="rect">Appendix D: Schemas for ITS</a></p></li></ul><div class="exampleOuter"><div class="exampleHeader"><a name="d0e1137" id="d0e1137" shape="rect"/>Example 13: A data category and its implementation</div><p>The <a href="#trans-datacat" shape="rect">Translate</a> data category conveys information as to whether a piece of content should be translated or not.</p><p>The simplest formalization of this prose description on a schema language independent level is a <code class="its-attr-markup">translate</code> attribute with two possible values: "yes" and "no". An implementation on a schema language specific @@ -910,9 +910,9 @@ actual query language. The query language is set by <code class="its-attr-markup">queryLanguage</code> attribute on <code class="its-elem-markup">rules</code> element. If <code class="its-attr-markup">queryLanguge</code> is not specified XPath 1.0 is used as a default query language.</p></div><div class="div3"> -<h4><a href="#contents" shape="rect"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents."/></a><a name="d0e2054" id="d0e2054" shape="rect"/>5.3.2 XPath 1.0</h4><p>XPath 1.0 is identified by <code>xpath</code> value in <code class="its-attr-markup">queryLanguage</code> +<h4><a href="#contents" shape="rect"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents."/></a><a name="d0e2059" id="d0e2059" shape="rect"/>5.3.2 XPath 1.0</h4><p>XPath 1.0 is identified by <code>xpath</code> value in <code class="its-attr-markup">queryLanguage</code> attribute.</p><div class="div4"> -<h5><a name="d0e2065" id="d0e2065" shape="rect"/>5.3.2.1 Absolute selector</h5><p>The absolute selector <a href="#rfc-keywords" shape="rect">MUST</a> be an XPath expression +<h5><a name="d0e2070" id="d0e2070" shape="rect"/>5.3.2.1 Absolute selector</h5><p>The absolute selector <a href="#rfc-keywords" shape="rect">MUST</a> be an XPath expression which starts with "<code>/</code>". That is, it must be an <a href="http://www.w3.org/TR/xpath/#NT-AbsoluteLocationPath" shape="rect"> AbsoluteLocationPath</a> or union of <a href="http://www.w3.org/TR/xpath/#NT-AbsoluteLocationPath" shape="rect"> AbsoluteLocationPath</a>s as described in <a href="#xpath" shape="rect">XPath 1.0</a>. @@ -957,14 +957,14 @@ implementations can be used.</p></div><div class="note"><p class="prefix"><b>Note:</b></p><p id="css-selectors-and-attributes">CSS selectors have no ability to point to attributes.</p></div><p>CSS Selectors are identified by <code>css</code> value in <code class="its-attr-markup">queryLanguage</code> attribute.</p><div class="div4"> -<h5><a name="d0e2276" id="d0e2276" shape="rect"/>5.3.3.1 Absolute selector</h5><p>Absolute selector <a href="#rfc-keywords" shape="rect">MUST</a> be interpreted as selector +<h5><a name="d0e2281" id="d0e2281" shape="rect"/>5.3.3.1 Absolute selector</h5><p>Absolute selector <a href="#rfc-keywords" shape="rect">MUST</a> be interpreted as selector as defined in <a title="Selectors Level
 3" href="#css3-selectors" shape="rect">[Selectors Level 3]</a>. Both simple selectors and groups of selectors can be used.</p></div><div class="div4"> -<h5><a name="d0e2286" id="d0e2286" shape="rect"/>5.3.3.2 Relative selector</h5><p>Relative selector <a href="#rfc-keywords" shape="rect">MUST</a> be interpreted as selector +<h5><a name="d0e2291" id="d0e2291" shape="rect"/>5.3.3.2 Relative selector</h5><p>Relative selector <a href="#rfc-keywords" shape="rect">MUST</a> be interpreted as selector as defined in <a title="Selectors Level
 3" href="#css3-selectors" shape="rect">[Selectors Level 3]</a>. Selector is not evaluated against the complete document tree but only against subtrees rooted at nodes selected by selector in the <code class="its-attr-markup">selector</code> attribute.</p></div></div><div class="div3"> -<h4><a href="#contents" shape="rect"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents."/></a><a name="d0e2299" id="d0e2299" shape="rect"/>5.3.4 Additional query languages</h4><p>ITS processors <a href="#rfc-keywords" shape="rect">MAY</a> support additional query +<h4><a href="#contents" shape="rect"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents."/></a><a name="d0e2304" id="d0e2304" shape="rect"/>5.3.4 Additional query languages</h4><p>ITS processors <a href="#rfc-keywords" shape="rect">MAY</a> support additional query languages. For each additional query language the processor <a href="#rfc-keywords" shape="rect">MUST</a> define:</p><ul><li><p>identifier of query language used in <code class="its-attr-markup">queryLanguage</code>;</p></li><li><p>rules for evaluating absolute selector to collection of nodes;</p></li><li><p>rules for evaluating relative selector to collection of nodes.</p></li></ul><p>Because future versions of this specification are likely to define additional query languages, the following query language identifiers are reserved: <code>xpath</code>, <code>css</code>, <code>xpath2</code>, <code>xpath3</code>, <code>xquery</code>, @@ -1149,8 +1149,8 @@ XML. Hence no normative algorithm for whitespace normalization is given as part of this specification.</p></div><div class="exampleOuter"><div class="exampleHeader"><a name="EX-HTML-whitespace-normalization" id="EX-HTML-whitespace-normalization" shape="rect"/>Example 25: Example of an HTML document with whitespace normalized as preparation for conversion to NIF</div><div class="exampleInner"><pre xml:space="preserve"><strong class="hl-tag" style="color: #000096"><html></strong><strong class="hl-tag" style="color: #000096"><body></strong><strong class="hl-tag" style="color: #000096"><h2</strong> <span class="hl-attribute" style="color: #F5844C">translate</span>=<span class="hl-value" style="color: #993300">"yes"</span><strong class="hl-tag" style="color: #000096">></strong>Welcome to <strong class="hl-tag" style="color: #000096"><span</strong> - <span class="hl-attribute" style="color: #F5844C">its-ta-ident-ref</span>=<span class="hl-value" style="color: #993300">"http://dbpedia.org/resource/Dublin"</span> - <span class="hl-attribute" style="color: #F5844C">translate</span>=<span class="hl-value" style="color: #993300">"no"</span><strong class="hl-tag" style="color: #000096">></strong>Dublin<strong class="hl-tag" style="color: #000096"></span></strong> in <strong class="hl-tag" style="color: #000096"><b</strong> <span class="hl-attribute" style="color: #F5844C">translate</span>=<span class="hl-value" style="color: #993300">"no"</span><strong class="hl-tag" style="color: #000096">></strong>Ireland<strong class="hl-tag" style="color: #000096"></b></strong>!<strong class="hl-tag" style="color: #000096"></h2></strong><strong class="hl-tag" style="color: #000096"></body></strong><strong class="hl-tag" style="color: #000096"></html></strong></pre></div></div><p id="its2nif-algorithm">The conversion algorithm to generate NIF consists of seven + <span class="hl-attribute" style="color: #F5844C">its-ta-ident-ref</span>=<span class="hl-value" style="color: #993300">"http://dbpedia.org/resource/Dublin"</span> <span class="hl-attribute" style="color: #F5844C">its-within-text</span>=<span class="hl-value" style="color: #993300">"yes"</span> + <span class="hl-attribute" style="color: #F5844C">translate</span>=<span class="hl-value" style="color: #993300">"no"</span><strong class="hl-tag" style="color: #000096">></strong>Dublin<strong class="hl-tag" style="color: #000096"></span></strong> in <strong class="hl-tag" style="color: #000096"><b</strong> <span class="hl-attribute" style="color: #F5844C">translate</span>=<span class="hl-value" style="color: #993300">"no"</span> <span class="hl-attribute" style="color: #F5844C">its-within-text</span>=<span class="hl-value" style="color: #993300">"yes"</span><strong class="hl-tag" style="color: #000096">></strong>Ireland<strong class="hl-tag" style="color: #000096"></b></strong>!<strong class="hl-tag" style="color: #000096"></h2></strong><strong class="hl-tag" style="color: #000096"></body></strong><strong class="hl-tag" style="color: #000096"></html></strong></pre></div></div><p id="its2nif-algorithm">The conversion algorithm to generate NIF consists of seven steps.</p><ul><li><p id="its2nif-algorithm-step1">STEP 1: Get an ordered list of all text nodes of the document.</p></li><li><p id="its2nif-algorithm-step2">STEP 2: Generate an XPath expression for each non-empty text node of all leaf elements and remember them.</p></li><li><p id="its2nif-algorithm-step3">STEP 3: Get the text for each node and make a @@ -1159,12 +1159,12 @@ The list with the XPath-to-text mapping can also be kept in memory. Part of a serialization example is given below.</p></li></ul><div class="exampleInner"><div class="exampleOuter"><pre xml:space="preserve">@prefix itsrdf: <http://www.w3.org/2005/11/its/rdf#> . <http://example.com/exampledoc.html#xpath(x0)> - itsrdf:xpath2nif <http://example.com/exampledoc.html#offset_b0_e0> + itsrdf:xpath2nif <http://example.com/exampledoc.html#char=b0,e0> <http://example.com/exampledoc.html#xpath(x1)> - itsrdf:xpath2nif <http://example.com/exampledoc.html#offset_b1_e1> + itsrdf:xpath2nif <http://example.com/exampledoc.html#char=b1,e1> # ... <http://example.com/exampledoc.html#xpath(xn)> - itsrdf:xpath2nif <http://example.com/exampledoc.html#offset_bn_en> + itsrdf:xpath2nif <http://example.com/exampledoc.html#char=bn,en> <mappings> <mapping x="xpath(x0)" b="b0" e="e0" /> <mapping x="xpath(x1)" b="b1" e="e1" /> @@ -1180,22 +1180,22 @@ </pre></div></div><p>Example (continued)</p><div class="exampleInner"><div class="exampleOuter"><pre xml:space="preserve">@prefix itsrdf: <http://www.w3.org/2005/11/its/rdf#> . # "Welcome to " <http://example.com/exampledoc.html#xpath(/html/body[1]/h2[1]/text()[1])> - itsrdf:nif <http://example.com/exampledoc.html#offset_0_11> . + itsrdf:nif <http://example.com/exampledoc.html#char=0,11> . # "Dublin" <http://example.com/exampledoc.html#xpath(/html/body[1]/h2[1]/span[1]/text()[1])> - itsrdf:nif <http://example.com/exampledoc.html#offset_11_17> . + itsrdf:nif <http://example.com/exampledoc.html#char=11,17> . # " in " <http://example.com/exampledoc.html#xpath(/html/body[1]/h2[1]/text()[2])> - itsrdf:nif <http://example.com/exampledoc.html#offset_17_21> . + itsrdf:nif <http://example.com/exampledoc.html#char=17,21> . # "Ireland" <http://example.com/exampledoc.html#xpath(/html/body[1]/h2[1]/b[1]/text()[1])> - itsrdf:nif <http://example.com/exampledoc.html#offset_21_28> . + itsrdf:nif <http://example.com/exampledoc.html#char=21,28> . # "!" <http://example.com/exampledoc.html#xpath(/html/body[1]/h2[1]/text()[3])> - itsrdf:nif <http://example.com/exampledoc.html#offset_28_29> . + itsrdf:nif <http://example.com/exampledoc.html#char=28,29> . # "Welcome to Dublin Ireland!" <http://example.com/exampledoc.html#xpath(/html/body[1]/h2[1]/text())> - itsrdf:nif <http://example.com/exampledoc.html#offset_0_29> . + itsrdf:nif <http://example.com/exampledoc.html#char=0,29> . <mappings> <mapping x="xpath(/html/body[1]/h2[1]/text()[1])" b="0" e="11" /> <mapping x="xpath(/html/body[1]/h2[1]/span[1]/text()[1])" b="11" e="17" /> @@ -1207,24 +1207,24 @@ whole concatenated text of the document as reference.</p></li><li><p id="its2nif-algorithm-step6">STEP 6: Now attach any ITS metadata items from the XML/HTML/DOM input to respective NIF URIs.</p></li><li><p id="its2nif-algorithm-step7">STEP 7: Omit all irrelevant URIs (those that do not carry annotations, they will just bloat the data).</p></li></ul><div class="exampleInner"><div class="exampleOuter"><pre xml:space="preserve">@prefix itsrdf: <http://www.w3.org/2005/11/its/rdf#> . -<http://example.com/exampledoc.html#offset_0_29> +<http://example.com/exampledoc.html#char=0,29> rdf:type str:Context ; rdf:type str:OffsetBasedString ; # concatenate the whole text str:isString "$(t0+t1+t2+...+tn)" ; - itsrdf:translate "yes"^^<http://www.w3.org/TR/its-2.0/its.xsd#yesOrNo> ; + itsrdf:translate "yes"; str:occursIn <http://example.com/exampledoc.html> . -<http://example.com/exampledoc.html#offset_11_17> +<http://example.com/exampledoc.html#char=11,17> rdf:type str:String ; rdf:type str:OffsetBasedString ; - itsrdf:translate "no"^^<http://www.w3.org/TR/its-2.0/its.xsd#yesOrNo> ; + itsrdf:translate "no"; itsrdf:taIdentRef <http://dbpedia.org/resource/Dublin> ; - str:referenceContext <http://example.com/exampledoc.html#offset_0_29> . -<http://example.com/exampledoc.html#offset_21_28> + str:referenceContext <http://example.com/exampledoc.html#char=0,29> . +<http://example.com/exampledoc.html#char=21,28> rdf:type str:String ; rdf:type str:OffsetBasedString ; - itsrdf:translate "no"^^<http://www.w3.org/TR/its-2.0/its.xsd#yesOrNo> ; - str:referenceContext <http://example.com/exampledoc.html#offset_0_29> . + itsrdf:translate "no"; + str:referenceContext <http://example.com/exampledoc.html#char=0,29> . </pre></div></div><p>A complete sample output in RDF/XML format after step 7, given the input document <a href="#EX-HTML-whitespace-normalization" shape="rect">Example 25</a>, is available at <a href="examples/nif/EX-nif-conversion-output.xml" shape="rect">examples/nif/EX-nif-conversion-output.xml</a>.</p><span class="editor-note">[Ed. note: above example needs to be updated with regards to "text analysis" output.]</span><div class="note"><p class="prefix"><b>Note:</b></p><p>The conversion to NIF is the basis for natural language processing (NLP) applications, creating for example named entity annotations. A non-normative algorithm to integrate these annotations into the original input document is given in <a class="section-ref" href="#nif-backconversion" shape="rect">Appendix F: Conversion NIF2ITS</a>. The algorithm in that appendix is @@ -2176,7 +2176,7 @@ data category can be expressed with global rules, or locally on an individual element. There is no inheritance.</p><p id="text-analysis-global"> GLOBAL: The - <code class="its-elem-markup">TextAnalysisRule</code> + <code class="its-elem-markup">textAnalysisRule</code> element contains the following: </p><ul><li><p> A required @@ -5742,7 +5742,7 @@ converted to NIF, leading to the <a href="examples/nif/EX-nif-conversion-output.xml" shape="rect">output</a> exemplified for the <a href="#its2nif-algorithm" shape="rect">ITS2NIF conversion algorithm</a>.</p><p>As a natural language processing (NLP) tool, we choose <a href="https://github.com/dbpedia-spotlight/dbpedia-spotlight#readme" shape="rect">DBpedia Spotlight</a>. For this example let's assume DBpedia Spotlight linked "Ireland" to - DBpedia:</p><div class="exampleInner"><div class="exampleOuter"><pre xml:space="preserve"><http://example.com/exampledoc.html#offset_21_28> + DBpedia:</p><div class="exampleInner"><div class="exampleOuter"><pre xml:space="preserve"><http://example.com/exampledoc.html#char=21,28> rdf:type str:String ; itsrdf:taIdentRef <http://dbpedia.org/resource/Ireland> . <http://dbpedia.org/resource/Ireland> @@ -5756,9 +5756,9 @@ to the latest definition of "textanalysis".]</span><p>CASE 1: The NLP annotation created in NIF matches the text node. Solution: Attach the annotation to the parent element of the text node.</p><div class="exampleInner"><div class="exampleOuter"><pre xml:space="preserve"># Based on: <http://example.com/exampledoc.html#xpath(/html/body[1]/h2[1]/b[1]/text()[1])> - itsrdf:nif <http://example.com/exampledoc.html#offset_21_28> . + itsrdf:nif <http://example.com/exampledoc.html#char=21,28> . # and: -<http://example.com/exampledoc.html#offset_21_28> +<http://example.com/exampledoc.html#char=21,28> itsrdf:taIdentRef <http://dbpedia.org/resource/Ireland> . # we can attach the metadata to the parent node: <b its-ta-ident-ref="http://dbpedia.org/resource/Dublin” @@ -5776,11 +5776,11 @@ # ITS2NIF <http://example.com/exampledoc.html#xpath(/html/body[1]/h2[1]/text()[1])> - itsrdf:nif <http://example.com/exampledoc.html#offset_0_29> + itsrdf:nif <http://example.com/exampledoc.html#char=0,29> # DBpedia Spotlight returns: -<http://example.com/exampledoc.html#offset_21_28> +<http://example.com/exampledoc.html#char=21,28> itsrdf:taIdentRef <http://dbpedia.org/resource/Ireland> . # NIF2ITS @@ -5973,7 +5973,7 @@ HTML, see <a href="https://www.w3.org/International/multilingualweb/lt/track/issues/123" shape="rect">issue-123</a>.</p></li><li><p>Put <a href="#css-selectors-and-attributes" shape="rect">a note</a> in <a class="section-ref" href="#css-selectors" shape="rect">Section 5.3.3: CSS Selectors</a> about CSS selectors and attributes, see <a href="https://www.w3.org/International/multilingualweb/lt/track/issues/99" shape="rect">issue-99</a>.</p></li><li><p>Updated <a href="#lqissue-typevalues" shape="rect">the table of Localization Quality Issue Type values</a> to clarify the value of “inconsistency,” see <a href="https://www.w3.org/International/multilingualweb/lt/track/issues/76" shape="rect">issue-76</a>.</p></li><li><p>Updated <a href="#lqissue-typevalues" shape="rect">the table of Localization Quality Issue Type values</a> to properly use RFC2119 values. See <a href="https://www.w3.org/International/multilingualweb/lt/track/issues/112" shape="rect">issue-112</a> and <a href="https://www.w3.org/International/multilingualweb/lt/track/issues/124" shape="rect">issue-124</a>.</p></li><li><p>Updated <a href="#lqissue" shape="rect">Localization Quality Issue</a> to reference ISO/TS - 11669 and Structure specifications. See <a href="https://www.w3.org/International/multilingualweb/lt/track/issues/83" shape="rect">issue-83</a>.</p></li><li><p>Renamed Disambiguation data category to Text Analysis. Removed <code>disambigGranularity</code> attribute. All other attributes of Disambiguation were renamed to have the prefix "ta". Rewrote <a href="#textanalysis" shape="rect">defining section</a>. See for all changes <a href="https://www.w3.org/International/multilingualweb/lt/track/issues/68" shape="rect">issue-68</a>.</p></li><li><p>Added non-normative reference to ITS 1.0 <a title="
Internationalization Tag Set (ITS) Version 1.0
" href="#its10" shape="rect">[ITS 1.0]</a>.</p></li></ol><p id="changelog-since-20121023">The following log records major changes that have been + 11669 and Structure specifications. See <a href="https://www.w3.org/International/multilingualweb/lt/track/issues/83" shape="rect">issue-83</a>.</p></li><li><p>Renamed Disambiguation data category to Text Analysis. Removed <code>disambigGranularity</code> attribute. All other attributes of Disambiguation were renamed to have the prefix "ta". Rewrote <a href="#textanalysis" shape="rect">defining section</a>. See for all changes <a href="https://www.w3.org/International/multilingualweb/lt/track/issues/68" shape="rect">issue-68</a>.</p></li><li><p>Added non-normative reference to ITS 1.0 <a title="
Internationalization Tag Set (ITS) Version 1.0
" href="#its10" shape="rect">[ITS 1.0]</a>.</p></li><li><p>Updated <a class="section-ref" href="#conversion-to-nif" shape="rect">Section 5.7: Conversion to NIF</a> and <a class="section-ref" href="#nif-backconversion" shape="rect">Appendix F: Conversion NIF2ITS</a> with new NIF URI (see <a href="https://www.w3.org/International/multilingualwb/lt/track/actions/460" shape="rect">action-460</a>), fragment identifiers (see <a href="https://www.w3.org/International/multilingualweb/lt/track/actions/458" shape="rect">action-458</a>) and ITS ontology predicates. Changes to be confirmed, see <a href="https://www.w3.org/International/multilingualweb/lt/track/actions/481" shape="rect">action-481</a>.</p></li></ol><p id="changelog-since-20121023">The following log records major changes that have been made to this document since the <a href="http://www.w3.org/TR/2012/WD-its20-20121023/" shape="rect">ITS 2.0 Working Draft 23 October 2012</a>.</p><ol class="depth1"><li><p>Clarified usage of <a href="#domain" shape="rect">Domain</a> data category in HTML in response to <a href="https://www.w3.org/International/multilingualweb/lt/track/issues/56" shape="rect">issue-56</a>.</p></li><li><p>Added the <a href="#lqissueDefs" shape="rect">enabled information</a> in <a class="section-ref" href="#lqissue" shape="rect">Section 8.17: Localization Quality Issue</a>.</p></li><li><p>Updated the Disambiguation data category.</p></li><li><p>Fine tuned the algorithm to compute the result values of the <a href="#domain" shape="rect">Domain</a> data category.</p></li><li><p>Fix on <a href="#EX-locQualityIssue-html5-local-2" shape="rect">Example 81</a>: <code>id</code> attribute of <code>script</code> element now the same as of containing --- /w3ccvs/WWW/International/multilingualweb/lt/drafts/its20/its20.odd 2013/04/06 09:32:35 1.374 +++ /w3ccvs/WWW/International/multilingualweb/lt/drafts/its20/its20.odd 2013/04/06 10:55:37 1.375 @@ -1623,8 +1623,8 @@ <head>Example of an HTML document with whitespace normalized as preparation for conversion to NIF</head> <eg><![CDATA[<html><body><h2 translate="yes">Welcome to <span - its-ta-ident-ref="http://dbpedia.org/resource/Dublin" - translate="no">Dublin</span> in <b translate="no">Ireland</b>!</h2></body></html>]]></eg> + its-ta-ident-ref="http://dbpedia.org/resource/Dublin" its-within-text="yes" + translate="no">Dublin</span> in <b translate="no" its-within-text="yes">Ireland</b>!</h2></body></html>]]></eg> </exemplum> <p xml:id="its2nif-algorithm">The conversion algorithm to generate NIF consists of seven steps.</p> @@ -1642,12 +1642,12 @@ </list> <eg rend="text"><![CDATA[@prefix itsrdf: <http://www.w3.org/2005/11/its/rdf#> . <http://example.com/exampledoc.html#xpath(x0)> - itsrdf:xpath2nif <http://example.com/exampledoc.html#offset_b0_e0> + itsrdf:xpath2nif <http://example.com/exampledoc.html#char=b0,e0> <http://example.com/exampledoc.html#xpath(x1)> - itsrdf:xpath2nif <http://example.com/exampledoc.html#offset_b1_e1> + itsrdf:xpath2nif <http://example.com/exampledoc.html#char=b1,e1> # ... <http://example.com/exampledoc.html#xpath(xn)> - itsrdf:xpath2nif <http://example.com/exampledoc.html#offset_bn_en> + itsrdf:xpath2nif <http://example.com/exampledoc.html#char=bn,en> <mappings> <mapping x="xpath(x0)" b="b0" e="e0" /> <mapping x="xpath(x1)" b="b1" e="e1" /> @@ -1667,22 +1667,22 @@ <eg rend="text"><![CDATA[@prefix itsrdf: <http://www.w3.org/2005/11/its/rdf#> . # "Welcome to " <http://example.com/exampledoc.html#xpath(/html/body[1]/h2[1]/text()[1])> - itsrdf:nif <http://example.com/exampledoc.html#offset_0_11> . + itsrdf:nif <http://example.com/exampledoc.html#char=0,11> . # "Dublin" <http://example.com/exampledoc.html#xpath(/html/body[1]/h2[1]/span[1]/text()[1])> - itsrdf:nif <http://example.com/exampledoc.html#offset_11_17> . + itsrdf:nif <http://example.com/exampledoc.html#char=11,17> . # " in " <http://example.com/exampledoc.html#xpath(/html/body[1]/h2[1]/text()[2])> - itsrdf:nif <http://example.com/exampledoc.html#offset_17_21> . + itsrdf:nif <http://example.com/exampledoc.html#char=17,21> . # "Ireland" <http://example.com/exampledoc.html#xpath(/html/body[1]/h2[1]/b[1]/text()[1])> - itsrdf:nif <http://example.com/exampledoc.html#offset_21_28> . + itsrdf:nif <http://example.com/exampledoc.html#char=21,28> . # "!" <http://example.com/exampledoc.html#xpath(/html/body[1]/h2[1]/text()[3])> - itsrdf:nif <http://example.com/exampledoc.html#offset_28_29> . + itsrdf:nif <http://example.com/exampledoc.html#char=28,29> . # "Welcome to Dublin Ireland!" <http://example.com/exampledoc.html#xpath(/html/body[1]/h2[1]/text())> - itsrdf:nif <http://example.com/exampledoc.html#offset_0_29> . + itsrdf:nif <http://example.com/exampledoc.html#char=0,29> . <mappings> <mapping x="xpath(/html/body[1]/h2[1]/text()[1])" b="0" e="11" /> <mapping x="xpath(/html/body[1]/h2[1]/span[1]/text()[1])" b="11" e="17" /> @@ -1700,24 +1700,24 @@ do not carry annotations, they will just bloat the data).</p></item> </list> <eg rend="text"><![CDATA[@prefix itsrdf: <http://www.w3.org/2005/11/its/rdf#> . -<http://example.com/exampledoc.html#offset_0_29> +<http://example.com/exampledoc.html#char=0,29> rdf:type str:Context ; rdf:type str:OffsetBasedString ; # concatenate the whole text str:isString "$(t0+t1+t2+...+tn)" ; - itsrdf:translate "yes"^^<http://www.w3.org/TR/its-2.0/its.xsd#yesOrNo> ; + itsrdf:translate "yes"; str:occursIn <http://example.com/exampledoc.html> . -<http://example.com/exampledoc.html#offset_11_17> +<http://example.com/exampledoc.html#char=11,17> rdf:type str:String ; rdf:type str:OffsetBasedString ; - itsrdf:translate "no"^^<http://www.w3.org/TR/its-2.0/its.xsd#yesOrNo> ; + itsrdf:translate "no"; itsrdf:taIdentRef <http://dbpedia.org/resource/Dublin> ; - str:referenceContext <http://example.com/exampledoc.html#offset_0_29> . -<http://example.com/exampledoc.html#offset_21_28> + str:referenceContext <http://example.com/exampledoc.html#char=0,29> . +<http://example.com/exampledoc.html#char=21,28> rdf:type str:String ; rdf:type str:OffsetBasedString ; - itsrdf:translate "no"^^<http://www.w3.org/TR/its-2.0/its.xsd#yesOrNo> ; - str:referenceContext <http://example.com/exampledoc.html#offset_0_29> . + itsrdf:translate "no"; + str:referenceContext <http://example.com/exampledoc.html#char=0,29> . ]]></eg> <p>A complete sample output in RDF/XML format after step 7, given the input document <ptr target="#EX-HTML-whitespace-normalization" type="exref"/>, is available at <ref @@ -5700,7 +5700,7 @@ target="https://github.com/dbpedia-spotlight/dbpedia-spotlight#readme">DBpedia Spotlight</ref>. For this example let's assume DBpedia Spotlight linked "Ireland" to DBpedia:</p> - <eg rend="text"><![CDATA[<http://example.com/exampledoc.html#offset_21_28> + <eg rend="text"><![CDATA[<http://example.com/exampledoc.html#char=21,28> rdf:type str:String ; itsrdf:taIdentRef <http://dbpedia.org/resource/Ireland> . <http://dbpedia.org/resource/Ireland> @@ -5727,9 +5727,9 @@ annotation to the parent element of the text node.</p> <eg rend="text"><![CDATA[# Based on: <http://example.com/exampledoc.html#xpath(/html/body[1]/h2[1]/b[1]/text()[1])> - itsrdf:nif <http://example.com/exampledoc.html#offset_21_28> . + itsrdf:nif <http://example.com/exampledoc.html#char=21,28> . # and: -<http://example.com/exampledoc.html#offset_21_28> +<http://example.com/exampledoc.html#char=21,28> itsrdf:taIdentRef <http://dbpedia.org/resource/Ireland> . # we can attach the metadata to the parent node: <b its-ta-ident-ref="http://dbpedia.org/resource/Dublin” @@ -5749,11 +5749,11 @@ # ITS2NIF <http://example.com/exampledoc.html#xpath(/html/body[1]/h2[1]/text()[1])> - itsrdf:nif <http://example.com/exampledoc.html#offset_0_29> + itsrdf:nif <http://example.com/exampledoc.html#char=0,29> # DBpedia Spotlight returns: -<http://example.com/exampledoc.html#offset_21_28> +<http://example.com/exampledoc.html#char=21,28> itsrdf:taIdentRef <http://dbpedia.org/resource/Ireland> . # NIF2ITS @@ -6101,6 +6101,7 @@ >issue-83</ref>.</item> <item>Renamed Disambiguation data category to Text Analysis. Removed <code>disambigGranularity</code> attribute. All other attributes of Disambiguation were renamed to have the prefix <val>ta</val>. Rewrote <ref target="#textanalysis">defining section</ref>. See for all changes <ref target="https://www.w3.org/International/multilingualweb/lt/track/issues/68">issue-68</ref>.</item> <item>Added non-normative reference to ITS 1.0 <ptr target="#its10" type="bibref"/>.</item> + <item>Updated <ptr target="#conversion-to-nif" type="specref"/> and <ptr target="#nif-backconversion" type="specref"/> with new NIF URI (see <ref target="https://www.w3.org/International/multilingualweb/lt/track/actions/460">action-460</ref>), fragment identifiers (see <ref target="https://www.w3.org/International/multilingualweb/lt/track/actions/458">action-458</ref>) and ITS ontology predicates. Changes to be confirmed, see <ref target="https://www.w3.org/International/multilingualweb/lt/track/actions/481">action-481</ref>.</item> </list> <p xml:id="changelog-since-20121023">The following log records major changes that have been made to this document since the <ref target="http://www.w3.org/TR/2012/WD-its20-20121023/"
Received on Saturday, 6 April 2013 10:55:41 UTC