- From: CVS User fsasaki <cvsmail@w3.org>
- Date: Fri, 06 Sep 2013 10:06:48 +0000
- To: public-multilingualweb-lt-commits@w3.org
Update of /w3ccvs/WWW/International/multilingualweb/lt/drafts/its20 In directory gil:/tmp/cvs-serv754 Modified Files: its20-implementation-report.html its20.html its20.odd Log Message: edits for issue-131 (tbc) --- /w3ccvs/WWW/International/multilingualweb/lt/drafts/its20/its20-implementation-report.html 2013/09/06 09:31:19 1.16 +++ /w3ccvs/WWW/International/multilingualweb/lt/drafts/its20/its20-implementation-report.html 2013/09/06 10:06:47 1.17 @@ -179,9 +179,9 @@ </head> <body> <h1>ITS 2.0 Implementation Report</h1> - <p>Version generated: 2013-09-06T11:30:02.485+02:00</p> - <p>This document is the implementation report for the <a href="http://www.w3.org/International/multilingualweb/lt/">MultilingualWeb-LT</a> Working Group's <a href="http://www.w3.org/TR/2013/WD-its20-20130521/">Internationalization - Tag Set 2.0 21 May 2013 Last Call Working Draft</a>. The report contains + <p>Version generated: 2013-09-06T11:33:59.764+02:00</p> + <p>This document is the implementation report for the <a href="http://www.w3.org/International/multilingualweb/lt/">MultilingualWeb-LT</a> Working Group's <a href="http://www.w3.org/TR/2013/WD-its20-20130820/">Internationalization + Tag Set 2.0 (20 August 2013 Last Call Working Draft)</a>. The report contains the following sections:</p> <ul> <li class="toc"> --- /w3ccvs/WWW/International/multilingualweb/lt/drafts/its20/its20.html 2013/09/05 16:08:23 1.492 +++ /w3ccvs/WWW/International/multilingualweb/lt/drafts/its20/its20.html 2013/09/06 10:06:47 1.493 @@ -5613,13 +5613,13 @@ is in XML (the <code>mappings</code> element).</p></li></ul><div class="exampleInner"><div class="exampleOuter"><pre xml:space="preserve"># Turtle example: @prefix nif: <http://persistence.uni-leipzig.org/nlp2rdf/ontologies/nif-core#> . @prefix itsrdf: <http://www.w3.org/2005/11/its/rdf#> . -<http://example.com/exampledoc.html#char=b0,e0> - nif:wasConvertedFrom <http://example.com/exampledoc.html#xpath(x0)> . -<http://example.com/exampledoc.html#char=b1,e1> - nif:wasConvertedFrom <http://example.com/exampledoc.html#xpath(x1)> . +<http://example.com/myitsservice?informat=html&intype=url&input=http://example.com/doc.html&char=b0,e0> + nif:wasConvertedFrom <http://example.com/myitsservice?informat=html&intype=url&input=http://example.com/doc.html&xpath=x0> . +<http://example.com/myitsservice?informat=html&intype=url&input=http://example.com/doc.html&char=b1,e1> + nif:wasConvertedFrom <http://example.com/myitsservice?informat=html&intype=url&input=http://example.com/doc.html&xpath=x1> . # ... -<http://example.com/exampledoc.html#char=bn,en> - nif:wasConvertedFrom <http://example.com/exampledoc.html#xpath(xn)> . +<http://example.com/myitsservice?informat=html&intype=url&input=http://example.com/doc.html&char=bn,en> + nif:wasConvertedFrom <http://example.com/myitsservice?informat=html&intype=url&input=http://example.com/doc.html&xpath=xn> . <!-- XML Example --> <mappings> <mapping x="xpath(x0)" b="b0" e="e0" /> @@ -5638,29 +5638,29 @@ @prefix nif: <http://persistence.uni-leipzig.org/nlp2rdf/ontologies/nif-core#> . @prefix itsrdf: <http://www.w3.org/2005/11/its/rdf#> . # "Welcome to " -<http://example.com/exampledoc.html#char=0,11> +<http://example.com/myitsservice?informat=html&intype=url&input=http://example.com/doc.html&char=0,11> nif:wasConvertedFrom - <http://example.com/exampledoc.html#xpath(/html/body[1]/h2[1]/text()[1])>. + <http://example.com/myitsservice?informat=html&intype=url&input=http://example.com/doc.html&xpath=/html/body[1]/h2[1]/text()[1]>. # "Dublin" -<http://example.com/exampledoc.html#char=11,17> +<http://example.com/myitsservice?informat=html&intype=url&input=http://example.com/doc.html&char=11,17> nif:wasConvertedFrom - <http://example.com/exampledoc.html#xpath(/html/body[1]/h2[1]/span[1]/text()[1])>. + <http://example.com/myitsservice?informat=html&intype=url&input=http://example.com/doc.html&xpath=/html/body[1]/h2[1]/span[1]/text()[1]>. # " in " -<http://example.com/exampledoc.html#char=17,21> +<http://example.com/myitsservice?informat=html&intype=url&input=http://example.com/doc.html&char=17,21> nif:wasConvertedFrom - <http://example.com/exampledoc.html#xpath(/html/body[1]/h2[1]/text()[2])> . + <http://example.com/myitsservice?informat=html&intype=url&input=http://example.com/doc.html&xpath=/html/body[1]/h2[1]/text()[2]> . # "Ireland" -<http://example.com/exampledoc.html#char=21,28> +<http://example.com/myitsservice?informat=html&intype=url&input=http://example.com/doc.html&char=21,28> nif:wasConvertedFrom - <http://example.com/exampledoc.html#xpath(/html/body[1]/h2[1]/b[1]/text()[1])> . + <http://example.com/myitsservice?informat=html&intype=url&input=http://example.com/doc.html&xpath=/html/body[1]/h2[1]/b[1]/text()[1]> . # "!" -<http://example.com/exampledoc.html#char=28,29> +<http://example.com/myitsservice?informat=html&intype=url&input=http://example.com/doc.html&char=28,29> nif:wasConvertedFrom - <http://example.com/exampledoc.html#xpath(/html/body[1]/h2[1]/text()[3])> . + <http://example.com/myitsservice?informat=html&intype=url&input=http://example.com/doc.html&xpath=/html/body[1]/h2[1]/text()[3]> . # "Welcome to Dublin Ireland!" -<http://example.com/exampledoc.html#char=0,29> +<http://example.com/myitsservice?informat=html&intype=url&input=http://example.com/doc.html&char=0,29> nif:wasConvertedFrom - <http://example.com/exampledoc.html#xpath(/html/body[1]/h2[1]/text())> . + <http://example.com/myitsservice?informat=html&intype=url&input=http://example.com/doc.html&xpath=/html/body[1]/h2[1]/text()> . <!-- XML Example --> <mappings> <mapping x="xpath(/html/body[1]/h2[1]/text()[1])" b="0" e="11" /> @@ -5673,7 +5673,7 @@ whole concatenated text <code>$(t0+t1+t2+...+tn)</code> of the document as reference.</p></li><li><p id="its2nif-algorithm-step6">STEP 6: Attach any ITS metadata annotations from the XML/HTML/DOM input to the respective NIF URIs.</p></li><li><p id="its2nif-algorithm-step7">STEP 7: Omit all URIs that do not carry annotations (to avoid bloating the data).</p></li></ul><div class="exampleInner"><div class="exampleOuter"><pre xml:space="preserve">@prefix itsrdf: <http://www.w3.org/2005/11/its/rdf#> . @prefix nif: <http://persistence.uni-leipzig.org/nlp2rdf/ontologies/nif-core#> -<http://example.com/exampledoc.html#char=0,29> +<http://example.com/myitsservice?informat=html&intype=url&input=http://example.com/doc.html&char=0,29> rdf:type nif:Context ; rdf:type nif:RFC5147String ; # concatenate the whole text @@ -5682,34 +5682,52 @@ nif:endIndex "29" ; itsrdf:translate "yes"; nif:sourceUrl <http://example.com/exampledoc.html> . -<http://example.com/exampledoc.html#char=11,17> +<http://example.com/myitsservice?informat=html&intype=url&input=http://example.com/doc.html&char=11,17> rdf:type nif:RFC5147String ; nif:beginIndex "11" ; nif:endIndex "17" ; itsrdf:translate "no"; itsrdf:taIdentRef <http://dbpedia.org/resource/Dublin> ; - nif:referenceContext <http://example.com/exampledoc.html#char=0,29> . -<http://example.com/exampledoc.html#char=21,28> + nif:referenceContext <http://example.com/myitsservice?informat=html&intype=url&input=http://example.com/doc.html&char=0,29> . +<http://example.com/myitsservice?informat=html&intype=url&input=http://example.com/doc.html&char=21,28> rdf:type nif:RFC5147String ; nif:beginIndex "21" ; nif:endIndex "28" ; itsrdf:translate "no"; - nif:referenceContext <http://example.com/exampledoc.html#char=0,29> . + nif:referenceContext <http://example.com/myitsservice?informat=html&intype=url&input=http://example.com/doc.html&char=0,29> . </pre></div></div><p>A complete sample output in RDF/XML format after step 7, given the input document <a href="#EX-HTML-whitespace-normalization" shape="rect">Example 97</a>, is available at <a href="examples/nif/EX-nif-conversion-output.xml" shape="rect">examples/nif/EX-nif-conversion-output.xml</a>.</p><div class="note"><p class="prefix"><b>Note:</b></p><p>The conversion to NIF is a possible basis for a natural language processing (NLP) application that creates, for example, named entity annotations. A non-normative algorithm to integrate these annotations into the original input document is given in <a class="section-ref" href="#nif-backconversion" shape="rect">Appendix G: Conversion NIF2ITS</a>. Many decisions to be made in this algorithm - depend on the particular NLP application being used.</p></div></div><div class="div1"> + depend on the particular NLP application being used.</p></div><div class="note"><p class="prefix"><b>Note:</b></p><p>NIF allows URL for a String resource to be referenced as URIs + that are fragments of the original document in the form:<br clear="none"/><code>http://example.com/myitsservice?informat=html&intype=url&input=http://example.com/doc.html#char=0,11</code> + <br clear="none"/>or + <br clear="none"/><code>http://example.com/doc.html#xpath(/html/body[1]/h2[1]/text()[1])</code> + <br clear="none"/> + + This offers a convenient mechanism for linking NIF resources in RDF back + to the original document. RDF treats URIs as opaque and does not impose + any semantic constraints on the used fragment identifiers, thus enabling + their usage in RDF in a consistent manner. However, fragment identifiers + get interpreted according to the retrieved mime type, if a retrieval + action occurs as is the case in Linked Data. The char fragment is + defined currently only for text/plain while the xpath fragment is not + defined for HTML. Therefore this URL recipe does fulfil the ITS + requirements to support both XML and HTML and the aim of this mapping to + produce resources adhering to the Linked Data principle of + dereferenceablility. The future definition and registration of these + fragment types, while a potentially attractive feature, is beyond the + scope of this specification.</p></div></div><div class="div1"> <h2><a href="#contents" shape="rect"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents."/></a><a name="nif-backconversion" id="nif-backconversion" shape="rect"/>G Conversion NIF2ITS (Non-Normative)</h2><p>The following algorithm relies on <a href="#EX-HTML-whitespace-normalization" shape="rect">Example 97</a>. It is assumed that the example has been converted to NIF, leading to the <a href="examples/nif/EX-nif-conversion-output.xml" shape="rect">output</a> exemplified for the <a href="#its2nif-algorithm" shape="rect">ITS2NIF conversion algorithm</a>.</p><p>This example uses <a href="https://github.com/dbpedia-spotlight/dbpedia-spotlight" shape="rect">DBpedia Spotlight</a> as an example natural language processing (NLP) tool. In it, DBpedia - Spotlight linked "Ireland" to DBpedia:</p><div class="exampleInner"><div class="exampleOuter"><pre xml:space="preserve"><http://example.com/exampledoc.html#char=21,28> + Spotlight linked "Ireland" to DBpedia:</p><div class="exampleInner"><div class="exampleOuter"><pre xml:space="preserve"><http://example.com/myitsservice?informat=html&intype=url&input=http://example.com/doc.html&char=21,28> rdf:type nif:RFC5147String; itsrdf:taIdentRef <http://dbpedia.org/resource/Ireland> . <http://dbpedia.org/resource/Ireland> rdf:type <http:/nerd.eurecom.fr/ontology#Country> .</pre></div></div><p id="nif2its-algorithm">The conversion algorithm to generate ITS out of NIF consists of two steps:</p><ul><li><p id="nif2its-algorithm-step1">STEP 1: NIF Web services accept two different types of input. It is possible to either send the extracted text (the object of the <code>nif:isString</code> property) directly or NIF RDF to the NLP tool, i.e. the text is sent as a <code>nif:Context</code> node and included as <code>nif:isString</code>. Either way, the output of the Web service will be a NIF representation. - </p><p>Accepting text will be the minimal requirement of a NIF web service. Ideally, you would be able to send the <code>nif:Context</code> node with the isString as RDF directly, which has the advantage, that all other annotations can be used by the NLP tool:</p><div class="exampleInner"><div class="exampleOuter"><pre xml:space="preserve"><strong class="hl-tag" style="color: #000096"><http:</strong>//example.com/exampledoc.html#char=0,29> + </p><p>Accepting text will be the minimal requirement of a NIF web service. Ideally, you would be able to send the <code>nif:Context</code> node with the isString as RDF directly, which has the advantage, that all other annotations can be used by the NLP tool:</p><div class="exampleInner"><div class="exampleOuter"><pre xml:space="preserve"><strong class="hl-tag" style="color: #000096"><http:</strong>//example.com/myitsservice?informat=html&intype=url&input=http://example.com/doc.html&char=0,29> rdf:type nif:RFC5147String ; rdf:type nif:Context ; nif:beginIndex "0" ; @@ -5718,11 +5736,11 @@ after <a href="#its2nif-algorithm-step7" shape="rect">step 7</a> of the ITS2NIF algorithm) to reintegrate annotations in the original ITS annotated document.</p></li></ul><p>For step 2, three cases can occur.</p><p>CASE 1: The NLP annotation created in NIF matches the text node. Solution: Attach the annotation to the parent element of the text node.</p><div class="exampleInner"><div class="exampleOuter"><pre xml:space="preserve"># Based on: -<http://example.com/exampledoc.html#char=21,28> +<http://example.com/myitsservice?informat=html&intype=url&input=http://example.com/doc.html&char=21,28> nif:wasConvertedFrom - <http://example.com/exampledoc.html#xpath(/html/body[1]/h2[1]/b[1]/text()[1])> . + <http://example.com/myitsservice?informat=html&intype=url&input=http://example.com/doc.html&xpath=/html/body[1]/h2[1]/b[1]/text()[1]> . # and: -<http://example.com/exampledoc.html#char=21,28> +<http://example.com/myitsservice?informat=html&intype=url&input=http://example.com/doc.html&char=21,28> itsrdf:taIdentRef <http://dbpedia.org/resource/Ireland> . # we can attach the metadata to the parent node: <b its-ta-ident-ref="http://dbpedia.org/resource/Dublin" @@ -5739,13 +5757,13 @@ # ITS2NIF -<http://example.com/exampledoc.html#char=0,29> +<http://example.com/myitsservice?informat=html&intype=url&input=http://example.com/doc.html&char=0,29> nif:wasConvertedFrom - <http://example.com/exampledoc.html#xpath(/html/body[1]/h2[1]/text()[1])> . + <http://example.com/myitsservice?informat=html&intype=url&input=http://example.com/doc.html&xpath=/html/body[1]/h2[1]/text()[1]> . # DBpedia Spotlight returns: -<http://example.com/exampledoc.html#char=21,28> +<http://example.com/myitsservice?informat=html&intype=url&input=http://example.com/doc.html&char=21,28> itsrdf:taIdentRef <http://dbpedia.org/resource/Ireland> . # NIF2ITS @@ -5917,7 +5935,7 @@ <code class="its-attr-markup">lineBreakType</code>?</td><td rowspan="1" colspan="1"> <code class="its-attr-markup">its-storage-size</code>, <code class="its-attr-markup">its-storage-encoding</code>?, <code class="its-attr-markup">lits-line-break-type</code>?</td></tr></tbody></table></div><div class="div1"> -<h2><a href="#contents" shape="rect"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents."/></a><a name="revisionlog" id="revisionlog" shape="rect"/>J Revision Log (Non-Normative)</h2><p id="changelog-since-20130820">The following log records major changes that have been made to this document since the <a href="http://www.w3.org/TR/2013/WD-its20-20130820/" shape="rect">ITS 2.0 Working Draft 20 August 2013</a>:</p><ol class="depth1"><li><p>Updated reference to HTML5, see related <a href="http://lists.w3.org/Archives/Public/public-multilingualweb-lt/2013Sep/0004.html" shape="rect">mail thread</a>.</p></li><li><p>Edits in <a class="section-ref" href="#lqissue" shape="rect">Section 8.16: Localization Quality Issue</a> and <a class="section-ref" href="#lqissue-typevalues" shape="rect">Appendix C: Values for the Localization Quality Issue Type</a> (plus a new informative <a class="section-ref" href="#localization-quality-guiance" shape="rect">Appendix H: Localization Quality Guidance</a>) to clarify the <a href="#lqissue" shape="rect">Localization Quality Issue</a> data category, see <a href="https://www.w3.org/International/multilingualweb/lt/track/issues/132" shape="rect">issue-132</a>.</p></li></ol><p id="changelog-since-20130521">The following log records major changes that have been made to this document since the <a href="http://www.w3.org/TR/2013/WD-its20-20130521/" shape="rect">ITS 2.0 Working Draft 21 May 2013</a>:</p><ol class="depth1"><li><p>Updated all text of the <a href="#html5-withintext-handling" shape="rect">HTML5 defaults for Element Within Text</a> and added example. +<h2><a href="#contents" shape="rect"><img src="images/topOfPage.gif" align="right" height="26" width="26" title="Go to the table of contents." alt="Go to the table of contents."/></a><a name="revisionlog" id="revisionlog" shape="rect"/>J Revision Log (Non-Normative)</h2><p id="changelog-since-20130820">The following log records major changes that have been made to this document since the <a href="http://www.w3.org/TR/2013/WD-its20-20130820/" shape="rect">ITS 2.0 Working Draft 20 August 2013</a>:</p><ol class="depth1"><li><p>Updated reference to HTML5, see related <a href="http://lists.w3.org/Archives/Public/public-multilingualweb-lt/2013Sep/0004.html" shape="rect">mail thread</a>.</p></li><li><p>Edits in <a class="section-ref" href="#lqissue" shape="rect">Section 8.16: Localization Quality Issue</a> and <a class="section-ref" href="#lqissue-typevalues" shape="rect">Appendix C: Values for the Localization Quality Issue Type</a> (plus a new informative <a class="section-ref" href="#localization-quality-guiance" shape="rect">Appendix H: Localization Quality Guidance</a>) to clarify the <a href="#lqissue" shape="rect">Localization Quality Issue</a> data category, see <a href="https://www.w3.org/International/multilingualweb/lt/track/issues/132" shape="rect">issue-132</a>.</p></li><li><p>Edits in <a class="section-ref" href="#conversion-to-nif" shape="rect">Appendix F: Conversion to NIF</a> and <a class="section-ref" href="#nif-backconversion" shape="rect">Appendix G: Conversion NIF2ITS</a> to resolve <a href="https://www.w3.org/International/multilingualweb/lt/track/issues/131" shape="rect">issue-131</a>.</p></li></ol><p id="changelog-since-20130521">The following log records major changes that have been made to this document since the <a href="http://www.w3.org/TR/2013/WD-its20-20130521/" shape="rect">ITS 2.0 Working Draft 21 May 2013</a>:</p><ol class="depth1"><li><p>Updated all text of the <a href="#html5-withintext-handling" shape="rect">HTML5 defaults for Element Within Text</a> and added example. See <a href="https://www.w3.org/International/multilingualweb/lt/track/issues/118" shape="rect">issue-118</a> and <a href="https://www.w3.org/International/multilingualweb/lt/track/actions/532" shape="rect">action-532</a>.</p></li><li><p>Added a paragraph about mime type submission to <a class="section-ref" href="#its-mime-type" shape="rect">Appendix B: Internationalization Tag Set (ITS) MIME Type</a>, see step 3 (first bullet point) at <a href="http://www.w3.org/2002/06/registering-mediatype#Planned" shape="rect">Register an Internet Media Type for a W3C Spec</a> and <a href="https://www.w3.org/International/multilingualweb/lt/track/actions/251" shape="rect">action-251</a>.</p></li><li><p>Removed company names from various examples, see <a href="http://www.w3.org/Search/Mail/Public/search?type-index=public-multilingualweb-lt-commits&index-type=t&keywords=action-502&search=Search" shape="rect">CVS commits</a> and <a href="https://www.w3.org/International/multilingualweb/lt/track/actions/502" shape="rect">action-502</a>.</p></li><li><p>Reformatting of various examples, see <a href="http://lsts.w3.org/Archives/Public/public-multilingualweb-lt-commits/2013May/0066.html" shape="rect">CVS commit mail</a> and further CVS commit mails with the same send time.</p></li><li><p>Edits related to <a title="" href="#nif-reference" shape="rect">[NIF]</a>, see <a href="https://www.w3.org/International/multilingualweb/lt/track/issues/125" shape="rect">issue-125</a>.</p></li><li><p>Edit related to <a href="https://www.w3.org/International/multilingualweb/lt/track/actions/527" shape="rect">action-527</a>: put the sentence about foreign elements / attributes also in <a class="section-ref" href="#datacategories-defaults-etc" shape="rect">Section 8.1: Position, Defaults, Inheritance, and Overriding of Data Categories</a>.</p></li><li><p>Copy editing of spec, see <a href="https://www.w3.org/International/multilingualweb/lt/track/actions/422" shape="rect">action-422</a>.</p></li><li><p>Edits related to <a href="https://www.w3.org/International/multilingualweb/lt/track/issues/126" shape="rect">issue-126</a> (Minor ssue with quality types listing).</p></li><li><p>Edits related to <a href="https://www.w3.org/International/multilingualweb/lt/track/issues/127" shape="rect">issue-127</a> (Clarifying HTML5 translate and global rules).</p></li><li><p>Edits related to <a href="https://www.w3.org/International/multilingualweb/lt/track/issues/128" shape="rect">issue-128</a> (Various editorial edits + conformance section fixes).</p></li><li><p>Edits related to <a href="https://www.w3.org/International/multilingualweb/lt/track/issues/129" shape="rect">issue-129</a> (sec1-2 editing (non-normative sec)).</p></li><li><p>Fixes about <a title="" href="#rfc2119" shape="rect">[RFC 2119]</a> statements, see <a href="https://www.w3.org/International/multilingualweb/lt/track/actions/540" shape="rect">action-540</a>.</p></li><li><p>Various checks: spelling (using U.S. English), style, grammar, use of ":" etc.</p></li><li><p>Added a non-normative XML Schema for ITS 2.0 to <a class="section-ref" href="#its-schemas" shape="rect">Appendix D: chemas for ITS</a>, see <a href="https://www.w3.org/International/multilingualweb/lt/track/actions/546" shape="rect">action-546</a>.</p></li><li><p>Updated link to quality issue type mappings in <a class="section-ref" href="#lqissue-typevalues" shape="rect">Appendix C: Values for the Localization Quality Issue Type</a>, see <a href="https://www.w3.org/International/multilingualweb/lt/track/actions/543" shape="rect">action-543</a>.</p></li><li><p>Clarify conflicts of <code class="its-elem-markup">param</code> elements with the same name in <a class="section-ref" href="#selection-precedence" shape="rect">Section 5.5: Precedence between Selections</a> and <a class="section-ref" href="#html5-selection-precedence" shape="rect">Section 6.4: Precedence between Selections</a>, see <a href="https://www.w3.org/International/multilingualweb/lt/track/issues/130" shape="rect">issue-130</a>.</p></li><li><p>Updated editors list.</p></li><li><p>Implemented <a href="http://lists.w3.org/Archives/Public/public-i18n-its-ig/013Jul/0041.html" shape="rect">editorial edits</a> in <a class="section-ref" href="#idvalue" shape="rect">Section 8.14: ID Value</a> and <a class="section-ref" href="#locNote-datacat" shape="rect">Section 8.3: Localization Note</a>.</p></li><li><p>Implemented <a href="http://lists.w3.org/Archives/Public/public-multilingualweb-lt-comments/2013Jul/0056.html" shape="rect">clarification</a> about <code class="its-attr-markup">annotatorsRef</code> attribute in <a class="section-ref" href="#its-tool-annotation" shape="rect">Section 5.7: ITS Tools Annotation</a>.</p></li><li><p>Implemented a <a href="http://lists.w3.org/Archives/Public/public-multilingualweb-lt/2013Jul/0037.html" shape="rect">clarification</a> in <a class="section-ref" href="#mtconfidence-definition" shape="rect">Section 8.18.1: Definition</a> and added <a href="#mt-confidence-score-generation-tools" shape="rect">a note</a> that <a href="#mtconfidence" shape="rect">MT Confidence</a> information can be produced both by MT systems and other tool.</p></li><li><p>Changed the <a href="#conversion-to-nif" shape="rect">conversion to NIF</a> to be a non-normative feature.</p></li></ol><p id="changelog-since-20130411">The following log records major changes that have been made to this document since the <a href="http://www.w3.org/TR/2013/WD-its20-20130411/" shape="rect">ITS 2.0 Working Draft 11 April 2013</a>:</p><ol class="depth1"><li><p>Added a reference to the <a href="http://www.w3.org/2005/11/its/rdf#" shape="rect">ITS RDF Ontology</a> and an <a href="#its-rdf-ontology-status" shape="rect">explanatory note</a> about its status to <a class="section-ref" href="#conversion-to-nif" shape="rect">Appendix F: Conversion to NIF</a>, see <a href="https://www.w3.org/International/multilingualweb/lt/track/actions/514" shape="rect">action-514</a>.</p></li><li><p>Updated <a class="section-ref" href="#conversion-to-nif" shape="rect">Appendix F: Conversion to NIF</a> to reflect <a href="http://www.w3.org/2013/05/08-mlw-lt-minutes#item04" shape="rect">MLW-LT May 013 f2f discussion</a>: <code>nif:occursIn</code> has changed to <code>nif:sourceUrl</code>, and <code>nif:convertedFrom</code> replaces <code>itsrdf:xpath2nif</code>. See <a href="https://www.w3.org/International/multilingualweb/lt/track/actions/517" shape="rect">action-517</a>.</p></li><li><p>Added <a href="#local-approach-not-applicable-to-attributes" shape="rect">a note</a> to <a class="section-ref" href="#basic-concepts-selection-local" shape="rect">Section 2.2.1: Local Approach</a> expressing that local selection does not apply to attributes, see <a href="https://www.w3.org/International/multilingualweb/lt/track/issues/98" shape="rect">issue-98</a>.</p></li><li><p>Added a <a href="#qa-issue-types-mappings" shape="rect">clarification</a> about the role of mappings from tools to quality issue types, --- /w3ccvs/WWW/International/multilingualweb/lt/drafts/its20/its20.odd 2013/09/05 16:08:24 1.507 +++ /w3ccvs/WWW/International/multilingualweb/lt/drafts/its20/its20.odd 2013/09/06 10:06:48 1.508 @@ -5567,13 +5567,13 @@ <eg rend="text"><![CDATA[# Turtle example: @prefix nif: <http://persistence.uni-leipzig.org/nlp2rdf/ontologies/nif-core#> . @prefix itsrdf: <http://www.w3.org/2005/11/its/rdf#> . -<http://example.com/exampledoc.html#char=b0,e0> - nif:wasConvertedFrom <http://example.com/exampledoc.html#xpath(x0)> . -<http://example.com/exampledoc.html#char=b1,e1> - nif:wasConvertedFrom <http://example.com/exampledoc.html#xpath(x1)> . +<http://example.com/myitsservice?informat=html&intype=url&input=http://example.com/doc.html&char=b0,e0> + nif:wasConvertedFrom <http://example.com/myitsservice?informat=html&intype=url&input=http://example.com/doc.html&xpath=x0> . +<http://example.com/myitsservice?informat=html&intype=url&input=http://example.com/doc.html&char=b1,e1> + nif:wasConvertedFrom <http://example.com/myitsservice?informat=html&intype=url&input=http://example.com/doc.html&xpath=x1> . # ... -<http://example.com/exampledoc.html#char=bn,en> - nif:wasConvertedFrom <http://example.com/exampledoc.html#xpath(xn)> . +<http://example.com/myitsservice?informat=html&intype=url&input=http://example.com/doc.html&char=bn,en> + nif:wasConvertedFrom <http://example.com/myitsservice?informat=html&intype=url&input=http://example.com/doc.html&xpath=xn> . <!-- XML Example --> <mappings> <mapping x="xpath(x0)" b="b0" e="e0" /> @@ -5596,29 +5596,29 @@ @prefix nif: <http://persistence.uni-leipzig.org/nlp2rdf/ontologies/nif-core#> . @prefix itsrdf: <http://www.w3.org/2005/11/its/rdf#> . # "Welcome to " -<http://example.com/exampledoc.html#char=0,11> +<http://example.com/myitsservice?informat=html&intype=url&input=http://example.com/doc.html&char=0,11> nif:wasConvertedFrom - <http://example.com/exampledoc.html#xpath(/html/body[1]/h2[1]/text()[1])>. + <http://example.com/myitsservice?informat=html&intype=url&input=http://example.com/doc.html&xpath=/html/body[1]/h2[1]/text()[1]>. # "Dublin" -<http://example.com/exampledoc.html#char=11,17> +<http://example.com/myitsservice?informat=html&intype=url&input=http://example.com/doc.html&char=11,17> nif:wasConvertedFrom - <http://example.com/exampledoc.html#xpath(/html/body[1]/h2[1]/span[1]/text()[1])>. + <http://example.com/myitsservice?informat=html&intype=url&input=http://example.com/doc.html&xpath=/html/body[1]/h2[1]/span[1]/text()[1]>. # " in " -<http://example.com/exampledoc.html#char=17,21> +<http://example.com/myitsservice?informat=html&intype=url&input=http://example.com/doc.html&char=17,21> nif:wasConvertedFrom - <http://example.com/exampledoc.html#xpath(/html/body[1]/h2[1]/text()[2])> . + <http://example.com/myitsservice?informat=html&intype=url&input=http://example.com/doc.html&xpath=/html/body[1]/h2[1]/text()[2]> . # "Ireland" -<http://example.com/exampledoc.html#char=21,28> +<http://example.com/myitsservice?informat=html&intype=url&input=http://example.com/doc.html&char=21,28> nif:wasConvertedFrom - <http://example.com/exampledoc.html#xpath(/html/body[1]/h2[1]/b[1]/text()[1])> . + <http://example.com/myitsservice?informat=html&intype=url&input=http://example.com/doc.html&xpath=/html/body[1]/h2[1]/b[1]/text()[1]> . # "!" -<http://example.com/exampledoc.html#char=28,29> +<http://example.com/myitsservice?informat=html&intype=url&input=http://example.com/doc.html&char=28,29> nif:wasConvertedFrom - <http://example.com/exampledoc.html#xpath(/html/body[1]/h2[1]/text()[3])> . + <http://example.com/myitsservice?informat=html&intype=url&input=http://example.com/doc.html&xpath=/html/body[1]/h2[1]/text()[3]> . # "Welcome to Dublin Ireland!" -<http://example.com/exampledoc.html#char=0,29> +<http://example.com/myitsservice?informat=html&intype=url&input=http://example.com/doc.html&char=0,29> nif:wasConvertedFrom - <http://example.com/exampledoc.html#xpath(/html/body[1]/h2[1]/text())> . + <http://example.com/myitsservice?informat=html&intype=url&input=http://example.com/doc.html&xpath=/html/body[1]/h2[1]/text()> . <!-- XML Example --> <mappings> <mapping x="xpath(/html/body[1]/h2[1]/text()[1])" b="0" e="11" /> @@ -5637,7 +5637,7 @@ </list> <eg rend="text"><![CDATA[@prefix itsrdf: <http://www.w3.org/2005/11/its/rdf#> . @prefix nif: <http://persistence.uni-leipzig.org/nlp2rdf/ontologies/nif-core#> -<http://example.com/exampledoc.html#char=0,29> +<http://example.com/myitsservice?informat=html&intype=url&input=http://example.com/doc.html&char=0,29> rdf:type nif:Context ; rdf:type nif:RFC5147String ; # concatenate the whole text @@ -5646,19 +5646,19 @@ nif:endIndex "29" ; itsrdf:translate "yes"; nif:sourceUrl <http://example.com/exampledoc.html> . -<http://example.com/exampledoc.html#char=11,17> +<http://example.com/myitsservice?informat=html&intype=url&input=http://example.com/doc.html&char=11,17> rdf:type nif:RFC5147String ; nif:beginIndex "11" ; nif:endIndex "17" ; itsrdf:translate "no"; itsrdf:taIdentRef <http://dbpedia.org/resource/Dublin> ; - nif:referenceContext <http://example.com/exampledoc.html#char=0,29> . -<http://example.com/exampledoc.html#char=21,28> + nif:referenceContext <http://example.com/myitsservice?informat=html&intype=url&input=http://example.com/doc.html&char=0,29> . +<http://example.com/myitsservice?informat=html&intype=url&input=http://example.com/doc.html&char=21,28> rdf:type nif:RFC5147String ; nif:beginIndex "21" ; nif:endIndex "28" ; itsrdf:translate "no"; - nif:referenceContext <http://example.com/exampledoc.html#char=0,29> . + nif:referenceContext <http://example.com/myitsservice?informat=html&intype=url&input=http://example.com/doc.html&char=0,29> . ]]></eg> <p>A complete sample output in RDF/XML format after step 7, given the input document <ptr target="#EX-HTML-whitespace-normalization" type="exref"/>, is available at <ref @@ -5669,6 +5669,27 @@ integrate these annotations into the original input document is given in <ptr target="#nif-backconversion" type="specref"/>. Many decisions to be made in this algorithm depend on the particular NLP application being used.</p></note> + <note> + <p>NIF allows URL for a String resource to be referenced as URIs + that are fragments of the original document in the form:<?br?> + <code>http://example.com/myitsservice?informat=html&intype=url&input=http://example.com/doc.html#char=0,11</code> + <?br?>or + <?br?><code>http://example.com/doc.html#xpath(/html/body[1]/h2[1]/text()[1])</code><?br?> + + This offers a convenient mechanism for linking NIF resources in RDF back + to the original document. RDF treats URIs as opaque and does not impose + any semantic constraints on the used fragment identifiers, thus enabling + their usage in RDF in a consistent manner. However, fragment identifiers + get interpreted according to the retrieved mime type, if a retrieval + action occurs as is the case in Linked Data. The char fragment is + defined currently only for text/plain while the xpath fragment is not + defined for HTML. Therefore this URL recipe does fulfil the ITS + requirements to support both XML and HTML and the aim of this mapping to + produce resources adhering to the Linked Data principle of + dereferenceablility. The future definition and registration of these + fragment types, while a potentially attractive feature, is beyond the + scope of this specification.</p> + </note> </div> <div xml:id="nif-backconversion" type="inform"> <head>Conversion NIF2ITS</head> @@ -5681,7 +5702,7 @@ target="https://github.com/dbpedia-spotlight/dbpedia-spotlight">DBpedia Spotlight</ref> as an example natural language processing (NLP) tool. In it, DBpedia Spotlight linked "Ireland" to DBpedia:</p> - <eg rend="text"><![CDATA[<http://example.com/exampledoc.html#char=21,28> + <eg rend="text"><![CDATA[<http://example.com/myitsservice?informat=html&intype=url&input=http://example.com/doc.html&char=21,28> rdf:type nif:RFC5147String; itsrdf:taIdentRef <http://dbpedia.org/resource/Ireland> . <http://dbpedia.org/resource/Ireland> @@ -5693,7 +5714,7 @@ <p xml:id="nif2its-algorithm-step1">STEP 1: NIF Web services accept two different types of input. It is possible to either send the extracted text (the object of the <code>nif:isString</code> property) directly or NIF RDF to the NLP tool, i.e. the text is sent as a <code>nif:Context</code> node and included as <code>nif:isString</code>. Either way, the output of the Web service will be a NIF representation. </p> <p>Accepting text will be the minimal requirement of a NIF web service. Ideally, you would be able to send the <code>nif:Context</code> node with the isString as RDF directly, which has the advantage, that all other annotations can be used by the NLP tool:</p> - <eg><![CDATA[<http://example.com/exampledoc.html#char=0,29> + <eg><![CDATA[<http://example.com/myitsservice?informat=html&intype=url&input=http://example.com/doc.html&char=0,29> rdf:type nif:RFC5147String ; rdf:type nif:Context ; nif:beginIndex "0" ; @@ -5710,11 +5731,11 @@ <p>CASE 1: The NLP annotation created in NIF matches the text node. Solution: Attach the annotation to the parent element of the text node.</p> <eg rend="text"><![CDATA[# Based on: -<http://example.com/exampledoc.html#char=21,28> +<http://example.com/myitsservice?informat=html&intype=url&input=http://example.com/doc.html&char=21,28> nif:wasConvertedFrom - <http://example.com/exampledoc.html#xpath(/html/body[1]/h2[1]/b[1]/text()[1])> . + <http://example.com/myitsservice?informat=html&intype=url&input=http://example.com/doc.html&xpath=/html/body[1]/h2[1]/b[1]/text()[1]> . # and: -<http://example.com/exampledoc.html#char=21,28> +<http://example.com/myitsservice?informat=html&intype=url&input=http://example.com/doc.html&char=21,28> itsrdf:taIdentRef <http://dbpedia.org/resource/Ireland> . # we can attach the metadata to the parent node: <b its-ta-ident-ref="http://dbpedia.org/resource/Dublin" @@ -5733,13 +5754,13 @@ # ITS2NIF -<http://example.com/exampledoc.html#char=0,29> +<http://example.com/myitsservice?informat=html&intype=url&input=http://example.com/doc.html&char=0,29> nif:wasConvertedFrom - <http://example.com/exampledoc.html#xpath(/html/body[1]/h2[1]/text()[1])> . + <http://example.com/myitsservice?informat=html&intype=url&input=http://example.com/doc.html&xpath=/html/body[1]/h2[1]/text()[1]> . # DBpedia Spotlight returns: -<http://example.com/exampledoc.html#char=21,28> +<http://example.com/myitsservice?informat=html&intype=url&input=http://example.com/doc.html&char=21,28> itsrdf:taIdentRef <http://dbpedia.org/resource/Ireland> . # NIF2ITS @@ -5944,6 +5965,7 @@ <list type="ordered"> <item>Updated reference to HTML5, see related <ref target="http://lists.w3.org/Archives/Public/public-multilingualweb-lt/2013Sep/0004.html">mail thread</ref>.</item> <item>Edits in <ptr target="#lqissue" type="specref"/> and <ptr target="#lqissue-typevalues" type="specref"/> (plus a new informative <ptr target="#localization-quality-guidance" type="specref"/>) to clarify the <ref target="#lqissue">Localization Quality Issue</ref> data category, see <ref target="https://www.w3.org/International/multilingualweb/lt/track/issues/132">issue-132</ref>.</item> + <item>Edits in <ptr target="#conversion-to-nif" type="specref"/> and <ptr target="#nif-backconversion" type="specref"/> to resolve <ref target="https://www.w3.org/International/multilingualweb/lt/track/issues/131">issue-131</ref>.</item> </list> <p xml:id="changelog-since-20130521">The following log records major changes that have been made to this document since the <ref target="http://www.w3.org/TR/2013/WD-its20-20130521/">ITS 2.0 Working Draft 21 May 2013</ref>:</p> <list type="ordered">
Received on Friday, 6 September 2013 10:06:51 UTC