- From: Felix Sasaki <fsasaki@w3.org>
- Date: Tue, 2 Oct 2012 10:16:27 +0200
- To: public-multilingualweb-lt@w3.org
- Message-ID: <CAL58czq5Ze+JpR8-6JdZQf6tH9pS_bSUpczKsKCvXyiT1WdA8w@mail.gmail.com>
Hi all,
I updated the qaissue example to use XML in the script element; see
http://www.w3.org/International/multilingualweb/lt/drafts/its20/its20.html#EX-locQualityIssue-html5-local-2
The standoff metadata is now in a dedicated "script" element. See also
http://www.w3.org/International/multilingualweb/lt/drafts/its20/examples/html5/EX-locQualityIssue-html5-local-2.html
http://www.w3.org/International/multilingualweb/lt/drafts/its20/examples/html5/qaissues.js
So this works, but I have a question for the implementors who use HTML5 as
input for processing outside the browser.
If you process
http://www.w3.org/International/multilingualweb/lt/drafts/its20/examples/html5/EX-locQualityIssue-html5-local-2.html
with the validator.nu HTML5 parser, the content of "script" is not "seen"
as XML; it is treated as plain text. The output is then
<html xmlns="http://www.w3.org/1999/xhtml">...
<script type="application/xml" id="its-standoff-1">
<its:locQualityIssues xml:id="lq1" xmlns:its="http://www.w3.org/2005/11/its">
<its:locQualityIssue
locQualityIssueType="misspelling"
locQualityIssueComment="'c'es' is unknown. Could be 'c'est'"
locQualityIssueSeverity="50"/>
<its:locQualityIssue
locQualityIssueType="typographical"
locQualityIssueComment="Sentence without capitalization"
locQualityIssueSeverity="30"/>
</its:locQualityIssues>
</script>...</html>
So if we had an XML-based tool that wants to pick up the ITS
standoff information, it wouldn't work.
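
A possible workaround for tools outside the browser, as a minimal sketch
only: parse the HTML, collect the text content of each "script" element
with type="application/xml", and hand that text to an XML parser in a
second step. The sketch uses Python's standard html.parser rather than the
validator.nu parser. The names StandoffCollector, extract_issues and SAMPLE
are made up for illustration, and SAMPLE is a shortened stand-in for the
example file, not the file itself.

```python
from html.parser import HTMLParser
import xml.etree.ElementTree as ET

ITS_NS = "http://www.w3.org/2005/11/its"

# Shortened stand-in for the example document (assumption, not the real file).
SAMPLE = """<html xmlns="http://www.w3.org/1999/xhtml"><head>
<script type="application/xml" id="its-standoff-1">
<its:locQualityIssues xml:id="lq1" xmlns:its="http://www.w3.org/2005/11/its">
<its:locQualityIssue
 locQualityIssueType="misspelling"
 locQualityIssueComment="'c'es' is unknown. Could be 'c'est'"
 locQualityIssueSeverity="50"/>
<its:locQualityIssue
 locQualityIssueType="typographical"
 locQualityIssueComment="Sentence without capitalization"
 locQualityIssueSeverity="30"/>
</its:locQualityIssues>
</script></head><body><p>...</p></body></html>"""


class StandoffCollector(HTMLParser):
    """Collect the raw text of every <script type="application/xml">."""

    def __init__(self):
        super().__init__()
        self._chunks = None  # text chunks while inside a matching script
        self.blocks = []     # one string per matching script element

    def handle_starttag(self, tag, attrs):
        if tag == "script" and dict(attrs).get("type") == "application/xml":
            self._chunks = []

    def handle_data(self, data):
        # Script content arrives as plain text, possibly in several chunks.
        if self._chunks is not None:
            self._chunks.append(data)

    def handle_endtag(self, tag):
        if tag == "script" and self._chunks is not None:
            self.blocks.append("".join(self._chunks))
            self._chunks = None


def extract_issues(html_text):
    """Return the attributes of each its:locQualityIssue as a dict."""
    collector = StandoffCollector()
    collector.feed(html_text)
    collector.close()
    issues = []
    for block in collector.blocks:
        # Second step: parse the collected script text as XML.
        root = ET.fromstring(block.strip())
        for issue in root.iter("{%s}locQualityIssue" % ITS_NS):
            issues.append(dict(issue.attrib))
    return issues


if __name__ == "__main__":
    for issue in extract_issues(SAMPLE):
        print(issue["locQualityIssueType"], issue["locQualityIssueSeverity"])
```

The point of the two steps is that the HTML parser only ever delivers the
script content as text, so the XML parsing has to be done explicitly by the
tool itself.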
Currently, Linguaserve is using this approach
https://www.w3.org/International/multilingualweb/lt/wiki/LSP_Localization_Chain_Side_Use_Case_Demonstration
to embed ITS rules into an HTML file. I had hoped that the "script" element
would be an alternative. Is it?
I'm sure this is not a difficult problem, but we probably need some
guidance for implementors who are not used to processing HTML5.
Felix
--
Felix Sasaki
DFKI / W3C Fellow
Received on Tuesday, 2 October 2012 08:16:51 UTC