- From: Leroy Finn <finnle@tcd.ie>
- Date: Fri, 18 Jan 2013 10:28:14 +0000
- To: Pablo Nieto Caride <pablo.nieto@linguaserve.com>
- Cc: Felix Sasaki <fsasaki@w3.org>, Ankit Srivastava <asrivastava@computing.dcu.ie>, Multilingual Web LT-TESTS Public <public-multilingualweb-lt-tests@w3.org>
- Message-ID: <CAMYWBwsWUF5G6sds-5ta30JpHAX7uDcWP09m2=Uc3_zLfbyexA@mail.gmail.com>
I ran the language information tests for HTML and I am getting the incorrect output also. Thanks for catching this Marcis. I will have to correct my parser for language information and i better double check the XML outputs also. I will post up the correct output by the end of the day. Thanks, Leroy On 18 January 2013 09:56, Pablo Nieto Caride <pablo.nieto@linguaserve.com>wrote: > Hi Felix,**** > > ** ** > > Yes I definitely thought about this issue when I was implementing the data > category, but then I came to realize that since the value of “Global > adding of information” is NO then the rules cannot add the lang attribute > to the elements, then the inheritance is another matter, I thought that the > reason why it isn’t shown on the output was because the lang is not ITS but > a HTML5 attribute. Anyway according to this thread I assume I was wrong and > I need to change the implementation, don’t I?**** > > ** ** > > Cheers,**** > > Pablo.**** > > >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>** ** > > Hi Pablo and Ankit, > > just FYI & in case you may have missed this thread - you as "language > information" implementers may need to change some test files, see below. > > Best, > > Felix**** > > > > -------- Original-Nachricht -------- **** > > *Betreff: * > > RE: [LT-Web] Test Suite output**** > > *Weitersenden-Datum: * > > Fri, 18 Jan 2013 04:08:33 +0000**** > > *Weitersenden-Von: * > > public-multilingualweb-lt-tests@w3.org**** > > *Datum: * > > Thu, 17 Jan 2013 21:07:54 -0700**** > > *Von: * > > Yves Savourel <ysavourel@enlaso.com> <ysavourel@enlaso.com>**** > > *An: * > > 'Felix Sasaki' <fsasaki@w3.org> <fsasaki@w3.org>, 'Multilingual Web > LT-TESTS Public' <public-multilingualweb-lt-tests@w3.org><public-multilingualweb-lt-tests@w3.org> > **** > > *Kopie (CC): * > > 'Mārcis Pinnis' <marcis.pinnis@Tilde.lv> <marcis.pinnis@Tilde.lv>, > 'Pēteris Ņikiforovs' <peteris.nikiforovs@Tilde.lv><peteris.nikiforovs@Tilde.lv>, > 'Andis Lagzdiņš' <andis.lagzdins@Tilde.lv> <andis.lagzdins@Tilde.lv>**** > > > > > **** > > Hi Mārcis,**** > > **** > > I’ve checked our code and we had a bug in our implementation: the handler > for the local HTML’s lang was not defined.**** > > So I’m now getting the three attached files for HTML.**** > > **** > > There may be still another problem in our code as we still get the same > output for html1.**** > > **** > > I think that test needs to be redone because the way it is: we don’t know > if the result is generated by the local lang or by the global rule.**** > > (and the global rules are a bit weird too: the first one serves no purpose) > **** > > **** > > I’ll try to find some time to look more into it tomorrow. Hopefully others > can double/triple check the output.**** > > **** > > cheers,**** > > -yves**** > > **** > > **** > > **** > > *From:* Felix Sasaki [mailto:fsasaki@w3.org <fsasaki@w3.org>] > *Sent:* Thursday, January 17, 2013 11:49 AM > *To:* Multilingual Web LT-TESTS Public > *Cc:* Mārcis Pinnis; Pēteris Ņikiforovs; Andis Lagzdiņš > *Subject:* [LT-Web] Test Suite output**** > > **** > > **** > > Forwarded on behalf of Mārcis (there is a subscription issue for the tests > list) and putting Mārcis and his colleagues in CC. Implementers, please > have a look at Mārcis' issue. > > Thanks, > > Felix > > -------- Original-Nachricht -------- **** > > *Betreff: ***** > > [Moderator Action] [LT-Web] Test Suite output**** > > *Datum: ***** > > Thu, 17 Jan 2013 14:47:39 +0000**** > > *Von: ***** > > Mārcis Pinnis <marcis.pinnis@Tilde.lv> <marcis.pinnis@Tilde.lv>**** > > *An: ***** > > Multilingual Web LT-TESTS Public LT-TESTS Public > <public-multilingualweb-lt-tests@w3.org><public-multilingualweb-lt-tests@w3.org> > **** > > *Kopie (CC): ***** > > Felix Sasaki <fsasaki@w3.org> <fsasaki@w3.org>, Pēteris Ņikiforovs > <peteris.nikiforovs@Tilde.lv> <peteris.nikiforovs@Tilde.lv>, Andis > Lagzdiņš <andis.lagzdins@Tilde.lv> <andis.lagzdins@Tilde.lv>**** > > > > > > **** > > Hi everyone,**** > > **** > > We (in Tilde) are working through the Test Suite and for Language > Information we find the expected results not to represent what is said in > http://www.w3.org/TR/its20/#language-information and > http://www.w3.org/TR/its20/#datacategories-defaults-etc.**** > > **** > > The first question:**** > > **** > > The input in HTML example 1 is as follows:**** > > <!DOCTYPE html>**** > > <html lang="en">**** > > <head>**** > > <meta charset=utf-8>**** > > <link href="languageinfo1htmlrules.xml" rel="its-rules"/>**** > > <title>EXAMPLE</title>**** > > </head>**** > > <body>**** > > <p>The motto of Québec is:**** > > <q>Je me souviens</q>**** > > .</p>**** > > <p>La devise du Québec est :**** > > <q lang="fr-CA">Je me souviens</q>**** > > .</p>**** > > </body>**** > > </html>**** > > **** > > The acompanying rules file defines the following:**** > > <its:rules xmlns:its="http://www.w3.org/2005/11/its"<http://www.w3.org/2005/11/its>xmlns:h= > "http://www.w3.org/1999/xhtml" <http://www.w3.org/1999/xhtml>version="2.0"> > **** > > <its:langRule selector="/h:*" langPointer="@lang"/>**** > > <its:langRule selector="//h:*" langPointer="@lang"/>**** > > </its:rules>**** > > **** > > The expected output is:**** > > /html lang="en"**** > > /html/@lang lang="en"**** > > /html/head[1]**** > > /html/head[1]/meta[1]**** > > /html/head[1]/meta[1]/@charset**** > > /html/head[1]/link[1]**** > > /html/head[1]/link[1]/@href**** > > /html/head[1]/link[1]/@rel**** > > /html/head[1]/title[1]**** > > /html/body[1]**** > > /html/body[1]/p[1]**** > > /html/body[1]/p[1]/q[1]**** > > /html/body[1]/p[2]**** > > /html/body[1]/p[2]/q[1] lang="fr-CA"**** > > /html/body[1]/p[2]/q[1]/@lang lang="fr-CA"**** > > **** > > However, our parser produces the following (and we tend to believe that > this is correct!):**** > > /html lang="en"**** > > /html/@lang lang="en"**** > > /html/head[1] lang="en"**** > > /html/head[1]/meta[1] lang="en"**** > > /html/head[1]/meta[1]/@charset lang="en"**** > > /html/head[1]/link[1] lang="en"**** > > /html/head[1]/link[1]/@href lang="en"**** > > /html/head[1]/link[1]/@rel lang="en"**** > > /html/head[1]/title[1] lang="en"**** > > /html/body[1] lang="en"**** > > /html/body[1]/p[1] lang="en"**** > > /html/body[1]/p[1]/q[1] lang="en"**** > > /html/body[1]/p[2] lang="en"**** > > /html/body[1]/p[2]/q[1] lang="fr-CA"**** > > /html/body[1]/p[2]/q[1]/@lang lang="fr-CA"**** > > **** > > I marked the difference in red.**** > > **** > > The language information data category specifies the following inheritance > rules:**** > > „Textual content of element, *including* attributes and child elements”*** > * > > **** > > This (as I understand it) means that everything within <html> is in > English except the second <q> tag.**** > > **** > > This issue is in all three HTML language information examples in the Test > Suite.**** > > **** > > A second question:**** > > I find confusing the definition and parsing of the other two language > information examples.**** > > **** > > I understand that language information is added at a global level, > however, the ITS 2.0 reference says:**** > > Locally users are able to use xml:lang (which is defined by XML), or langin HTML, or an attribute specific to the format in question (as in Example > 51 <http://www.w3.org/TR/its20/#EX-lang-definition-1>).**** > > **** > > After reading this sentence I have the understanding that the „lang” > attribute is equally important for language information parsing than global > rules (regardless of where they point).**** > > **** > > That being said, is the expected output correct in the second and third > examples (the first lang =”en” fragment was ignored in the expected output)? > **** > > **** > > Best regards,**** > > Mārcis ;o)**** > > **** > > **** > > ** ** > > ** ** >
Received on Friday, 18 January 2013 10:28:42 UTC