W3C home > Mailing lists > Public > public-multilingualweb-lt-tests@w3.org > December 2012

RE: Must and must not statements >Domain

From: Yves Savourel <ysavourel@enlaso.com>
Date: Fri, 14 Dec 2012 11:48:19 -0700
To: "'Multilingual Web LT-TESTS Public LT-TESTS Public'" <public-multilingualweb-lt-tests@w3.org>
Message-ID: <assp.0695c1a1b3.assp.069502cbcf.00d201cdda2b$95f06120$c1d12360$@com>
It’s a good question Pablo.

First Does it follow the current algorithm?
I think it does (but a second opinion is certainly needed)
We say that we are case-insensitive only when doing the mapping, but the values keep their cases. that was a requirement for Lucy I think.

Now, we also say we remove the duplicates.
I think those two are duplicates not since the values are apparently case sensitive for Domain.

So I think the output corresponds to the algorithm.

But it's definitely a case where there is room for debate on whether the algorithm should allow that or not.


From: Pablo Nieto Caride [mailto:pablo.nieto@linguaserve.com] 
Sent: Friday, December 14, 2012 11:40 AM
To: Yves Savourel; 'Multilingual Web LT-TESTS Public LT-TESTS Public'
Subject: RE: Must and must not statements >Domain

Thank very much Yves, I was looking forward to seeing an example like this, in fact I was discussing with Mauricio today how to implement in HTML several domain entries. Just one consideration the output is:
domains="LAW, Judicial Matters, Sport, Law"

Would it be correct to have repeated values even if they are in different cases?


I’ve attached a test case that covers:

- embedded rules in HTML
- several domain entries
- multi-words values
- case preservation for non-mapped values
- non-case-sensitivity for mapping, but preservation of case for mapped value
- parameter

I think the resulting string follows the algorithm. But double-checking is very needed.
I didn’t put the files in GitHub as this may need fine-tuning.


From: Leroy Finn [mailto:finnle@tcd.ie] 
Sent: Friday, December 14, 2012 6:30 AM
To: Yves Savourel
Cc: Multilingual Web LT-TESTS Public LT-TESTS Public
Subject: Re: Must and must not statements >Domain

If you can create one test files that can cover this that would be great. Otherwise I can do it.


On 14 December 2012 12:23, Yves Savourel <ysavourel@enlaso.com> wrote:
Hi Mauricio, Leroy, all,
+1. and the same on the HTML side: it seems several things are not covered:
a)  converting from multi-words domain using the mapping
b)  using multi-values domains in the input
c)  case preservation and case-insensitive mapping
d)  there is also no coverage in HTML for embedded global rules.
I can try to propose one case that covers all this.
From: Mauricio del Olmo [mailto:mauricio.delolmo@linguaserve.com] 
Sent: Friday, December 14, 2012 4:01 AM
To: 'Tadej Štajner'; milank@moraviaworldwide.com
Cc: 'Multilingual Web LT-TESTS Public LT-TESTS Public'
Subject: Must and must not statements >Domain
I’m not directly appointed to do this but, since one of the first data categories I’m implementing is Domain, I’d like to contribute on this.
The “Ownership of rfc2119 statements” table says for Domain:
An optional domainMapping attribute that contains a comma separated list of mappings between values in the content and consumer tool specific values. The left part of the pair corresponds to the source content and is unique within the mapping and case-insensitive. The right part of the mapping belongs to the consumer tool. Several left parts can map to a single right part. The values in the left or the right part of the mapping may contain spaces; in that case they MUST be delimited by quotation marks, that is pairs of APOSTROPHE (U+0027) or QUOTATION MARK (U+0022).
We have several test suite files that include mapping values between apostrophes, but none of them applies to the output.
Maybe we could make a slightly change to one of the test suite samples (domain6xml.xml or domain7xml.xml):
<meta name="DC.type" content="literature"/> -> <meta name="DC.type" content="Classical literature"/>
The mapping is: domainMapping="'Classical literature' english"
And the output would be:
/text/body[1]/p[1]        domains="english"
/text/body[1]/p[1]/span[1]       domains="english"
/text/body[1]/p[1]/span[2]       domains="english"
Is this enough to advance or we have to create a new test file from scratch?
We could be picky and say that the content is not really about English classical literature just about literature:
A poem is a composition written in verse (although verse has also been used for epic and dramatic fiction).
Poems make use of the aesthetic qualities of language to suggest differential meanings and to evoke emotive responses. Poems rely heavily on imagery and metaphor; they may have a rhythmic structure based on patterns of stresses (metric feet) or on patterns of different-length syllables (as in classical prosody); and they may or may not utilize rhyme.
Mauricio del Olmo Martínez
Dpto. Técnico/I+D+i
Linguaserve Internacionalización de Servicios, S.A.
Tel.: +34 91 761 64 60 ext. 0421
Fax: +34 91 542 89 28 
E-mail: tecnico@linguaserve.com
«En cumplimiento con lo previsto con los artículos 21 y 22 de la Ley 34/2002, de 11 de julio, de Servicios de la Sociedad de Información y Comercio Electrónico, le informamos que procederemos al archivo y tratamiento de sus datos exclusivamente con fines de promoción de los productos y servicios ofrecidos por LINGUASERVE INTERNACIONALIZACIÓN DE SERVICIOS, S.A. En caso de que Vdes. no deseen que procedamos al archivo y tratamiento de los datos proporcionados, o no deseen recibir comunicaciones comerciales sobre los productos y servicios ofrecidos, comuníquenoslo a clients@linguaserve.com, y su petición será inmediatamente cumplida.»
"According to the provisions set forth in articles 21 and 22 of Law 34/2002 of July 11 regarding Information Society and eCommerce Services, we will store and use your personal data with the sole purpose of marketing the products and services offered by LINGUASERVE INTERNACIONALIZACIÓN DE SERVICIOS, S.A. If you do not wish your personal data to be stored and handled, or you do not wish to receive further information regarding products and services offered by our company, please e-mail us to clients@linguaserve.com. Your request will be processed immediately.”
-----Mensaje original-----
De: Felix Sasaki [mailto:fsasaki@w3.org] 
Enviado el: jueves, 13 de diciembre de 2012 12:39
Para: Dominic Jones
CC: Multilingual Web LT-TESTS Public LT-TESTS Public; Multilingual Web LT Public List
Asunto: Re: Important: Achieving Test Suite M2 (17/12/12)
Thank you very much, Dom. Let me also emphasize the importance of this mailstone. As written in the mail on Monday http://lists.w3.org/Archives/Public/public-multilingualweb-lt/2012Dec/0055.html
we need contributions related to tests from many implementors, and M2 can only be reached if the related tests are available. So please make the time or let us know how we can help you to move forward.
Thanks again for all your contributions, and looking very much forward to Monday,
Am 13.12.12 12:32, schrieb Dominic Jones:
> (M2: 17/12/12 Implementers have run at least one global and one local 
> test file, for each of their data categories. One XML and one HTML as 
> well. (Partial))
> Dear Implementors.
> I appreciate all your effort and hard work in beginning to process example inputs and providing partially conformant output. I wanted to remind you of our first major milestone, which is this coming Monday the 17th. Over the coming days I'd ask you to mark up the data categories you have started working on, those that can be classed as partially conformant, in the google doc:
> https://docs.google.com/spreadsheet/ccc?key=0AgIk0-aoSKOadG5HQmJDT2Eyb
> WVvVC1VbnF5alN2S3c#gid=0
> I am hesitant to let these milestones slip, but of course am aware you are under many constraints. Please let me know sooner rather than later if you will be having issues meeting this milestone. Remember we're only looking for one data feature (in XML and HTML) from each of the categories you have committed to testing, not the whole category!
> Many thanks.
> Dom.
> --
> Dominic Jones | Research Assistant
> KDEG, Trinity College Dublin, Ireland.
Received on Friday, 14 December 2012 18:48:52 UTC

This archive was generated by hypermail 2.4.0 : Friday, 17 January 2020 16:32:56 UTC