action-374 test cases distribution from Felix Sasaki on 2012-12-18 (public-multilingualweb-lt@w3.org from December 2012)

From: Felix Sasaki <fsasaki@w3.org>
Date: Tue, 18 Dec 2012 15:31:57 +0100
To: "public-multilingualweb-lt@w3.org" <public-multilingualweb-lt@w3.org>
Message-ID: <50D07E5D.80106@w3.org>
Hi all,

I went through the rfc 2119 list, and below are some items that need at 
least a clarification or a test case.

I put names people in the list below. If you don't want you name here or 
have questions please let me know. I will keep the action item open and 
come back to it on the 7 January call. That is, please be prepared to 
give an update on "your" test cases during that call. If that's too 
early for you please let me know.

For some of the below MUST / MUST NOT statements the actual test suite 
might do. For others you may want to create negative or positive tests, 
e.g. no. 29 / 35 / 41.


- I clarified no. 16 "mutal exclusive attributes", now coveredy by the 
schema after closing issue-59.


- No. 26 "Values of attributes which corresponds to data categories with 
a predefined set of values MUST be matched case-insensitively." We had 
discussed this before, see this thread
http://lists.w3.org/Archives/Public/public-multilingualweb-lt/2012Dec/0026.html
I think we would need a test case for this or several ones, e.g. ITS 2.0 
implementations should process HTML, e.g. "span" like
<span ... its-loc-note-type="aLerT">...
identical to
<span ... its-loc-note-type="alert">
Thoughts and volunteers?


- No. 28 " Inline global rules MUST be specified inside script which has 
type attribute with the value application/its+xml. The script element 
itself SHOULD be child of head element. Comments MUST NOT be used inside 
global rules. Each script element MUST NOT contain more than one rules 
element.": covered by
http://lists.w3.org/Archives/Public/public-multilingualweb-lt-tests/2012Dec/0143.html 
, I think


- No. 29 " Any node selected by the terminology data category with the 
termConfidence attribute specified MUST be contained in an element with 
the annotatorsRef (or in HTML its-annotators-ref) attribute specified 
for the Terminology data category. See for more information.": Philip


- No. 31 " An optional domainMapping attribute that contains a comma 
separated list of mappings between values in the content and consumer 
tool specific values. The left part of the pair corresponds to the 
source content and is unique within the mapping and case-insensitive. 
The right part of the mapping belongs to the consumer tool. Several left 
parts can map to a single right part. The values in the left or the 
right part of the mapping may contain spaces; in that case they MUST be 
delimited by quotation marks, that is pairs of APOSTROPHE (U+0027) or 
QUOTATION MARK (U+0022).": Ankit


- No. 35 " Any node selected by the disambiguation data category with 
the disambigConfidence attribute specified MUST be contained in an 
element with the annotatorsRef (or in HTML its-annotators-ref) attribute 
specified for the disambiguation data category. ": Tadej


- No. 36 "Implementations MUST NOT combine lists of language ranges from 
multiple rules or local attributes.": Not needed IMO


- No. 38 " A required idValue attribute. It contains an XPath expression 
which constructs a string corresponding to the identifier of the node to 
which this rule applies. The identifier MUST be unique at least within 
the document. If the attribute xml:id is present or id in HTML for the 
selected node, the value of the xml:id attribute or id in HTML MUST take 
precedence over the idValue value.": Yves


- No. 41 " Any node selected by the MT Confidence data category MUST be 
contained in an element with the annotatorsRef (or in HTML, 
its-annotators-ref) attribute specified for the MT Confidence data 
category. For more information, see. ": Ankit


- No 43 + 44 " A storageEncoding attribute. It contains the name of the 
character set encoding used to calculate the number of bytes of the 
selected text. The name MUST be one of the names or aliases listed in 
the IANA Character Sets registry . The default value is UTF-8.": Karl


- We have various tests related to quality: 45, 46, 47, 48. I put Arle 
and Phil here. Phil had a question about these during the Monday call. 
Here is proposal how one might test these statements.

No. 45 "T he values listed in the following table are allowed for 
locQualityIssueType. The values a tool implementing the data category 
produces for the attribute MUST match one of the values provided in this 
table and MUST be semantically accurate. If a tool can map its internal 
values to these types it MUST do so and MUST NOT use the value other, 
which is reserved strictly for values that cannot be mapped to these 
values.": a test might be negative, e.g.:
- 1) have a manual annotated ITS 2.0 file with the correct error type 
like "terminology"
- 2) have a "fake" output of a tool with the wrong error type
- say in the test description that a tool should consume 1), but raise 
an error about the mismatch between 1) and 2)

No. 46 " This value MUST NOT be used for simple typographical errors or 
word choice not related to defined terminologies. For example, a 
mistyping of "pin" as "pen" or the use of "imply" instead of "infer" 
(mistaking two commonly confused words) would not count as terminology 
issues and should be categorized as either spelling errors or 
mistranslations, depending on the nature of the issue. Terminology 
refers only to cases where incorrect choices about terms (either formal 
or commonly defined in a domain) are involved.": again a negative test: 
having mistyping in an input file, then 1) a correct annotion 2) an 
incorrect annotation as "type", and say that 2) should not be processed 
by the tool.

- No. 47, 48: negative tests like 45 / 46.


Best,

Felix
Received on Tuesday, 18 December 2012 14:32:29 UTC