MultilingualWeb-LT working group: question about CURIEs and their use outside RDFa.

Hi all, 

In the context of the MultilingualWeb-LT Working Group, we specify the successor of ITS 1.0, where not only XML documents may be annotated with i18n data categories, but also HTML5. 

Some new data categories are introduced and their value could be any URI. 
Thus we naturally thought about Compacted URIs to limit the verbosity of annotated documents. 

I have three questions about the use of CURIEs outside the RDFa context:

Question 1:
In HTML4, HTML5 or XHTML5, to be able to correctly process the SafeCURIEorCURIEorIRI and SafeCURIEorCURIEorIRIs datatypes, I understand we need to re-use the @vocab and the @prefix attributes as defined in RDFa 1.1. Are these two attributes already in the HTML5 dtd/schema, or would we need to write an "Extension to the HTML5 Syntax" section like in [1]

> . Extensions to the HTML5 Syntax
>
> There are a few attributes that are added as extensions to the HTML5 syntax in order to fully support ITS 2.0:
>
> If HTML+ITS document conformance is desired, the two RDFa attributes @vocab and @prefix and valid values (including CURIEs), as described in RDFa Lite 1.1, Section 2: The Attributes, must be allowed and validate as conforming when used in an HTML4, HTML5 or XHTML5 document. 
> If HTML+ITS document conformance is desired, all ITS attributes as listed in ITS 2.0, Section XXX, must be allowed and validate as conforming when used in an HTML4, HTML5 or XHTML5 document. 


Question 2:
A first draft of the section about the Use of CURIEs datatype in ITS 2.0 is the following. Please correct/comment/rephrase at will:

-----
Section: Use of CURIEs Datatype in ITS 2.0

Some of the data categories need their values to be a URI or a set of URIs, but these are usually long and unwieldy. RDFa supports a mechanism by which IRIs can be abbreviated, called 'compact URI expressions' or simply, CURIEs. (See section 6. CURIE Syntax Definition http://www.w3.org/TR/rdfa-core/#s_curies).

The following datatypes are used: 
1. SafeCURIEorCURIEorIRI http://www.w3.org/TR/rdfa-core/#dfn-safecurieorcurieoriri : 
  - in XML: @its:semanticNetworkRef, @its:meaningRef, @its:termLexiconRef, @its:termInfoRef, @its:entityRef, @its:entityTypeRef, @its:externalPlaceholderRef, @its:qualityProfileRef
  - in HTML: @its-semantic-network-ref, @its-meaning-ref, @its-term-lexicon-ref, @its-term-info-ref, @its-entity-ref, @its-entity-type-ref, @its-external-placeholder-ref, @its-quality-profile-ref
2. SafeCURIEorCURIEorIRIs http://www.w3.org/TR/rdfa-core/#dfn-safecurieorcurieoriris : 
  - in XML: @its:domainRef
  - in HTML: @its-domain-ref

For (X)HTML5, Only two RDFa-core attributes are needed to enable the drastic lightening of the content of the concerned data categories :
1. the @prefix attribute http://www.w3.org/TR/rdfa-core/#A-prefix
2. the @vocab attribute http://www.w3.org/TR/rdfa-core/#A-vocab

These attributes MAY or MUST be replaced in XML and XHTML as follows:
1. In XML, the @vocab attribute MUST be replaced in favor of the @xmlns or @xmlns:base attribute, in accordance with http://www.w3.org/TR/rdfa-core/#PS-default-vocabulary
2. In XHTML, the @vocab attribute MAY be replaced in favor of the @xmlns or @xmlns:base attribute, in accordance with http://www.w3.org/TR/rdfa-core/#PS-default-vocabulary
3. In XML, the @prefix attribute MUST be replaced in favor of multiple  xmlns:someprefix attributes, in accordance with http://www.w3.org/TR/rdfa-core/#PS-IRI-mappings
4. In XHTML, the @prefix attribute MAY be replaced in favor of multiple  xmlns:someprefix attributes, in accordance with http://www.w3.org/TR/rdfa-core/#PS-IRI-mappings

An implementation of the concerned datacategories MUST process CURIE (expand them) and IRI values as specified in RDFa-core recommendation, section 7.4 - CURIE and IRI processing http://www.w3.org/TR/rdfa-core/#s_curieprocessing.

-----

Question4:
Is there anything else we need to take care of so that everything goes smoothly ?


[1] http://dev.w3.org/html5/rdfa/#extensions-to-the-html5-syntax
[2] http://www.w3.org/TR/rdfa-core/#PS-default-vocabulary
[3] http://www.w3.org/TR/rdfa-core/#PS-IRI-mappings


Kind regards, 

Maxime Lefrançois 
Ph.D. Student, INRIA - WIMMICS Team 
http://maxime-lefrancois.info 
@Max_Lefrancois 

Received on Thursday, 21 June 2012 13:34:25 UTC