W3C home > Mailing lists > Public > public-rdfa-wg@w3.org > March 2011

Re: Review of latest RDFa Core 1.1

From: Shane McCarron <shane@aptest.com>
Date: Thu, 10 Mar 2011 09:38:34 -0600
Message-ID: <4D78F07A.2030309@aptest.com>
To: Ivan Herman <ivan@w3.org>
CC: Gregg Kellogg <gregg@kellogg-assoc.com>, RDFa WG <public-rdfa-wg@w3.org>
During the meeting today the group discussed your concern about Host 
Language discovery.  The group agreed to add the following text:

<p class="note">A conforming RDFa Processor MAY use additional 
mechanisms (e.g., the DOCTYPE, a file extension, the root element) to 
attempt to determine the Host Language if the media type is 
unavailable.  These mechanisms are unspecified.</p>


On 3/7/2011 3:17 AM, Ivan Herman wrote:
> My implementation looks at the file extension if no media type is available. Actually, I am not really sure about these things; but most of the media types include suggested file extensions. Because we are talking about a very very small number of media types that we have to recognize (strictly speaking, we have to recognize HTML5, XHTML and that is it; if we want to be nice, one would recognize SVG and, if Toby really writes the note, Atom, too), this seems to be quite enough in practice. I do not do sniffing on doc type.
>
> Ivan
>
>
> On Mar 6, 2011, at 03:18 , Shane McCarron wrote:
>
>> Gregg,
>>
>> Thanks for your comments.  My replies are inline:
>>
>> On 3/5/2011 3:05 PM, Gregg Kellogg wrote:
>>> I've been doing my own review and updating my processor at the same time. Here are some of my notes:
>>>
>>> Notes on review of 3/1 Editors Draft of RDFa Core 1.1:
>>> http://www.w3.org/2010/02/rdfa/sources/rdfa-core/Overview.html
>>> .
>>>
>>> Section 4.1 RDFa Processor Conformance
>>>
>>> Here it states that the processor *must* examine the media type and process the document as application/xml if it is unable to determine the media type. However, there is no description of how the processor should make this determination. Presumably, for a document retrieved over HTTP, the media type can be determined by examining the Content-Type header to determine this. However, what about documents not retrieved via a medium which reports Content-Type? For example, can a file extension be used to determine the media type of a file on a local file system (e.g., .html, .xhtml, .svg, etc.). I don't see that IANA defines associated file extensions for media types.
>>>
>>> Is it acceptable to examine the root element name to determine the document type?
>>>
>>> This section requires some clarification.
>>>
>> The working group debated this, and decided that it was impossible to specify anything beyond media type.  I personally agree that the DOCTYPE is a fine way to announce the type of a document (hence the name).  But this sort of 'sniffing' seemed beyond the will of the working group. For similar reasons, defining specific suffixes is something that is certainly not going to happen.
>>
>> Personally, I would be open to changing the text to read like the following:
>> A conforming RDFa Processor must examine the media type of a document it is processing to determine the document's Host Language. If the media type is unavailable, a conforming RDFa Processor MAY look at the document's DOCTYPE to determine if its Formal Public Identifier matches that of a known Host Language.  If the RDFa Processor is unable to determine the document's Host Language, or does not support the Host Language, the RDFa Processor must process the document as if it were media type application/xml. See XML+RDFa Document Conformance.
>> However, note that the text above PRECLUDES the HTML5-style doctype with no FPI.  There might be a way to encompass that in this clause... but I personally feel that the absence of an FPI really can't be taken as announcement that a document is written in a specific Host Language.
>>
>
> ----
> Ivan Herman, W3C Semantic Web Activity Lead
> Home: http://www.w3.org/People/Ivan/
> mobile: +31-641044153
> PGP Key: http://www.ivan-herman.net/pgpkey.html
> FOAF: http://www.ivan-herman.net/foaf.rdf
>
>
>
>
>

-- 
Shane P. McCarron                          Phone: +1 763 786-8160 x120
Managing Director                            Fax: +1 763 786-8180
ApTest Minnesota                            Inet: shane@aptest.com
Received on Thursday, 10 March 2011 15:39:01 UTC

This archive was generated by hypermail 2.4.0 : Friday, 17 January 2020 17:05:24 UTC