Re: what's the language of a document ?

Note that ISO 639-2 language tags formally distinguish an undetermined  
language ('und'), and multiple languages ('mul').

On Oct 25, 2009, at 16:38 , Divya Manian wrote:

> Internationalization best practices [1] states:
>
> “Where a document contains content aimed at speakers of more than one
> language, use Content-Language with a comma-separated list of language
> tags.”
>
> The HTML 5 specs [2] state:
>
> “…there is a document-wide default language set, then that is the  
> language
> of the node.
>
> If there is no document-wide default language, then language  
> information
> from a higher-level protocol (such as HTTP), if any, must be used as  
> the
> final fallback language. In the absence of any language information,  
> the
> default value is unknown (the empty string).”
>
> What is not clear is, what happens if a HTML document has a HTTP  
> header
> Content-Language has a comma-separated list of language tags and no  
> other
> language declarations? I found on a thread [3] that states such a  
> document
> will be declared to use "unknown" language in this case. It would be  
> good to
> have this case explicitly stated.
>
> Regards,
> Divya
> http://nimbu.in
>
>
> [1] http://www.w3.org/TR/i18n-html-tech-lang/#ri20040728.121940236
> [2] http://is.gd/4AsFD
> [3] http://lists.w3.org/Archives/Public/public-html/2008Nov/0091.html
>
>
>
>

David Singer
Multimedia and Software Standard, Apple Inc.

Received on Monday, 26 October 2009 01:36:54 UTC