W3C home > Mailing lists > Public > public-html@w3.org > February 2010

Re: what's the language of a document ?

From: John Cowan <cowan@ccil.org>
Date: Mon, 8 Feb 2010 12:01:33 -0500
To: Henri Sivonen <hsivonen@iki.fi>
Cc: Aryeh Gregor <Simetrical+w3c@gmail.com>, Andrew Cunningham <andrewc@vicnet.net.au>, Ian Hickson <ian@hixie.ch>, "www-international@w3.org" <www-international@w3.org>, public-html@w3.org
Message-ID: <20100208170133.GA1473@mercury.ccil.org>
Henri Sivonen scripsit:

> Moreover, making validators emit a message (of any kind) about the
> absence of a language declaration is likely to lead to authoring tools
> putting in a placeholder in order to silence validators. As a result,
> at least "en" and "en-US" can often be taken to mean "placeholder".

Quite so.  Google, at least, explicitly disregards "en"-based language
tags when determining the language of a web page for search purposes.
Other language tags are accepted as evidence, but are not treated as

John Cowan    http://ccil.org/~cowan    cowan@ccil.org
Economists were put on this planet to make astrologers look good.
        --Leo McGarry
Received on Monday, 8 February 2010 17:02:14 UTC

This archive was generated by hypermail 2.3.1 : Thursday, 29 October 2015 10:15:58 UTC