W3C home > Mailing lists > Public > public-i18n-core@w3.org > October to December 2009

Re: HTML5 Issue 11 (encoding detection): I18N WG response...

From: Andrew Cunningham <andrewc@vicnet.net.au>
Date: Wed, 14 Oct 2009 13:02:42 +1100
Message-ID: <4AD53142.4090702@vicnet.net.au>
To: Leif Halvard Silli <xn--mlform-iua@xn--mlform-iua.no>
CC: Ian Hickson <ian@hixie.ch>, Henri Sivonen <hsivonen@iki.fi>, Maciej Stachowiak <mjs@apple.com>, Mark Davis ☕ <mark@macchiato.com>, Martin_J=2E_D=FCrst <duerst@it.aoyama.ac.jp>, "Phillips, Addison" <addison@amazon.com>, Richard Ishida <ishida@w3.org>, "public-html@w3.org" <public-html@w3.org>, "public-i18n-core@w3.org" <public-i18n-core@w3.org>, Larry Masinter <masinter@adobe.com>

Leif Halvard Silli wrote:

> Ian's table [1] right now says that "all other locales" should default 
> to Windows 1252. That would include localizations for those languages 
> you have in mind here also.
> [1] 
> http://www.whatwg.org/specs/web-apps/current-work/multipage/parsing.html#determining-the-character-encoding 
I'd be inclined to look at system locale as well as UI locale, and what 
encodings are supported by the application and OS. And for many 
languages added to windows since win2000, the languages are only 
supported by Unicode, no new code pages have been defined. Similarly on 
other platforms.

For some languages it may make sense to have a fall back encoding as a 
legacy encoding, where an appropriate legacy encoding is supported. But 
the reality is that web browsers don't support legacy encodings for many 
African, Central Asian, South Asian and South East Asian languages, and 
since the encoding support isn't there, the logical fall back encoding 
is UTF-8.


Andrew Cunningham
Senior Manager, Research and Development
State Library of Victoria
328 Swanston Street
Melbourne VIC 3000

Ph: +61-3-8664-7430
Fax: +61-3-9639-2175

Email: andrewc@vicnet.net.au
Alt email: lang.support@gmail.com


Received on Wednesday, 14 October 2009 02:03:21 UTC

This archive was generated by hypermail 2.3.1 : Tuesday, 6 January 2015 21:23:05 UTC