RE: [Encoding] false statement [I18N-ACTION-328][I18N-ISSUE-374] from Andrew Cunningham on 2014-08-28 (www-international@w3.org from July to September 2014)

From: Andrew Cunningham <lang.support@gmail.com>
Date: Fri, 29 Aug 2014 05:18:28 +1000
To: John C Klensin <john+w3c@jck.com>
Cc: wwwintl <www-international@w3.org>, Larry Masinter <masinter@adobe.com>, "Phillips, Addison" <addison@lab126.com>, Richard Ishida <ishida@w3.org>
Message-ID: <CAGJ7U-URJLLHEMu6XxjnLJLuWUHKZHH4QZSnoXBtfA-LaGP=LQ@mail.gmail.com>

On 29/08/2014 4:10 AM, "John C Klensin" <john+w3c@jck.com> wrote:

>
> The one solace here and the one I hope all involved can agree on
> (or have already) is that, with the exception of writing systems
> whose scripts have not yet been encoded in Unicode, everyone
> ought to be moving away from historical encodings and toward
> UTF-8 as soon as possible.  That is the real solution to the
> problem of different definitions and the issues they can cause:
> just move forward to Standard UTF-8 to get away from them and
> consider the present mess as added incentive.
>

Unfortunately, that ship has already sailed. UTF-8 already suffers from the
same problem. The term some of us use for it is pseudo-Unicode.

For some languages, a sizeable amount of content is in this category.

To add to the problem some handset (mobile/cell phone) and tablet
manufacturers have baked in pseudo-Unicode for specific languages.

As the expression goes, that ship has already sailed.

Andrew

Received on Thursday, 28 August 2014 19:18:55 UTC