W3C home > Mailing lists > Public > www-international@w3.org > January to March 2015

Re: [Bug 27851] Add MS932 as a label of Shift_JIS

From: 신정식 <jshin1987+w3@gmail.com>
Date: Wed, 21 Jan 2015 03:21:17 -0800
Message-ID: <CAE1ONj82FMyAsNNbaKV1GP8xS8Z7uyKZqsmk7b_-ACwvwARfow@mail.gmail.com>
To: Anne van Kesteren <annevk@annevk.nl>
Cc: Shawn Steele <Shawn.Steele@microsoft.com>, "www-international@w3.org" <www-international@w3.org>
On Wed, Jan 21, 2015 at 1:44 AM, Anne van Kesteren <annevk@annevk.nl> wrote:

> On Tue, Jan 20, 2015 at 9:07 PM, Shawn Steele
> <Shawn.Steele@microsoft.com> wrote:
> > To me it feels like encouraging bad behavior to make the bad behavior
> "standard".
>
> Basically:
>
> * There are less clients than content
> * It's easier to update clients than content
>
> There are some tradeoffs to how far we are willing to bend over
> backwards, but if we have data indicating adding a label could help a
> ton of users, there's no reason not to add it to all clients.
>
>
> > To further randomize, what are other folks doing about Zawgyi encoded
> Burmese text?  It's mangled Unicode and typically incorrectly declared as
> Unicode.  So it only works with the fonts that expect the non-Unicode
> encoding.
>
> I would expect such documents to include the font. Any examples?
>

There are lots of them. It's similar to pages in India and other South and
SE Asian countries(and eleswhere)  that declare to be in ISO-8859-1 or
windows-1252 but use 'font encoding' to represent scripts in those
countries.

Zawgy is  even worse than those mentioned above because it misuses 'UTF-8'
label. Its encoding structure is UTF-8, but Unicode codepoints in those
documents mean different things than in the Unicode. (BTW, I heard that
there are some Malayalam web sites that also 're-purpose' UTF-8 in a
similar way).

Even in a single document, the "encoding" can change depending on what font
is used for a DOM element.

I think it's better NOT to recognize / support them. Burmese is
well-supported in Unicode and we'd better encourage/promote the standard
way of representing Burmese/Myanmar.

There are extensions for Firefox and Chrome for on-the-fly transcoding of
them to Unicode.

https://addons.mozilla.org/en-US/firefox/addon/padma/
https://chrome.google.com/webstore/detail/padma/ngifghlmhidnielinpjdkkiadocdffbi
https://code.google.com/p/chromium/issues/detail?id=14948
https://bugs.webkit.org/show_bug.cgi?id=22339

For Zawgyi specific extensions, see
https://chrome.google.com/webstore/search/Zawgyi. Firefox should have
similar extensions.

Jungshik




>
> --
> https://annevankesteren.nl/
>
>
Received on Wednesday, 21 January 2015 11:21:44 UTC

This archive was generated by hypermail 2.3.1 : Wednesday, 21 September 2016 22:37:38 UTC