W3C home > Mailing lists > Public > www-international@w3.org > October to December 2014

[Bug 16691] Fix euc-kr

From: <bugzilla@jessica.w3.org>
Date: Tue, 04 Nov 2014 22:06:49 +0000
To: www-international@w3.org
Message-ID: <bug-16691-4285-sPDD4pUXWP@http.www.w3.org/Bugs/Public/>
https://www.w3.org/Bugs/Public/show_bug.cgi?id=16691

--- Comment #14 from Jungshik Shin <jshin@chromium.org> ---
To me/Blink, it does not matter much how index file for EUC-KR is arranged
because we won't use the index file directly (we use it to generate an icu
mapping file). 

However, I found an important incompatibility between  
browsers using ICU on the one hand (Chrome, Opera, Safari) and Firefox on the
other hand when it comes to handling invalid/unassigned code points in legacy
encodings. 

When coming across '\xF0\x61' in EUC-KR/CP949, ICU emits U+FFFD for the two
byte sequence. Firefox emits U+FFFD followed by U+0061. And, that's what the
current encoding spec requires of Big5 (I found it the other day while making
the ICU mapping table for Big5 per the encoding spec). 

We need to reconcile this discrepancy. I'll file a separate bug.

-- 
You are receiving this mail because:
You are on the CC list for the bug.
Received on Tuesday, 4 November 2014 22:06:50 UTC

This archive was generated by hypermail 2.3.1 : Wednesday, 21 September 2016 22:37:38 UTC