[Bug 28141] treatment of invalid 2-byte sequence is different in different CJK encodings

https://www.w3.org/Bugs/Public/show_bug.cgi?id=28141

Anne <annevk@annevk.nl> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|NEW                         |RESOLVED
         Resolution|---                         |FIXED

--- Comment #6 from Anne <annevk@annevk.nl> ---
https://github.com/whatwg/encoding/issues/5 changed big5 to check the code
point rather than the pointer.

shift_jis had that problem too, but indeed, we should eat the trail byte for
shift_jis if it is not an ASCII byte.

euc-kr seems wrong too based on that.

gb18030 too.

So I fixed shift_jis, euc-kr, and gb18030.

https://github.com/whatwg/encoding/commit/640bf69847a17fd98df027fd6cd5ae384ac82dab

-- 
You are receiving this mail because:
You are on the CC list for the bug.

Received on Wednesday, 19 August 2015 12:51:14 UTC