[Bug 28156] Separate GBK and GB18030 even for decoding (toUnicode)

https://www.w3.org/Bugs/Public/show_bug.cgi?id=28156

Anne <annevk@annevk.nl> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |hsivonen@hsivonen.fi,
                   |                            |smontagu@smontagu.org

--- Comment #1 from Anne <annevk@annevk.nl> ---
I would have expected that treating them identically for decoding saves you a
decoding table. Or would you reuse that anyway?

They're treated identically because gbk is effectively a subset and for the
other encodings we've found that supersets leak. I think there might be some
anecdotal evidence here too, but not sure.

-- 
You are receiving this mail because:
You are on the CC list for the bug.

Received on Thursday, 12 March 2015 11:50:30 UTC