Re: [whatwg/encoding] If gb18030 is revised, consider aligning the Encoding Standard (#27)

GB18030-2020 will take effect on 1 Aug 2023. Compliance criteria include, at a minimum, not generating PUA characters the 24 characters for input methods and not using PUA codepoints for fonts.

However, most existing products sold on the Chinese market fail these tests and those old versions will still expected to be used. Also there's existing UTF-8 content which are using those PUA codepoints.

To be backwards compatible, both the PUA and the non-PUA codepoints should map to the correct 2-byte sequences.

Whether or not the 2-byte and 4-byte sequences should map to the non-PUA codepoints is less of an issue -- if keeping the double mapping to U+3000 is deemed web compatible, then keeping the four-byte sequences mapped to the non-PUA codepoints should also be web compatible in the same manner.

-- 
Reply to this email directly or view it on GitHub:
https://github.com/whatwg/encoding/issues/27#issuecomment-1294978231
You are receiving this because you are subscribed to this thread.

Message ID: <whatwg/encoding/issues/27/1294978231@github.com>

Received on Friday, 28 October 2022 13:07:03 UTC