- From: Peter Occil <notifications@github.com>
- Date: Sun, 22 Nov 2015 00:42:25 -0800
- To: whatwg/encoding <encoding@noreply.github.com>
Received on Sunday, 22 November 2015 08:42:53 UTC
It seems that for some code points, the gb18030 ranges index does not have a round-trip mapping. Take the code point U+8000 (32768) for example. When we apply "index gb18030 ranges pointer", we get:

32768 ---> 18962, 0x4DAF --> 18962 + 32768 - 19887 --> 31843

But when we apply "index gb18030 ranges code point" to 31843, we get:

31843 ---> 19043, 0x9FA6 --> 40870 + 31843 - 19043 --> 53670

and that differs from our original 32768. I think the reason is that each range is poorly defined; it's not clear where each range starts and ends in "index-gb18030-ranges.txt".

---
Reply to this email directly or view it on GitHub:
https://github.com/whatwg/encoding/issues/17
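The arithmetic in the message can be reproduced with a small Python sketch. It assumes a table holding only the two (pointer, code point) entries quoted above, so it is not the full "index-gb18030-ranges.txt", and it leaves out any special cases the specification's algorithms handle. The names `RANGES`, `ranges_pointer`, and `ranges_code_point` are hypothetical and chosen only for illustration.

```python
# Hypothetical excerpt: only the two (pointer, code point) entries quoted
# in the message. The real index contains many more rows.
RANGES = [(18962, 0x4DAF), (19043, 0x9FA6)]


def ranges_pointer(code_point):
    """Code point -> pointer, using the last entry whose code point is <= the input."""
    offset, pointer_offset = max(
        (cp, p) for p, cp in RANGES if cp <= code_point
    )
    return pointer_offset + code_point - offset


def ranges_code_point(pointer):
    """Pointer -> code point, using the last entry whose pointer is <= the input."""
    pointer_offset, code_point_offset = max(
        (p, cp) for p, cp in RANGES if p <= pointer
    )
    return code_point_offset + pointer - pointer_offset


pointer = ranges_pointer(0x8000)    # 18962 + 32768 - 19887
back = ranges_code_point(pointer)   # 40870 + 31843 - 19043
print(pointer, back, back == 0x8000)
```

With this excerpt the forward lookup selects the 0x4DAF row, while the reverse lookup selects the 0x9FA6 row, which is why the two steps do not undo each other for this input.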