Re: [whatwg/encoding] index-jis0208.txt should be JIS X 0208 and add another index file (#47)

> I vote for splitting the table index-jis0208.txt into two parts, one for the indices < 8836 (the actual JIS X 0208 matrix) and one for the indices >= 8836 (the CP932 additions by Microsoft).

I don't think that makes sense, because the windows-932 extensions are not simply additions at indices >= 8836. Indices < 8836 contain not only genuine JIS X 0208 characters but also the NEC special characters and the NEC selection of IBM extensions. Moreover, some windows-932 mappings are incompatible with the genuine JIS X 0208 mappings: for example, windows-932 maps index 32 to U+FF5E, whereas JIS X 0208 maps it to U+301C. Splitting the index file into two parts would only complicate things.
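
To illustrate the point, here is a rough Python sketch of how an index relates to the 94x94 matrix. The function names are hypothetical, and the row ranges for the NEC blocks (row 13 for the NEC special characters, rows 89 to 92 for the NEC selection of IBM extensions) are my own understanding of the windows-932 layout rather than something read from the index file, so treat them as assumptions.

```python
ROW_SIZE = 94
MATRIX_END = ROW_SIZE * ROW_SIZE  # 8836, the first index past the 94x94 matrix

# Assumed 1-based row ranges for the vendor blocks inside the matrix.
NEC_SPECIAL_ROWS = range(13, 14)
NEC_SELECTED_IBM_ROWS = range(89, 93)

# The one conflicting mapping mentioned above (index 32).
CONFLICTS = {32: {"windows-932": 0xFF5E, "JIS X 0208": 0x301C}}


def kuten(index):
    """Convert a 0-based index into a 1-based (row, cell) pair."""
    return index // ROW_SIZE + 1, index % ROW_SIZE + 1


def region(index):
    """Classify an index by the block it falls into (illustrative only)."""
    if index >= MATRIX_END:
        return "beyond the 94x94 matrix"
    row, _ = kuten(index)
    if row in NEC_SPECIAL_ROWS:
        return "NEC special characters"
    if row in NEC_SELECTED_IBM_ROWS:
        return "NEC selection of IBM extensions"
    return "other rows of the 94x94 matrix"
```

With these assumptions, `region(1128)` and `region(8272)` both fall below 8836 yet land in vendor blocks, and index 32 (row 1, cell 33) is listed in `CONFLICTS`, so a cut at 8836 would not separate JIS X 0208 from the windows-932 extensions.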

-- 
https://github.com/whatwg/encoding/issues/47#issuecomment-251083712

Received on Monday, 3 October 2016 11:27:22 UTC