- From: Gavin Nicol <gtn@eps.inso.com>
- Date: Thu, 10 Apr 1997 09:31:33 -0400
- To: murata@apsdc.ksp.fujixerox.co.jp
- CC: w3c-sgml-wg@w3.org
>The two new character code standards of JIS are expected to be widely >used as part of SJIS. Then, if XML is restricted to the BMP, conversion >from SJIS text to XML leads to loss of information. This is not necessarily true. You can generate an XML instance that is entirely encoding in ASCII that contains every single character from the BMP, plus characters not in the BMP. The *information* would all be there, but the representation might be "not optimal". Some people would argue that shift-jis is not exactly a good encoding...
Received on Thursday, 10 April 1997 09:33:00 UTC