Re: New work item for XML group ? (Re: Comments on 31 March spec)

From: Gavin Nicol <gtn@eps.inso.com>
Date: Thu, 10 Apr 1997 09:31:33 -0400
Message-Id: <199704101331.JAA00791@nathaniel.ebt>
To: murata@apsdc.ksp.fujixerox.co.jp
CC: w3c-sgml-wg@w3.org
>The two new character code standards of JIS are expected to be widely 
>used as part of SJIS.  Then, if XML is restricted to the BMP, conversion 
>from SJIS text to XML leads to loss of information.

This is not necessarily true. You can generate an XML instance that
is entirely encoded in ASCII that contains every single character
from the BMP, plus characters not in the BMP. The *information*
would all be there, but the representation might be "not optimal".
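
A minimal sketch of the point above, assuming Python and its standard
library (neither is part of the original message). The function name
to_ascii_xml, the element name "doc", and the sample string are
hypothetical, and the parser used follows XML 1.0 as later published,
which may differ from the 31 March draft under discussion. Every
non-ASCII character, inside or outside the BMP, is written as a numeric
character reference, so the instance is pure ASCII and nothing is lost
on the round trip.

```python
import xml.etree.ElementTree as ET
from xml.sax.saxutils import escape


def to_ascii_xml(text, tag="doc"):
    """Wrap text in a single element, using only ASCII bytes.

    Markup characters (&, <, >) are escaped first; every non-ASCII
    character is then replaced by a decimal reference such as &#26085;.
    """
    body = escape(text).encode("ascii", "xmlcharrefreplace").decode("ascii")
    return "<{0}>{1}</{0}>".format(tag, body)


# Hypothetical sample: three BMP characters, a space, and one
# character outside the BMP (U+20000).
sample = "\u65e5\u672c\u8a9e \U00020000"

xml_text = to_ascii_xml(sample)

# Parsing the ASCII-only instance recovers the original characters.
round_trip = ET.fromstring(xml_text).text
```

The representation is bulkier than a native encoding (each character
costs several bytes), which is the "not optimal" part, but the
character data itself survives intact.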

Some people would argue that shift-jis is not exactly a good encoding...
Received on Thursday, 10 April 1997 09:33:00 EDT