W3C home > Mailing lists > Public > xml-editor@w3.org > October to December 2003

Treatment of U+2028

From: Nikolai Grigoriev <grig@renderx.com>
Date: Fri, 14 Nov 2003 00:26:20 +0300
Message-ID: <00cc01c3aa2c$d7f29c70$0d00a8c0@Madagascar>
To: <xml-editor@w3.org>

Dear XML editors,

One of the changes proposed for XML 1.1 is to include character U+2028 
(LINE SEPARATOR) into the list of characters subject to end-of-line
normalization. In Section 1.3, the rationale is simply:

XML 1.1>   For completeness, the Unicode line separator character, 
XML 1.1>   #x2028, is also supported [=converted to #xA].

The semantic of the character in Unicode is defined as follows:

Unicode>   2028  LINE SEPARATOR may be used to represent 
Unicode>    this semantic unambiguously

In an application that deals with typographical presentation, 
it may be desirable to have a character that indicates a line break - 
a character equivalent of <BR/>. U+2028 is a natural candidate
for this; but if it is suppressed in normalization, the application will 
not be able to reach it anymore.  

I believe that U+2028 should be exempt from the normalization,
for XML applications to be free to employ its Unicode semantics.
After all, the whole range of special Unicode space characters 
(U+00A0, U+2000 to U+200B) is preserved; the line separator
character is no different.
 
Best regards,
Nikolai Grigoriev
RenderX
Received on Thursday, 13 November 2003 16:26:52 UTC

This archive was generated by hypermail 2.3.1 : Wednesday, 5 February 2014 23:39:46 UTC