W3C home > Mailing lists > Public > www-international@w3.org > January to March 2013

Re: I18N-ISSUE-247: Clarify interpretation of line breaks when calculating storage size [ITS-20]

From: Anne van Kesteren <annevk@annevk.nl>
Date: Fri, 29 Mar 2013 11:43:01 +0000
Message-ID: <CADnb78hLr9A5nTHC=6ON+dkj+YLL5FQt52RGskNZFFyVSGN13w@mail.gmail.com>
To: Jirka Kosek <jirka@kosek.cz>
Cc: "public-multilingualweb-lt@w3.org" <public-multilingualweb-lt@w3.org>, www-international@w3.org
On Fri, Mar 29, 2013 at 11:37 AM, Jirka Kosek <jirka@kosek.cz> wrote:
> OK I see where you going. What about adding something like:
>
> "For purposes of storage size calculations ITS processor MUST behave as
> if line ends were normalized accordingly to
> http://www.w3.org/TR/REC-xml/#sec-line-ends (or to
> http://www.w3.org/TR/xml11/#sec-line-ends if XML 1.1 is used) and only
> LINE FEED (U+000A) character is then considered as a line break."
>
> So for XML 1.0 U+0085 will not be considered as a line break, same for
> &#x0D;

I don't  really see how that follows. Are you storing XML converted to
bytes or a set of bytes you arrived at after parsing the XML and
taking the text content of the relevant element? The latter will not
contain &#x0D; unless your input was &amp;#x0D;.


-- 
http://annevankesteren.nl/
Received on Friday, 29 March 2013 11:43:32 UTC

This archive was generated by hypermail 2.3.1 : Friday, 29 March 2013 11:43:33 UTC