- From: Ashok Malhotra <ashokma@microsoft.com>
- Date: Mon, 20 Oct 2003 05:32:20 -0700
- To: "Martin Duerst" <duerst@w3.org>, <w3c-i18n-ig@w3.org>, <w3c-xml-query-wg@w3.org>
- Cc: <public-qt-comments@w3.org>, <msm@w3.org>
Martin:
The refernce below includes:
"2.2. Alignment of references
XML Schema and the Functions and Operators spec should refer to the
same version of Unicode. At the moment, this appears not to be
true."
>>The XML Schema WG asked that the references to Unicode be consistent.
>>See
>><http://lists.w3.org/Archives/Public/public-qt-comments/2003Aug/0003.h
>>tml> h
>>ttp://lists.w3.org/Archives/Public/public-qt-comments/2003Aug/0003.htm
>>l
All the best, Ashok
> -----Original Message-----
> From: Martin Duerst [mailto:duerst@w3.org]
> Sent: Sunday, October 19, 2003 7:26 AM
> To: Ashok Malhotra; w3c-i18n-ig@w3.org; w3c-xml-query-wg@w3.org
> Cc: public-qt-comments@w3.org; msm@w3.org
> Subject: Re: Unicode references
>
> Some additions:
>
> At 16:09 03/10/17 -0400, Martin Duerst wrote:
>
> >At 08:28 03/10/16 -0700, Ashok Malhotra wrote:
>
> >>The XML Schema WG asked that the references to Unicode be
consistent.
> See
> >><http://lists.w3.org/Archives/Public/public-qt-
> comments/2003Aug/0003.html>
> >>h ttp://lists.w3.org/Archives/Public/public-qt-
> comments/2003Aug/0003.html
> >
> >I have looked through this document. I have not found any such
request.
> >(I have copied Michael Sperberg-McQueen, maybe he can help)
> >The closest I have found is
> >http://www.w3.org/XML/Group/2003/07/xmlschema-fo-comments.html#d0e327
> >This refers to the fact that Unicode 2.0 and Unicode 3.0 do not
> >clearly outlaw encoding of non-BMP characters in six bytes (using
> >two surrogate codepoints).
> >
> >If the intent of the XML Schema WG was to request that XML Query
> >should in any way tolerate such encoding, then this would not
> >be appropriate. XML 1.0 references RFC 2279 (see
> >http://www.w3.org/TR/REC-xml#sec-external-ent), which is already
> >clear that non-BMP characters have to be encoded as four bytes.
> >This would also be a bad idea given that there are known security
> >problems connected with overlong UTF-8 byte sequences,
>
> Please note the erratum
> http://www.w3.org/XML/xml-V10-2e-errata#E27, which also makes
> clear that XML doesn't allow overlong/irregular encodings.
>
>
> Regards, Martin.
Received on Monday, 20 October 2003 08:32:25 UTC