W3C home > Mailing lists > Public > xmlschema-dev@w3.org > May 2011

Re: Is every XML Schema validator guaranteed to support the same set of Unicode characters?

From: C. M. Sperberg-McQueen <cmsmcq@blackmesatech.com>
Date: Tue, 17 May 2011 09:18:06 -0600
Cc: "C. M. Sperberg-McQueen" <cmsmcq@blackmesatech.com>, "xmlschema-dev@w3.org" <xmlschema-dev@w3.org>
Message-Id: <58DBECF8-9E9E-4247-B284-0B0D2069205B@blackmesatech.com>
To: "Costello, Roger L." <costello@mitre.org>

On May 16, 2011, at 12:54 PM, Costello, Roger L. wrote:

> Hi Folks,
> 
> 1. Is every XML Schema validator guaranteed to support the same set of Unicode characters? 

What does 'support' mean here?

> 
> For example, will the set of characters expressed by this regex:
> 
>     [\d\D]
> 
> be the same regardless of the XML Schema validator?

What does the spec say?

> 
> 2. Is every version of XML Schema guaranteed to support the same set of Unicode characters as all other versions?

What does the spec say about the reference to Unicode?

> 
> 3. Does XML determine the set of characters supported by XML Schema? That is, does XML Schema support the set of Unicode characters specified in the XML specification? 

In what way can the XML specification be said to specify a set of Unicode
characters?  What does your second sentence mean?

> 
> 4. If I use this regex in my XML Schema:
> 
>      [^0-9]*
> 
> Is there a risk that:
> 
> a. The set of strings described by the regex may vary, depending on the XML Schema validator (or an XML Schema application)?

How could this happen?  What rules might prevent it happening?

> 
> b. With different versions of XML Schema (e.g., XML Schema 1.0, XML Schema 1.1) the regex may describe different sets of strings?

Have you stopped beating your wife?

-- 
****************************************************************
* C. M. Sperberg-McQueen, Black Mesa Technologies LLC
* http://www.blackmesatech.com 
* http://cmsmcq.com/mib                 
* http://balisage.net
****************************************************************
Received on Tuesday, 17 May 2011 15:18:35 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Tuesday, 17 May 2011 15:18:35 GMT