- From: Xan Gregg <xan.gregg@jmp.com>
- Date: Wed, 3 Jan 2007 15:45:39 -0500
- To: "Michael Kay" <mike@saxonica.com>
- Cc: "'Tsao, Scott'" <scott.tsao@boeing.com>, <xmlschema-dev@w3.org>
I think Michael's list option won't work because the quoted string
items can contain whitespace. The all-in-one pattern requires
trailing whitespace, so I offer the following derivative which doesn't:
<xs:pattern value='("[^"]*"(\s+"[^"]*")*)?'/>
The final '?' is to allow the empty list (no items). Remove it if
that is not desired.
xan
On Jan 3, 2007, at 2:57 PM, Michael Kay wrote:
>
> Looks to me something like
>
> <xs:simpleType name="quotedString">
> <xs:restriction base="xs:string">
> <xs:pattern value='".*"'/>
> </xs:restriction>
> </xs:simpleType>
>
> <xs:simpleType name="listOfQuotedStrings">
> <xs:list itemType="quotedString"/>
> </xs:simpleType>
>
> or if you don't want to use a list type,
>
> <xs:simpleType name="listOfQuotedStrings">
> <xs:restriction base="xs:string">
> <xs:pattern value='(("[^"]*")\s+)*'/>
> </xs:restriction></xs:simpleType>
> </xs:simpleType>
>
> ...
>
>> -----Original Message-----
>> From: xmlschema-dev-request@w3.org
>>
>> ...
>>
>> I'm trying to design a W3C XML Schema type description for an
>> element containing an arbitrary number of quoted strings
>> separated by arbitrary whitespace. The contents of the
>> quoted items are themselves limited to alphanumerics,
>> whitespace, and common punctuation characters, excluding
>> embedded quote characters. (The double quote here is chosen
>> as an arbitrary delimeter and has no special significance.)
>>
>> Example:
>> "abc" "de f" "123_456"
>> "foo bar" "etc."
>>
>> I'm not aware of a "built-in" XML Schema type that can
>> support this representation directly. It also appears that
>> the W3C XML Schema "pattern"
>> facet (allowing the specification of a regular expression for a type
>> format) does not support the "non-greedy" quantifier syntax,
>> e.g., "*?", "+?" that is common in many regular expression engines.
>>
>> Can anyone suggest a regex to define this format without the
>> non-greedy quantifiers, or perhaps an XML Schema
>> representation that can handle this format directly?
>>
>
>
Received on Wednesday, 3 January 2007 20:48:00 UTC