- From: Biron,Paul V <Paul.V.Biron@kp.org>
- Date: Mon, 16 Aug 2004 15:28:07 -0700
- To: "'www-xml-schema-comments@w3.org'" <www-xml-schema-comments@w3.org>
- Message-Id: <8E9F0028F5955844899380433C60E39902A0D2FC@cscrdemsg001.crdc.kp.org>
We need to open an issue for 2e and 1.1 regarding (possible) changes to xs:language necessary to align with XML 1.03e [1] and RFC 3066bis [2].
The easy one is that XML 1.03e allows the emtpy string as a legal value for xml:lang and is interpreted as "forget any info you knew about language from an ancestor" (note: 2e and before never explicitly allowed it but gave that impression). The definition we give for xs:language is:
<xs:simpleType name="language" id="language">
<xs:restriction base="xs:token">
<xs:pattern value="[a-zA-Z]{1,8}(-[a-zA-Z0-9]{1,8})*"/>
</xs:restriction>
</xs:simpleType>
So, to allow for the empty string we at least need to change this to:
<xs:simpleType name="language" id="language">
<xs:restriction base="xs:token">
<xs:pattern value="([a-zA-Z]{1,8}(-[a-zA-Z0-9]{1,8})*)?"/>
</xs:restriction>
</xs:simpleType>
The second issue is that RFC 3066 is being revised as we speak...the current draft of 3066bis is at [2]. 3066bis puts a number of contraints on the subtags. There has been some discussion with i18n regarding how exact lexical/value space of xs:language should be made relative to the EBNF in 3066bis [3,4]. At the very least they would like us to change our reference to "3066 or successors".
pvb
[1] http://www.w3.org/TR/2004/REC-xml-20040204/#sec-lang-tag
[2] http://www.ietf.org/internet-drafts/draft-phillips-langtags-05.txt
[3] http://lists.w3.org/Archives/Member/w3c-xml-schema-ig/2004Jul/0057.html
[4] http://lists.w3.org/Archives/Member/w3c-xml-schema-ig/2004Jul/0096.html
Received on Monday, 16 August 2004 22:35:41 UTC