- From: Henry S. Thompson <ht@cogsci.ed.ac.uk>
- Date: Fri, 19 Dec 2003 08:53:25 +0000
- To: "Jack Lindsey" <tuquenukem@hotmail.com>
- Cc: xmlschema-dev@w3.org
"Jack Lindsey" <tuquenukem@hotmail.com> writes:
> I have been asked why I don't validate xml:lang in my schemas to
> restrict its values to something like en-GB, en-US, en-Ca, fr-CA,
> es-MX. But if I roll my own or derive something by restriction in my
> namespace, it may be my:lang but it is no longer xml:lang. But for so
> many internationalization/localization reasons everyone wants the
> instant recognition and standardization of xml:lang. Apart from using
> Schematron or depending on application program logic, are there any
> other useful strategies in this area?
Just what are you trying to rule out? The regulatory situation
regarding language codes, as spelled out in RFC 3066 [1], is
sufficiently complicated that the lexical space constraint given in
the schema REC (as amended) for the xs:language type [2] is really the
strictest it's practical to enforce. With IANA having registered
e.g. cel-gaulish and de-AT-1901 as legal tags, there's really not much
we can do here.
> I love this, from "http://www.w3.org/2001/xml.xsd"
<snip/>
The comment will be removed when the above-cited erratum is formally
encorporated in the 2nd edition of the Schema REC.
ht
[1] http://www.ietf.org/rfc/rfc3066.txt
[2] http://www.w3.org/2001/05/xmlschema-errata#e2-25
--
Henry S. Thompson, HCRC Language Technology Group, University of Edinburgh
Half-time member of W3C Team
2 Buccleuch Place, Edinburgh EH8 9LW, SCOTLAND -- (44) 131 650-4440
Fax: (44) 131 650-4587, e-mail: ht@cogsci.ed.ac.uk
URL: http://www.ltg.ed.ac.uk/~ht/
[mail really from me _always_ has this .sig -- mail without it is forged spam]
Received on Friday, 19 December 2003 03:54:24 UTC