- From: Fran輟is Yergeau <yergeau@ALIS.COM>
- Date: Wed, 24 Nov 1999 08:15:15 -0500
- To: "'Martin J. Duerst'" <duerst@w3.org>, "'Jane Hunter'" <jane@dstc.edu.au>
- Cc: "'Misha Wolf'" <misha.wolf@reuters.com>, "'www international'" <www-international@w3.org>, <xml-editor@w3.org>
This looks like a genuine error in the spec. 2.12 points to [IANA], which refers to the charset registry at IANA, not to the language code registry (also at IANA). The reference to the charset registry is needed, it is used in 4.3.3. The fix seems to be to expand the [IANA] entry in A.1 to also cover the lang code registry. Or to have a second entry for lang codes. I CC'd xml-editor@w3.org to get this erratum in the pipeline (which is currently blocked, pending the startup of the XML Core WG). Note that there is already an approved erratum that makes the paragraphs after production [38] non-normative (RFC 1766 is normative). -- Francois Yergeau > -----Message d'origine----- > De: Martin J. Duerst [mailto:duerst@w3.org] > Date: mercredi 24 novembre 1999 00:47 > タ: Jane Hunter > Cc: Misha Wolf; www international; yergeau@ALIS.COM > Objet: Re: [Moderator Action] Re: XML Language Identifier (fwd) > > > Forwarded by the list maintainer. Copying Francois Yergeau, > who will know whether this is a known problem or not. > > Regards, Martin. > > At 16:58 1999/11/10 -0500, Jane Hunter wrote: > > Thanks Misha, > > > > One reason why this is so confusing is that the link in the > XML Spec to IANA > > language identifiers takes you to IANA character sets: > > > > http://www.w3.org/TR/REC-xml#NT-LanguageID > > http://www.w3.org/TR/REC-xml#IANA > > > > jane > > > > > >But when I read through the XML spec it appears that you > can have either the > > > >2-letter language code or the IANA character set code - > not both. Can you > > > >confirm this and if its correct, why have they done this? > > > > > > You are confusing two quite different things: > > > > > > 1. Language. Each sentence, or word, or even character, > of an XML > > > document may have a different language, indicated > using an xml:lang > > > attribute, see: > > > http://www.w3.org/TR/REC-xml#sec-lang-tag > > > > > > 2. Character set encoding. An entire XML document must > be encoded the > > > same way. This is indicated using an encoding > declaration, see: > > > http://www.w3.org/TR/REC-xml.html#NT-EncodingDecl > > > If the XML document is encoded using UTF-8 or UTF-16 then the > > > encoding declaration may be omitted. > > > > > > >Is it possible to > > > >define all three attributes using xml:lang > > > > > > No. See above. > > > > > > > or do we need to define a new > > > >structure? > > > > > > No. See above. > > > > > > If you want more information, you may do any of the following: > > > > > > - Mail the W3C's public Internationalisation mailing list > > > (www-international@w3.org). > > > > > > - If you are employed by a member of the W3C, join the W3C's > > > Internationalisation Interest Group by mailing the W3C I18N IG > > > Chair, Martin D・st (duerst@w3.org). > > > > > > - If you are employed by a member of the W3C, join the W3C's > > > Internationalisation Working Group by getting your W3C > Advisory > > > Committee representative to mail the W3C I18N WG Chair, > > > Misha Wolf (misha.wolf@reuters.com). > > > > > > Misha > > > > > > [This mail was written using voice recognition software] > > > > > > > > > ----------------------------------------------------------------- > > > Visit our Internet site at http://www.reuters.com > > > > > > Any views expressed in this message are those of the individual > > > sender, except where the sender specifically states them to be > > > the views of Reuters Ltd. > > > > > > > > > > > > > > #-#-# Martin J. Du"rst, World Wide Web Consortium > #-#-# mailto:duerst@w3.org http://www.w3.org >
Received on Wednesday, 24 November 1999 11:49:59 UTC