- From: Misha Wolf <misha.wolf@reuters.com>
- Date: Wed, 10 Nov 1999 16:20:29 +0000 (GMT)
- To: Jane Hunter <jane@dstc.edu.au>, dc international <dc-international@mailbase.ac.uk>, www international <www-international@w3.org>
Jane Hunter <jane@dstc.edu.au> wrote: >The MPEG-7 people want to be able to describe the language of the MPEG-7 >description using: > - Language Code - code from ISO 639, RFC 1766 > - Country Code - code from ISO 3166 > - Character Set - IANA identifier RFC 1766 language tags are made up of an ISO 639 language code plus an optional ISO 3166 country code, eg "en-us", "en-gb", etc. >Ideally this would be covered using the XML Language identifier: >http://www.w3.org/TR/REC-xml#sec-lang-tag The XML Language identifier (xml:lang) uses RFC 1766 language tags. >But when I read through the XML spec it appears that you can have either the >2-letter language code or the IANA character set code - not both. Can you >confirm this and if its correct, why have they done this? You are confusing two quite different things: 1. Language. Each sentence, or word, or even character, of an XML document may have a different language, indicated using an xml:lang attribute, see: http://www.w3.org/TR/REC-xml#sec-lang-tag 2. Character set encoding. An entire XML document must be encoded the same way. This is indicated using an encoding declaration, see: http://www.w3.org/TR/REC-xml.html#NT-EncodingDecl If the XML document is encoded using UTF-8 or UTF-16 then the encoding declaration may be omitted. >Is it possible to >define all three attributes using xml:lang No. See above. > or do we need to define a new >structure? No. See above. If you want more information, you may do any of the following: - Mail the W3C's public Internationalisation mailing list (www-international@w3.org). - If you are employed by a member of the W3C, join the W3C's Internationalisation Interest Group by mailing the W3C I18N IG Chair, Martin Dürst (duerst@w3.org). - If you are employed by a member of the W3C, join the W3C's Internationalisation Working Group by getting your W3C Advisory Committee representative to mail the W3C I18N WG Chair, Misha Wolf (misha.wolf@reuters.com). Misha [This mail was written using voice recognition software] ----------------------------------------------------------------- Visit our Internet site at http://www.reuters.com Any views expressed in this message are those of the individual sender, except where the sender specifically states them to be the views of Reuters Ltd.
Received on Wednesday, 10 November 1999 11:20:35 UTC