- From: Philipp Cimiano <cimiano@cit-ec.uni-bielefeld.de>
- Date: Thu, 06 Feb 2014 20:30:31 +0100
- To: Felix Sasaki <fsasaki@w3.org>, "John P. McCrae" <jmccrae@cit-ec.uni-bielefeld.de>
- CC: "public-ontolex@w3.org" <public-ontolex@w3.org>
- Message-ID: <52F3E2D7.9080607@cit-ec.uni-bielefeld.de>
Dear all,
thanks for all your input to the language coding issue.
I have now written the following in the document:
When specifying the language of a literal, in this document we adhere to
to Best Common Practice 5646
(http://www.rfc-editor.org/rfc/bcp/bcp47.txt). According to BCP 5646,
tags are made up of a language code (a three letter ISO 639-3 code or a
two letter ISO 639-1 code if available, see
http://www.iso.org/iso/home/standards/language_codes.htm) followed by a
hyphen and a ISO 3166-1 country code
(http://www.iso.org/iso/iso-3166-1_decoding_table.html).
We follow the convention of writing the language codes in lower case and
the country codes in upper case.
However, this is not part of the specification of this document; users
of the lexicon-ontology model can adopt any strategy to specify the
language, though we strongly recommend to follow BCP 5646.
I think this is in line with all your contributions.
Let me know otherwise.
Philipp.
Am 30.01.14 12:23, schrieb Felix Sasaki:
> Am 30.01.14 12:09, schrieb John P. McCrae:
>>
>>
>>
>> On Thu, Jan 30, 2014 at 7:47 AM, Philipp Cimiano
>> <cimiano@cit-ec.uni-bielefeld.de
>> <mailto:cimiano@cit-ec.uni-bielefeld.de>> wrote:
>>
>> Dear all,
>>
>> I am afraid I will not be able to attend the ontolex telco this
>> Friday. I will now work on the document, so please provide your
>> feedback by email.
>>
>> I would kindly ask you all to work on the sections in the
>> document assigned to you ;-)
>>
>> Other that that I wanted to clarify one issue regarding language
>> codes in the example.
>>
>> I have seen that some people (John?) have started to use the ISO
>> 639-2 codes (e.g. "ENG" for English, "SPA" for Spanish etc.).
>> I would propose we stick to the ISO 639-1 two-letter ISO 639-1
>> codes (e.g. "EN", "ES") etc. There is no particular reason for
>> this other than the fact that most people know these codes.
>>
>> Yes that would be me, I use the ISO 639-3 codes as they represent the
>> most complete and usable list of codes. At any rate, this is not part
>> of our standardization efforts and applications must support
>> well-formatted codes using any ISO standard
>>
>>
>> If the argument is recency and reusing the newest standard, then
>> we would have to go anyway for four letter codes according to ISO
>> 639-6.
>>
>> Erm 639-6 has a different purpose... it is not really appropriate
>> here (and is equal to 639-3 for standard languages anyway)
>>
>>
>> Regarding the particular versions of a language spoken in a
>> particular country, I recommend we follow the principle of IETF
>> tags which consists of the ISO code followed (if applicable) by a
>> hyphen and the ISO 3166-1 code of the country. Thus the variation
>> of English spoken
>> in the United States would be: "en-us" while the version of
>> English spoken in Great Britain would be "en-gb".
>>
>> There is a standard for this, namely RFC 5646
>
> Hi John, all,
>
> just to be picky, there is BCP 47 ("Best Common Practice") that
> defines language tags and matching of language tags. Various RFCs have
> been published about language tags, but the stable reference, that is
> "latest version" identifier for this, is always
> http://www.rfc-editor.org/rfc/bcp/bcp47.txt
> or in HTML http://tools.ietf.org/html/bcp47
> currently it says "Request for Comments: 5646" at the top (the languge
> tag part) and RFC 4647 later (the matching part). You can find the
> previous RFCs by clickling on the "obsoletes" links, e.g. "Obsoletes:
> 4646 <http://tools.ietf.org/html/rfc4646> "
>
> - Felix
>
>> , and we should follow that as with all RDF. (It does agree with your
>> proposal here though)
>>
>> Regards,
>> John
>>
>>
>> I hope this is fine for everyone. I will add this information to
>> the document.
>>
>> Regards,
>>
>> Philipp.
>>
>> --
>>
>> Prof. Dr. Philipp Cimiano
>>
>> Phone: +49 521 106 12249 <tel:%2B49%20521%20106%2012249>
>> Fax: +49 521 106 12412 <tel:%2B49%20521%20106%2012412>
>> Mail: cimiano@cit-ec.uni-bielefeld.de
>> <mailto:cimiano@cit-ec.uni-bielefeld.de>
>>
>> Forschungsbau Intelligente Systeme (FBIIS)
>> Raum 2.307
>> Universität Bielefeld
>> Inspiration 1
>> 33619 Bielefeld
>>
>>
>>
>
--
Prof. Dr. Philipp Cimiano
Phone: +49 521 106 12249
Fax: +49 521 106 12412
Mail: cimiano@cit-ec.uni-bielefeld.de
Forschungsbau Intelligente Systeme (FBIIS)
Raum 2.307
Universität Bielefeld
Inspiration 1
33619 Bielefeld
Received on Thursday, 6 February 2014 19:31:00 UTC