- From: Christian Chiarcos <christian.chiarcos@gmail.com>
- Date: Thu, 2 Feb 2023 12:32:15 +0100
- To: r12a <ishida@w3.org>
- Cc: Jorge Gracia del Río <jogracia@unizar.es>, public-bpmlod@w3.org, Addison Phillips <addisonI18N@gmail.com>, Felix Sasaki <felix@sasakiatcf.com>
- Message-ID: <CAC1YGdgQjzNkDPE4zaQvvCyCQM=waKQQmyx+QwTEZH4q3RH5Hw@mail.gmail.com>
Am Do., 2. Feb. 2023 um 11:37 Uhr schrieb r12a <ishida@w3.org>: > hello Christian, > > BCP47 does in fact represent all the ISO 632-3 languages. > You're right, ISO 639-3 had been added. What led to my confusion is the following paragraph in the spec: "When a language has no ISO 639-1 two-character code and the ISO 639-2/T (Terminology) code and the ISO 639-2/B (Bibliographic) code for that language differ, only the Terminology code is defined in the IANA registry." I think that should be updated to reflect where ISO 639-3 stands in the ranking. It is based on ISO 639-2/T, but has a considerable number of internal redefinitions. For example, I have the strong impression that ISO 632-2 frs ("East Frisian") referred to a dialect of the Frisian language (the Saterland dialect) whereas ISO 632-3 frs ("East Frisian" refers to a dialect of Low German spoken in Eastern Frisia). I'm thinking this because ISO 632 subclassifies Frisian into its member branches, but doesn't do that for Low German ("nds"). ISO 639-3 subclassifies Low German ("nds", "wep", "frs", "pdt", numerous codes for Low German in the Netherlands, and, along the way, redefines the "nds" language tag to everything not assigned to a particular dialect), and this actually seems to be triggered by the existence of a separate language code for East Frisian (this is hinted at by https://iso639-3.sil.org/code/frs *officially* pointing to https://en.wikipedia.org/wiki/East_Frisian_Low_Saxon). Anyway, this example neatly illustrates that leveraging ISO language codes from different institutions and highly underspecified documentation (esp. for ISO 639-2 and ISO 639-1, which only give the name) ends up in something rather messy. In consequence, some people produce invalid BCP47-alike tags by just using ISO639-3 tags or URIs all the time. This was done in the GOLD ontology, for example (seems to be offline, but cf. the second entry under https://lov.linkeddata.es/dataset/lov/terms?q=prefix*). Best, Christian
Received on Thursday, 2 February 2023 11:32:40 UTC