AW: AW: How to distinguish unique and non-unique prefLabels? from Neubert Joachim on 2011-01-13 (public-esw-thes@w3.org from January 2011)

From: Neubert Joachim <J.Neubert@zbw.eu>
Date: Thu, 13 Jan 2011 10:27:03 +0100
To: "Jakob Voss" <jakob.voss@gbv.de>, <public-esw-thes@w3.org>
Message-ID: <3A59BB6451C972429019B12996F92DAD02E4D7F2@frodo.zbw-nett.zbw-kiel.de>
Hi Jakob,

Thanks for pointing this out, I didn't get you fully in the first step.

dc:title surely is a valid modeling solution for distinguishing preferred from non-preferred non-unique labels, and its use looks quite intuitive also. (So I'll probably add it in the next STW version for the plain labels.) An application however will need this piece of special knowledge as well.

My arguing for the uniqueness asumption of skos:prefLabel had in mind more general applications, which deal with data (not necessarily SKOS concepts) of different structure. I like the idea to have a (mostly) unique label for every piece of data, to use it as a heading in a "record-like" display, for a generic way of sorting result sets, for creating a generic one-line result list entry, etc. For such use cases, it doesn't hurt much that skos:prefLabel is not *guaranteed* to be unique. (And I think it is wise that guaranteed uniqueness is not required here.)

By generating 

   skos:uniquePrefLabel "History (USA)"@en ;

in your example, you achieve the same ends. I think *how* to generate such unique labels depends heavily on the underlying data and the use cases you think about - and finally is a matter of taste, too. Of cause, no application should depend on the semantics of such generated labels, but get the needed information from more specific properties. So, in my point of view, there is no need to argue about "real" and "false" labels here ;)

Thanks for rising this discussion!

Cheers, Joachim


-----Ursprüngliche Nachricht-----
Von: public-esw-thes-request@w3.org [mailto:public-esw-thes-request@w3.org] Im Auftrag von Jakob Voss
Gesendet: Mittwoch, 12. Januar 2011 17:13
An: public-esw-thes@w3.org
Betreff: Re: AW: How to distinguish unique and non-unique prefLabels?

Hi Joachim,

Thanks for the feedback. I'd like to stress that my use-case is a 
classification, but not a thesaurus. You wrote:

> unique and non-unique skos:prefLabel subproperties seem a little bit
> counter-intuitive to me, since SKOS generally recommends unique use
> of skos:prefLabel.

I think the SKOS primer aims at thesauri and flat authority files with 
this recommendation. A classification generally has unique notation but 
in many cases the labels are not unique, and applications that deal with 
classifications, don't expect labels to be unique. If skos:prefLabel is 
not the right property for non-unique labels, then we'd need another 
property to distinguish alternative labels and the "preferred" 
alternative label. By the way http://dewey.info/ also uses 
skos:prefLabel. If you go down the whole DDC, there are non-unique 
labels too.

> This makes the skos:prefLabel property really valuable, far beyond
> the scope of thesauri, classifications and the like, because
> applications can assume uniqueness of skos:prefLabel for e.g. sorting
> or one-line display purposes of given resources.

Many use cases only need the guaranteed assumption, that each concept 
has only one preferred label by language. The assumption that each 
preferred label is unique in a given concept scheme, is not guaranteed 
by SKOS, so it should be validated by applications, that require it.

> In the systematic part of STW Thesaurus for Economics
> (http://zbw.eu/stw), I had to deal with the very same problem. There,
> I used rdfs:label as non-unique labeling property for the class name,
> and a concatination of notation and class name for skos:prefLabel.

You wrote:

<http://zbw.eu/stw/thsys/71085>
   rdfs:label "Development Policy"@en ;
   skos:prefLabel "V.08.02  Development Policy"@en  ;
   skos:notation "V.08.02"^^xsd:string .

Looks like a similar problem. But for general SKOS applications, 
rdfs:label is of even less use than skos:hiddenLabel, and it infers from 
*all* labeling properties, so you end up with many labels and no clue, 
which is the main label to use, if uniqueness is not relevant.

If concatenate notation and "real label", an application must have 
additional knowledge like "if the prefLabel starts with the notation, 
don't prepend the notation in displays, but look for rdfs:label for a 
label that does not contain the notation and that is intended to be used 
as preferred label, if uniqueness is not relevant". This is too much 
application-specific semantic in my point of view.

I need to distinguish between a label, that should be used, if 
uniqueness per scheme is not required, and a label, that should be used, 
if uniqueness per scheme is required. Both are given and there can be 
notations and alternative labels independent from that.

The case may be easier to solve, if the unique label has been created by 
adding a qualifier to the non-unique label, such as "V.08.02 
Development Policy" or "Development Policy (Economics)". Maybe SKOS-XL
can belp? Another solution would be using dc:title instead of rdfs:label:

<http://zbw.eu/stw/thsys/71085>
   dc:title "Development Policy"@en ;
   skos:notation "V.08.02"^^xsd:string .

The additional best-practice rule, that we should propagate for all SKOS 
applications, would be to use dc:title instead of skos:prefLabel, if 
uniqueness of labels is not relevant, or if no skos:prefLabel is given. 
If there are no unique labels, we should not lie and artificially create 
them in the data, but in the application that needs them (for instance 
by prepending the notation).

In short: if the concept's main labels in a concept scheme is not unique 
in this scheme (this is the fact in many classifications!), we either

A) have to use skos:prefLabel non-unique, or

B) let some concepts not have any skos:prefLabel property at all

Both does not contradict the SKOS specification. I am looking for the 
best practice to apply either A or B.

Jakob

-- 
Jakob Voß <jakob.voss@gbv.de>, skype: nichtich
Verbundzentrale des GBV (VZG) / Common Library Network
Platz der Goettinger Sieben 1, 37073 Göttingen, Germany
+49 (0)551 39-10242, http://www.gbv.de
Received on Thursday, 13 January 2011 09:30:48 UTC