- From: Bernard Vatant <bernard.vatant@mondeca.com>
- Date: Thu, 02 Apr 2009 19:46:43 +0200
- To: Simon Reinhardt <simon.reinhardt@koeln.de>
- CC: public-lod@w3.org
Hi Simon Thanks for the feedback. You (or someone else) already mentioned a while ago this extra "language" in labels. It's true in other languages too. It's on my to-do list to clean the data in order to get rid of those, and/or to use skos:prefLabel for "English" and skos:altLabel for "English language", or maybe the other way round, did not make my mind completely, I've to think about it. Any suggestions welcome. :-) Bernard Simon Reinhardt a écrit : > Bernard Vatant wrote: >> Hello all >> >> I've started refreshing the content of pages at http://lingvoj.org, >> which was long overdue. >> The data set now links to DBpedia (of course), Freebase and OpenCyc >> (which had been broken since OpenCyc had changed its URIs a while >> ago). But there are no backwards links from those yet... so far the >> only dataset that I know linking to lingvoj.org is Linked MDB, but it >> does not use the correct URIs [1]. :'( Please don't be shy! >> lingvoj.org URIs are "cool" : stable, simple, and dereferencable. ;-) >> Bandwidth permitting, I will now focus on data quality, for the most >> used language (ISO 639-1 list). > > Hi Bernard, > > Great news! Re. interlinking: I'm linking to lingvoj in the university > project I'm currently working on. Should it ever get into a > publishable state I will show you right away. ;-) > > In this same project I have a minor problem with the lingvoj data: I'm > using the labels in your dataset so I can display names of the > languages in the user's language in the user interface. However the > Wikipedia article names you use for those labels don't always lend > themselves that nicely for user interface labels. Very often they say > something like "English language" to distinguish the article from the > one about the attribute "English" in general. But then in my interface > it would say: "Language: English language". However it's really just a > minor annoyance and I'm not sure what to do about it either. Sure > there are infoboxes which contain the shorter name but they are > different for every language version of Wikipedia and not all versions > have them so this doesn't work as a general mechanism for extracting > the labels. Maybe other data sources have better labels. CLDR [1]? But > that's not free, is it? > > Regards, > Simon > > [1] http://cldr.unicode.org/ > > -- *Bernard Vatant *Senior Consultant Vocabulary & Data Engineering Tel: +33 (0) 971 488 459 Mail: bernard.vatant@mondeca.com <mailto:bernard.vatant@mondeca.com> ---------------------------------------------------- *Mondeca** *3, cité Nollez 75018 Paris France Web: www.mondeca.com <http://www.mondeca.com> Blog: Leçons de Choses <http://mondeca.wordpress.com/> ----------------------------------------------------**
Received on Thursday, 2 April 2009 17:47:41 UTC