Re: News from lingvoj.org

Bernard Vatant wrote:
> Hello all
> 
> I've started refreshing the content of pages at http://lingvoj.org, 
> which was long overdue.
> The data set now links to DBpedia (of course), Freebase and OpenCyc 
> (which had been broken since OpenCyc had changed its URIs a while ago). 
> But there are no backwards links from those yet... so far the only 
> dataset that I know linking to lingvoj.org is Linked MDB, but it does 
> not use the correct URIs [1]. :'( Please don't be shy! lingvoj.org URIs 
> are "cool" : stable, simple, and dereferencable. ;-)
> Bandwidth permitting, I will now focus on data quality, for the most 
> used language (ISO 639-1 list).

Hi Bernard,

Great news! Re. interlinking: I'm linking to lingvoj in the university project I'm currently working on. Should it ever get into a publishable state I will show you right away. ;-)

In this same project I have a minor problem with the lingvoj data: I'm using the labels in your dataset so I can display names of the languages in the user's language in the user interface. However the Wikipedia article names you use for those labels don't always lend themselves that nicely for user interface labels. Very often they say something like "English language" to distinguish the article from the one about the attribute "English" in general. But then in my interface it would say: "Language: English language". However it's really just a minor annoyance and I'm not sure what to do about it either. Sure there are infoboxes which contain the shorter name but they are different for every language version of Wikipedia and not all versions have them so this doesn't work as a general mechanism for extracting the labels. Maybe other data sources have better labels. CLDR [1]? But that's not free, is it?

Regards,
  Simon

[1] http://cldr.unicode.org/

Received on Thursday, 2 April 2009 10:36:42 UTC