W3C home > Mailing lists > Public > public-gld-wg@w3.org > November 2011

Re: Dataset catalog vocabulary: Why no dct:language in DCAT?

From: John Erickson <olyerickson@gmail.com>
Date: Tue, 22 Nov 2011 10:06:43 -0500
Message-ID: <CAC1Gg8SNAEGWM=njKQ=qiSq20AQoJ55sCQ+e2i=nhTVfLmYhPw@mail.gmail.com>
To: Richard Cyganiak <richard@cyganiak.de>
Cc: public-gld-wg@w3.org
RE adding it in, I would be happy to (please point me there).

FWIW, in the 116 catalogs whose DCAT-ish metadata we've aggregated in
IOGDS, we're seeing around 16 languages. That is not definitive; since
we are "scraping," it is a judgement call. This is why we need
authoritative sources generated catalog metadata! ;)

Also: the advice I've given our team is that dct:language should be at
both the catalog and dataset level --- priority is the catalog --- and
should allow multiple values (we've done the same thing with
dct:subject when converting Data.gov datasets with lists of
"keywords," one subject -> keyword).

I'll note all of this in the tracker as you've requested...

John

On Tue, Nov 22, 2011 at 9:34 AM, Richard Cyganiak <richard@cyganiak.de> wrote:
> John,
>
> On 15 Nov 2011, at 16:38, John Erickson wrote:
>> I tweeted this earlier, but I'll go old skool too:
>>
>> Does anyone know why dct:language is not part of #DCAT <http://t.co/loHgJejt>
>>
>> @cygri any thoughts?
>
> It didn't come up enough in our original review of data catalogs.
>
> That review was very much focused on English-language catalogs because that's pretty much all that was around at that time. With Open Data spreading to more and more countries, and aggregated catalogs becoming more prevalent, I think there's a good case for adding it.
>
> John would you mind raising an issue for this in the tracker?
>
> Best,
> Richard
>
>
>
>>
>> The reason I'm asking is mostly curiosity; we're refining our IOGDS
>> <http://t.co/7HPOldN> and will be adding dct:language the Catalog
>> level, first for statistics purposes and then as a browsable facet.
>> dct:language is in our metadata model, but as an "enhancement" that
>> gets added after scraping and first-pass conversion. The person doing
>> that work has noted the predominant languages (approx 16 across 116
>> catalogs from over 36 countries and international organizations) but
>> we don't have it "in the graph" (yet)...
>>
>> --
>> John S. Erickson, Ph.D.
>> Director, Web Science Operations
>> Tetherless World Constellation (RPI)
>> <http://tw.rpi.edu> <olyerickson@gmail.com>
>> Twitter & Skype: olyerickson
>>
>
>



-- 
John S. Erickson, Ph.D.
Director, Web Science Operations
Tetherless World Constellation (RPI)
<http://tw.rpi.edu> <olyerickson@gmail.com>
Twitter & Skype: olyerickson
Received on Tuesday, 22 November 2011 15:07:13 UTC

This archive was generated by hypermail 2.3.1 : Tuesday, 6 January 2015 20:32:34 UTC