Re: ISSUE-7 (whyAccessUrl): Drop dcat:accessUrl, use the URI of the dcat:Download resource instead [DCAT]

On 9 Feb 2012, at 15:31, Sarven Capadisli wrote:
> I agree with Ed Summers' proposal to drop dcat:accessURL and simply use dcat:distribution.

The problem is that accessURL is optional – some datasets are not distributed online but through other means. In that case, what would be used as the node representing the dcat:Distribution? A blank node? A made-up URI? So there needs to be a special case in both production and consumption code to deal with this case, and that just increases the probability that implementers with less RDF experience get it wrong.

> With the exception of allowing dcat:distribution to have an HTML page as one of the ranges, it is actually similar to void:dataDump [1].
> On that note, and perhaps this is a separate issue in on of itself, we should drop the HTML page for the downloads as a possibility.

The problem is that a number of catalogs don't distinguish between “direct download link” and “link to a web page that explains where and how to get the data”. A notable example is If we forbid the use of a HTML page as accessURL, then cannot use dcat without re-annotating their entire catalog.

> VoID's dataDump definition has a note on this that's worth considering:
> "The void:dataDump property should not be used for linking to a download web page. It should only be used for linking directly to dump files. This is to ensure that the link can be used by automated spiders that cannot find their way through an HTML page. If a publisher desires to provide a link to a download page as well, then they should use the foaf:page property instead."
> If the possible range is unclear or confusing for the publisher (as raised by Fadi Maali [2]), we should reconsider the term. I personally find something along the lines of "dataDump" to be far more clear for its intention than "download", "access" or "distribution".

There are many uses of accessURL that simply are not covered by the term “data dump”. Consider RSS feeds, SPARQL endpoints, PDF reports, …


> [1]
> [2]
> -Sarven

Received on Thursday, 9 February 2012 20:56:58 UTC