Re: Hydra use case: Linked Data Fragments (ISSUE-30) from Ruben Verborgh on 2014-03-14 (public-hydra@w3.org from March 2014)

From: Ruben Verborgh <ruben.verborgh@ugent.be>
Date: Fri, 14 Mar 2014 13:26:18 +0000
To: Markus Lanthaler <markus.lanthaler@gmx.net>
Cc: public-hydra@w3.org
Message-Id: <3D83966D-45AB-44B2-8C4E-A21F98FAD064@ugent.be>
Hi Markus,

> So, in principle "basic Linked Data Fragments" are a more sophisticated and
> generalized ersion of Luca Matteis' Restpark API
> (http://lmatteis.github.io/restpark/), right? Are you aware of Restpark? I
> forgot it myself but after reading your mail I had a flashback :-)

Basic Linked Data Fragments share the URI template of Restpark.
I actually had a rather similar experience as you;
read about them and forgot until Luca pinged me.

However, whereas Restpark still has "query" in its terminology
(for instance, there is a limit parameter);
basic LDFs are really just specific fragment of a dataset
that can (and should) be interpreted separately from their application.
And that's where Hydra comes in:
my client might use fragments to solve SPARQL queries,
but other clients might do something completely different.

>>    :dbpedia void:subset <http://data-
>> cdn.linkeddatafragments.org/dbpedia?subject=&predicate=dbpedia-
>> owl%3AbirthPlace&object=dbpedia%3ANew_York>;
>>        hydra:search _:triplePattern.
> 
> This all makes perfect sense to me.. the only thing that you might wanna
> change (not sure) is to what hydra:search is attached to.  In this case
> here, I (as a client) would assume that you further query that Linked Data
> Fragment (instead of querying the whole DBpedia dataset).

The above are two distinct triples, right? So I'm saying that:

>> :dbpedia void:subset <…………>.
>> :dbpedia hydra:search _:triplePattern.

So this does capture the semantics that the whole dataset is searched?
I.e., would the client know that the query searches DBpedia, not the fragment?

> Really cool stuff. I see a lot of potential for this. It can be used to add
> extremely sophisticated querying to Hydra-powered Web APIs without
> (over)burdening the server as most other solutions do.


>> 1) How should a parameter be serialized in the URI template?
>> 
> [...]
>> 
>> How can I explain to clients which ones it can use,
>> which ones are the same and which ones are different?
> 
> Very good question. This is tracked as ISSUE-30, right?
> 
>  https://github.com/HydraCG/Specifications/issues/30

Exactly. I've added a link there to this use case.
Ideally, this use case is used as one test to see whether the issue is resolved.

> We can either define (and fix) how
> IRIs/literals are to be serialized or we add a mechanism to describe how
> they should be serialized.

That's it.
But… the full flexibility that this use case needs
will probably be overkill for many use cases.
So I'm afraid there will have to be a mechanism,
because few would want to go all-the-way.

As I've shown, for this use case it's crucial to distinguish
beween literals and URIs. It's a no-go to do anything else.
But it would probably be unreasonable to expect
that people will want to indicate this difference all the time.
(For instance, always have < > around URIs or "" around strings.)

> Allowing to describe the expected serialization
> format is much more flexible but makes the implementation of (primarily)
> clients more difficult.

What could work is "convention over configuration".
(But still allowing configuration.)

>> And of course, there would be many more ways to parse parameters.
>> I could live with only giving one that works for clients,
>> but it should be consistent and allow to differentiate between strings
>> and URIs.
> 
> Would be your preference or can you "just live with it"?

What I mean is that:
the server currently supports different ways to pass a URI.
You could abbreviate it with prefixes, or have to full URI in < >.
It would be totally fine with me if Hydra were only able
to explain just one of them, and not both.

But it would need to explain one of them.

> Do you think there
> are many cases where a variable can take both an IRI and a literal and the
> distinction is important?

No, in the majority of cases it won't be;
because there are few properties that could either take a URI or string.
rdf:object is actually one of the only ones.

But in this case, it is rdf:object I need.

> I kind of have troubles to find an example where that would matter…

In the LDF use case it does, hence my mail ;-)

I understand that a spec cannot be tailored to individual needs,
but LDF could be a big and compelling use case for Hydra.

What I would propose is something like:

   _:object hydra:variable "object";
       hydra:property rdf:object;
       hydra:serialization hydra:NodeSerialization.

Where hydra:NodeSerialization is a way that distinguishes
between IRIs, literals, blank nodes, and variables.

The default ("convention over configuration") could be hydra:TextualSerialization,
where the IRIs or literals as-is value is passed; losing the ability to distinguish.

Summarized: simple cases stay simple,
complex cases are supported and still simple.

>> 2) What do the subject, predicate, and object properties really mean?
> 
> My take on this would be to either specialize the IriTemplate class to
> something like a LdfIriTemplate or to specialize hydra:search.. something
> lik ldf:queryInterface.

That's an interesting option and I like it.
However, I wonder whether Hydra itself could also have
"collection search semantics" built in;
so a specialization of hydra:search that says
"and I will now return those element of the collection
 that directly have the specified property values”.

A discussion in this direction is here:
http://lists.w3.org/Archives/Public/public-hydra/2014Feb/0153.html

I think such a use case would be common enough
to justify its inclusion in Hydra.

> You could then even go as far as saying
> 
>  ldf:queryInterface a hydra:TemplatedLink ;
>    supportedOperation [
>      a ldf:RetrieveBasicLdfOperation ;
>      hydra:method "GET"
>      hydra:returns ldf:BasicLdf
>    ] .
> 
> (sorry, haven't looked up LDF vocabulary yet)

Neither have I :-)

I would also make it a subclass then of hydra:search;
or the more specific property in Hydra is we decide to create that one.

> I'm pretty excited about this as I really see a lot of potential. It would
> be interesting to see if a Hydra ApiDocumentation would provide enough
> information to dynamically "crawl" the data instead of querying it by SPO.
> Have you spent any thoughts on that already?

Oh! No I hadn't. Documentation is a very nice application area indeed. Thanks!

Best,

Ruben
Received on Friday, 14 March 2014 13:26:56 UTC