Re: Liaison to W3C Provenance Working Group

Hi Kai,

On 27 Sep 2011, at 10:41, Kai Eckert wrote:
> As you already said you would function as such a bridge person, if noone else volunteers, it would be great if you could do it and write a short description of the RDF WG, its goals, especially if they are in the context of provenance.

I have discussed this with the RDF-WG chairs, and gladly agree to act as a bridge person between Prov-WG and RDF-WG.

A description of RDF-WG and its provenance-related activities, goals and materials is attached below. Feel free to use this for the Prov-WG report as you see fit.

All the best,


RDF Working Group

One of the work items on the RDF WG's charter [1] is to “Standardize a model and semantics for multiple graphs and graphs stores”. This is expected to unify approaches such as “named graphs”, “quoted graphs”, “graph literals”, “quads” and so on.

These features are widely used in RDF deployments to model the provenance of information recorded in RDF. Indeed, the working group's collection of use cases for this “multigraphs support” identifies several provenance-related use cases [2]. The ability to address provenance-related use cases will be an important benchmark in evaluating proposals for this work item.

At the time of writing, a straw-man proposal, directly based on the notion of “RDF datasets” in SPARQL, is included in the Editor's Draft of the updated RDF Concepts document [3]. Being an Editor's Draft, it is a work in progress and does not have consensus within the WG. The section also contains links into the RDF-WG issue tracker where relevant mailing list discussion is archived.
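
For readers unfamiliar with the SPARQL notion of an “RDF dataset”, here is a minimal sketch of the idea: one default graph plus any number of graphs named by IRIs. This is only an illustration, not the text of the Editor's Draft. The class name, the example.org IRIs and the “retrievedFrom” property are all made up for the example.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional, Set, Tuple

Triple = Tuple[str, str, str]  # (subject, predicate, object), all as plain strings


@dataclass
class Dataset:
    """Hypothetical container: one default graph plus IRI-named graphs."""
    default_graph: Set[Triple] = field(default_factory=set)
    named_graphs: Dict[str, Set[Triple]] = field(default_factory=dict)

    def add(self, triple: Triple, graph: Optional[str] = None) -> None:
        # No graph name means the triple goes into the default graph.
        if graph is None:
            self.default_graph.add(triple)
        else:
            self.named_graphs.setdefault(graph, set()).add(triple)


EX = "http://example.org/"  # placeholder namespace, an assumption for this sketch

ds = Dataset()
# A statement recorded inside the named graph <http://example.org/g1>.
ds.add((EX + "alice", EX + "knows", EX + "bob"), graph=EX + "g1")
# A provenance statement *about* that graph, kept in the default graph.
ds.add((EX + "g1", EX + "retrievedFrom", EX + "source.rdf"))
```

The point of the sketch is that the graph name can itself appear as the subject of a triple, which is how many deployments attach provenance to a set of statements.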

The interest of the working group is limited to provenance of RDF data. Given the positioning of RDF as a web technology, the main focus is on provenance of RDF data on the web, but this is not exclusive, as RDF is also used “behind the firewall” in enterprise information integration, knowledge representation and other scenarios.

Relevant output of the working group is likely to include an extension of the RDF data model to support multiple RDF graphs, with an account of the formal semantics of the extension. Actually addressing provenance use cases will most likely require additional RDF vocabulary. The RDF WG is unlikely to standardize such vocabulary, as no new vocabularies are listed in its chartered deliverables, although adding new terms to RDF Schema [4] would not be impossible under the charter. The RDF WG is more likely to defer the creation of such vocabulary to other or future working groups: above all the W3C Provenance Working Group [5], but also the Government Linked Data WG [6], which has work items on describing best practices for versioning and on recommending/blessing (or standardizing) a metadata vocabulary suitable for provenance.

It is likely that the working group will also standardize a related syntax for multiple graphs, which may end up being similar to existing proposals such as TriG [7], Notation 3 [8] or N-Quads [9]. Other syntaxes may be retrofitted with multigraph support.
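
To give a rough feel for what a quad-based syntax looks like, the sketch below writes statements in the style of the N-Quads proposal, where each line carries subject, predicate, object and a fourth term for the graph. It assumes every term is an IRI (no literals or blank nodes), and the function name and example.org IRIs are invented for the illustration. Whatever syntax the WG eventually standardizes may differ.

```python
from typing import Iterable, Tuple

Quad = Tuple[str, str, str, str]  # (subject, predicate, object, graph), all IRIs


def to_nquads(quads: Iterable[Quad]) -> str:
    """Serialize IRI-only quads, one statement per line, N-Quads style."""
    lines = []
    for s, p, o, g in quads:
        # Each term is wrapped in angle brackets; the line ends with " ."
        lines.append(f"<{s}> <{p}> <{o}> <{g}> .")
    return "\n".join(lines)


EX = "http://example.org/"  # placeholder namespace, an assumption for this sketch

quads = [(EX + "alice", EX + "knows", EX + "bob", EX + "g1")]
nquads_text = to_nquads(quads)
```

TriG and Notation 3 instead group the triples of a graph inside a block rather than repeating the graph term on every line.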

[1] http://www.w3.org/2011/01/rdf-wg-charter
[2] http://www.w3.org/2011/rdf-wg/wiki/TF-Graphs-UC#Provenance_Use_Cases
[3] http://dvcs.w3.org/hg/rdf/raw-file/default/rdf-concepts/index.html#section-multigraph
[4] http://www.w3.org/TR/rdf-schema/
[5] http://www.w3.org/2011/prov/
[6] http://www.w3.org/2011/gld/charter#deliverables
[7] http://www4.wiwiss.fu-berlin.de/bizer/trig/
[8] http://www.w3.org/TeamSubmission/n3/
[9] http://sw.deri.org/2008/07/n-quads/

