- From: Kevin Smathers <kevin.smathers@hp.com>
- Date: Mon, 30 Jun 2003 11:45:00 -0700
- To: "John S. Erickson" <john.erickson@hp.com>
- Cc: www-rdf-dspace@w3.org
John S. Erickson wrote: >Alberto made a huge point: > > > >>in our interpretation of provenance/contexts in RDFStore we assumed >>that a statement represents a fact that is asserted as true in a >>certain context. This circumstance (e.g. space/temporal, situation or >>scope) where the statement has been stated represents “contextual” >>information about the statement [1][2]. For example, when triples are >>being added to a graph it is often useful to be able to track back >>where they came from (e.g. Internet source Web site or domain), how >>they were added, by whom, why, when (e.g. date), when they will expire >>(e.g. Time-To-Live) and so on. Such context (or provenance information) >>can be thought of as an additional and orthogonal dimension to the >>other 3 components. This concept is not part of the current RDF data >>model [3] and referred to as “statement reification". From the >>application developer point of view there is a clear need for such >>primitive constructs to layer different levels of semantics on top of >>RDF which can not be represented in the RDF triples space.... >> >> > >JSE: The notion of preserving the context of a statement WITHOUT TRANSFORMING >THAT STATEMENT is critical for RDF application developers and I believe is >being overlooked. I believe RDF's current approach, which reifies the >statement, is artifically invasive and complex. > >In a real world, statements will be conceptually contained, aggregated and >nested; it seems crazy that in order to deal with them in such a way, we must >artificially blow them apart. > > > Aside from the issue of complexity, there are also other reasons not to transform original statements. Digital signatures lose their meaning if the data they refer to is transformed. But there are (at least) two ways of looking at solving the need for non-transformation of data. First you can query down to the original data through an RDF proxy of some sort that transforms the view of the original data into a form that is needed by the query processor. Second you can transport the original content as an attachment along with the transformed data. The transformation can then be rechecked at any time without affecting the original data, or requiring the use of a proxy (thus providing for consistent indexing and performance characteristics across all data sources.) The first type is what I was trying to get at when I proposed adding dynamic data sources to the Simile technologies document. I think that the second more closely maps to the ingest/publish process that we've discussed previously. Within the context of the second approach, I'm not sure it is important to RDF programmers whether we use quads or four-statement reification, since they won't see either of those expressions of their data themselves. Cheers, -kls -- ======================================================== Kevin Smathers kevin.smathers@hp.com Hewlett-Packard kevin@ank.com Palo Alto Research Lab 1501 Page Mill Rd. 650-857-4477 work M/S 1135 650-852-8186 fax Palo Alto, CA 94304 510-247-1031 home ======================================================== use "Standard::Disclaimer"; carp("This message was printed on 100% recycled bits.");
Received on Monday, 30 June 2003 14:46:22 UTC