Re: More on: Should information be merged from several RDF files?

From: Martynas Jusevičius <martynas@graphity.org>
Date: Tue, 8 Jul 2014 03:01:10 +0200
Message-ID: <CAE35VmzUPVL7PMKNL+TsayYUwUYRVHdyi9yxEh=3YQ6obkTD+A@mail.gmail.com>
To: Victor Porton <porton@narod.ru>
Cc: SW-forum Web <semantic-web@w3.org>
From reading your PDF, it seems like you're trying to do natively in
RDF what GRDDL [1] does using XSLT?
Why though? XSLT is perfectly good for it. It would be an interesting
exercise, however, to model XSLT in RDF, just like SPIN [2] models
SPARQL in RDF.

[1] http://www.w3.org/TR/grddl-primer/
[2] http://spinrdf.org/spin.html

Martynas
graphityhq.com

On Mon, Jul 7, 2014 at 11:17 PM, Victor Porton <porton@narod.ru> wrote:
> I am writing a program.
>
> I read RDF files while executing my program.
>
> After each RDF file is loaded, my program performs some actions (and possibly terminates).
>
> Which RDF file will be loaded next is not predictable, because between loads my program does some computations, and the next file to load depends on these computations.
>
> As such, I cannot first load all RDF files and merge the information in them. Instead, I need to load RDF files one by one and update my program's data structures after reading each file.
>
> If I could read all RDF files at once, I could simply merge the data from all of them. But I cannot do that.
>
> Upon reading each RDF file, I update the internal data structures of my program based on the RDF triples loaded.
>
> I cannot build these internal data structures from the set-theoretic union of all RDF triples loaded so far. The reason is that loading an additional RDF file may render my data inconsistent (if it has two or more different objects for a predicate which should have no more than one value, as in the example below). This would require removing some data from my program's data structures, which would needlessly complicate the code. I want only to add new data structures, not remove them, to keep my program simple.
>
> So the only remaining option is to load RDF files one by one and construct new internal data structures based only on the last loaded RDF file (not on all loaded RDF files together).
>
> A question remains:
>
> # file-1.rdf
> <http://example.com> <#property-which-can-have-only-one-value> 1 .
>
> # file-2.rdf
> <http://example.com> <#property-which-can-have-only-one-value> 2 .
>
> Suppose we first load file-1.rdf and later file-2.rdf. Should the triple from file-2.rdf be ignored? Or should I construct a new data structure from the data of both files, as if the subject URLs in these files were different?
>
> Here is my project, by the way:
> http://freesoft.portonvictor.org/namespaces.xml
>
> --
> Victor Porton - http://portonvictor.org
>
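
To make the first of the two options in the question concrete, here is a
minimal Python sketch of an add-only store in which the first value seen for
a single-valued predicate wins and later conflicting triples are set aside.
It is only an illustration, not the program from the original post: the
names IncrementalStore, SINGLE_VALUED and load are hypothetical, triples are
plain tuples rather than parser output, and it assumes the relative predicate
in both files resolves to the same IRI (with different base IRIs it would
not, and there would be no conflict at all).

```python
from typing import Dict, Iterable, List, Tuple

Triple = Tuple[str, str, object]

# Assumption: the application knows which predicates may hold only one value.
SINGLE_VALUED = {"#property-which-can-have-only-one-value"}


class IncrementalStore:
    """Add-only store that is fed one parsed RDF file at a time."""

    def __init__(self) -> None:
        # (subject, predicate) -> objects accepted so far
        self.values: Dict[Tuple[str, str], List[object]] = {}
        # Conflicting triples are recorded here, never applied
        self.ignored: List[Triple] = []

    def load(self, triples: Iterable[Triple]) -> None:
        for s, p, o in triples:
            current = self.values.setdefault((s, p), [])
            if o in current:
                continue  # exact duplicate, nothing to add
            if p in SINGLE_VALUED and current:
                # A value already exists: keep it and set the new one aside,
                # so nothing ever has to be removed from the store.
                self.ignored.append((s, p, o))
                continue
            current.append(o)


# The two files from the example, already parsed into triples.
file_1 = [("http://example.com", "#property-which-can-have-only-one-value", 1)]
file_2 = [("http://example.com", "#property-which-can-have-only-one-value", 2)]

store = IncrementalStore()
store.load(file_1)
store.load(file_2)
print(store.values)
print(store.ignored)
```

Keeping the rejected triples in a separate list means the conflict stays
visible to the caller, which can still decide later to treat file-2.rdf as
describing a separate resource.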
Received on Tuesday, 8 July 2014 01:01:37 UTC
