- From: Andy Seaborne <andy@apache.org>
- Date: Fri, 03 Jan 2014 21:10:25 +0000
- To: Gregg Kellogg <gregg@greggkellogg.net>
- CC: public-propertygraphs@w3.org
On 03/01/14 18:25, Gregg Kellogg wrote: > On Jan 3, 2014, at 1:55 AM, Andy Seaborne <andy@apache.org> wrote: > >> Gregg looked at using RDF to encode property graphs (PG), using bnode properties for links with attributes. [1] >> >> What about other way round - mapping RDF to PG? >> >> In the PropertyGraphs data model there are vertexes, links (edges, relationalships) and attributes (named values for a vertex and edges). >> >> This is similar to XML (elements and attributes) and OWL (object properties and data type properties). >> >> 2 features of RDF make the mapping from RDF into PG easy: >> >> 1/ Literals only appear in the object position. >> 2/ Properties don't have qualifiers. >> >> Blank Nodes can be treated some kind of label to be distinguished from URIs. We write use the pseudo URI scheme "_:" here. >> >> Mapping RDF onto PropertyGraphs -- >> >> A/ Each subject or object URI or bNode becomes a vertex with a URI or bNode as label. >> >> B/ Each property value of subject V, if it has a literal object, it becomes an attribute of V, named by the URI of the property. > > From my understanding, a PG attribute is single-valued (e.g., [attr=value], whereas RDF properties can be multi-valued (:s :attr :value1, :value2). How might we handle this? Modeling literal values as a vertex might solve this, but it might require using something like [rdf:value "the literal value"; :datatype <datatype>; :language "en" ]. Vertexes being literal-labeled is a possibility. I think it makes the PG "unnatural" - I trying to send up with something that worked as a PG. In PG (well, the simple java implementation), it seems the value can be anything, including a set, bag or list. Storing a JSON doc as a value puts structure in, not 1st class but not 2nd class either. As the "authors of a paper" prototypical example shows, datastructures matter. The role of index to find vertexes by property-values is fairly explicit. Maybe structures won't work out; maybe they will. The data model isn't a simple, automatic choice. IMO RDF should have had 1st class data structures - all this RDF list (seq,alt,bag) encoded in triples does not play out in practical terms. Users work in lists and they should not have to worry about encoding in triples/edges. Eventually, the triples show through and it's messy for the user. The nice theoretic model isn't nice in practice. I also recognize that list or set values themselves have serious implications at scale and if persistent storage is involved (or should we assume NVRAM?). Andy > >> C/ Each property value of subject V that is a URI or blank node, becomes an edge in the property graph, with label the URI of the property. >> >> Attributes on edges are not used. >> >> In the language of the RDF-2004 working group, literals are not tidy [2] in property graphs. >> >> In RDF-speak: >> >> :a :p 123 . >> :b :q 123 . >> >> and pattern >> { ?x :p ?v . >> ?y :q ?v . } >> >> works because 123 has in-edges of :p and :q. It is a tidy literal meaning there is on node in the graph for 123, not two. >> >> This does not happen in PropertyGraphs; access to attributes is different to accessing vertexes (e.g. the ".has" operation in gremlin). >> >> This mapping also suggests how to turn a property graph without attributes on edges into an RDF graph. Edges with attributes can be encoded as n-ary relationships [3]. > > I like the idea of N-ary relationships for such modeling; they are similar to my JSON-LD proposal, but avoid the use of BNode predicates. > > Gregg > >> An alternative mapping is to not use attributes at all. Each subject or object becomes a vertex, and every triple becomes and edge. But now the labelling of the vertexes needs to include datatype/URI information. >> >> This is much more an encoding of RDF onto PG, rather than a mapping. >> >> In an encoding, the information is there and can be extracted but working directly on the encoding with the PG tools can be unnatural. In a mapping, you could query and navigate the PG graph using PG tools e.g. look up all vertexes with an attribute of 123. >> >> Andy >> >> [1] http://lists.w3.org/Archives/Public/semantic-web/2013Aug/0026.html >> [2] http://www.w3.org/TR/2002/WD-rdf-concepts-20020829/ >> [3] http://www.w3.org/TR/swbp-n-aryRelations/ >> >
Received on Friday, 3 January 2014 21:11:04 UTC