- From: Thomas Lörtsch <tl@rat.io>
- Date: Thu, 16 May 2024 11:06:01 +0200
- To: Miel Vander Sande <miel.vandersande@meemoo.be>
- CC: RDF-star Working Group <public-rdf-star-wg@w3.org>, "Peter F. Patel-Schneider" <pfpschneider@gmail.com>
Am 16. Mai 2024 10:52:14 MESZ schrieb Miel Vander Sande <miel.vandersande@meemoo.be>: >Hi Thomas, > >Check, understood! So the mapping is mostly for legacy systems. Yep, but also to express in simpler and well-understood ways what the triple term and its annotations mean. > Are stores >that implement RDF-star expected to interpret these standard RDF documents >as RDF-star semantics? As discussed there is a slight difference in that the connection between asserted statement and annotation in any triple-centric mapping to RDF (*) is never as solid as it is in an approach that treats triple terms as asserted (which this proposal does, just like the original RDR/RDF* proposal). Apart from that the semantics are the same. Of course queries will differ which is probably more relevant in most cases but again unavoidable. I hope that answers your question. If not pease don't hesitate to ask for further detail! Best, Thomas (*) I fitst wanted to say "in any mapping" but that would be wrong as n-ary relations are standard RDF and can express anything. However, they are akin to relational representations and don't provide the easy to read/query simple triple with the main assertion that makes RDF attractive. > >Best, > >Miel > >On Thu, 16 May 2024, 10:01 Thomas Lörtsch, <tl@rat.io> wrote: > >> Hi Miel, >> >> this proposal is based on RDF-star triple terms. Its semantics and the >> mapping to standard RDF however are based on the Singleton Properties >> proposal. So the problems with efficiency of Singleton Properties don’t >> occur if a store implements RDF-star triple terms. That’s explained in more >> detail in the proposal, but I understand that this can be overlooked at >> first reading because it’s a long mail. Maybe check the section titled CORE >> again. >> >> Best, >> Thomas >> >> >> > On 16. May 2024, at 09:19, Miel Vander Sande <miel.vandersande@meemoo.be> >> wrote: >> > >> > Hi Thomas, >> > >> > Thanks for the proposal, but I'm missing something important. The main >> concern about singleton properties is the way Triple Stores can efficiently >> handle them. They often rely on a low number of unique predicates. If >> compatibility/mapping with standard RDF is kept, would that not make it >> hard or impossible for triple stores to optimise these singleton >> properties? And if singleton properties are somehow treated differently, >> are they not the same as a statement identifier that is independent from >> the predicate (as proposed by some of the other proposals)? >> > >> > How would you position your proposal wrt this? Al least for me, >> expressiveness is useless if it can be queried or retrieved within >> reasonable time. >> > >> > Cheers, >> > >> > Miel >> > >> > >> > >> > >> > On Wed, 15 May 2024, 14:32 Thomas Lörtsch, <tl@rat.io> wrote: >> > YET ANOTHER GRAND UNIFYING PROPOSAL >> > =================================== >> > >> > What appeared as the way forward last winter is getting more and more >> convoluted as the details are discussed. I agree with Niklas that this is a >> result of the choice for Option 3, but it also is sign of a deeper problem: >> we might still don’t work with the right primitives and still don’t have a >> solid understanding of the problem we’re dealing with. I’m well aware that >> everybody is tired and wants to be done with all this, but it seems to me >> that we should change course, again. I’ll discuss some background first, >> but then make a pretty concrete proposal of how to attack the problem by >> combining the syntax of RDF-star with the semantics of singleton >> properties. IMHO it has some very concrete advantages: less triples, less >> confusing indirections, and more intuitive semantics. >> > >> > >> > BACKGROUND >> > ========== >> > >> > I see two main use cases for statement annotation: >> > >> > - n-ary relations >> > with a primary topic and secondary, qualifying attributes >> > -> that can be interpreted as INSTANTIATION >> > >> > - metadata annotations >> > that are orthogonal to the topic of the statement >> > -> that can be interpreted as REIFICATION >> > >> > The metadata use case (quite often characterized as provenance) is >> strong in RDF land with its focus on integration of data from heterogeneous >> sources. In LPG land much more emphasis is put on structuring the graph >> into easy to navigate main relations and their less important details (and >> attributed objects, but that’s another topic). Of course that is just a >> very rough characterization, and overlaps in both directions are common. >> > >> > The metadata use case is well captured by REIFICATION because >> reification stays clear of the annotated statement itself (lets keep in >> mind that reification is a general concept and don’t associate it with the >> syntactic verbosity its implementation in RDF for a moment). There is an >> air gap between the statement and its reification that ensures that the >> original statement is unencumbered and unchanged by the annotation. This is >> good for the metadata use case but it is not easy to understand as recent >> mail exchanges on the list between Olaf, Niklas, Bryan and Peter have shown >> (again) and the indirection can cause irritating and unfortunate effects. >> > >> > In an n-ary relation the main relation can be understood as an >> INSTANTIATION of the type of relation it represents, with each instance >> having its own secondary attributes as qualifications. Instantiation is a >> concept that is well understood and maps nicely to everyday >> conceptualizations like "a car" (engine, four wheels, etc) and "my car" >> (again engine, four wheels, etc, but also a sedan, blue, old, etc). >> Instantiation is what drives the semantics of the singleton property >> approach. >> > >> > Of course, the distinction is more one of tendencies than of a hard >> separation: reification can represent n-ary relations and instantiation via >> n-ary relations can represent metadata, one person’s data is another >> person’s metadata, etc. However, in both cases that comes at a certain cost >> in intuitiveness and naturalness. If used wrongly, subtle breaks can be >> introduced that may lead to surprising and undesirable results. >> > >> > The problem with the singleton property approach as proposed by Nguyen >> et al is that it tries to achieve its goal without a change to the syntax >> of RDF. It lacks the boldness of RDF-star to introduce a new term type into >> RDF. This makes it verbose, hard to optimize and requiring to entail the >> primary relation as if it was an additional detail, an afterthought even. >> On the other hand RDF-star was intially a syntax without a well-defined >> semantics, or model theory even, and this WG still struggles to make it all >> work out. This here is an attempt to re-use the singleton property >> approach as the semantic underpinning of the RDF-star syntax, or, put the >> other way, to augment the singleton property approach with the RDF-star >> syntax, thereby getting rid of its verbosity. So let’s get to it. >> > >> > >> > CORE >> > ==== >> > >> > 1) a return to RDR and pre-CG RDF*: EACH TRIPLE TERM IS ASSERTED, e.g. >> > >> > << :s :p :o >> :b :c . >> > >> > asserts ' :s :p :o ' and annotates it in one go. This gets rid of the >> need for an annotation/shorthand syntax and it safes an extra triple to >> actually assert the assertion. It captures the predominant intuition: >> saying something and adding detail to it. At this point it doesn’t matter >> much if that detail is metadata or qualifying detail. What matters is that >> both are solidly connected, not separated (and prone to mixups and >> misunderstandings through overlapping multi-edge situations). >> > A query for { :s :p :o } in SPARQL-star on the above example must >> retrieve the statement ':s :p :o' from the triple term << :s :p :o >>, etc >> - "Turtle with holes". >> > This means that in common scenarios there is zero overhead because of >> singleton property verbosity and entailments, unasserted assertions, etc. >> The main use case is very straightforward to use (and implement, I reckon). >> > >> > >> > 2) the RDR/RDF* proposal is extended with TRIPLE TERM IDENTIFIERS not >> unlike the current WG proposal, but with a twist: user-provided identifiers >> are handled differently (more on that below). Just as in the current WG >> proposal a bnode identifier is provided by the system for every triple >> term, e.g. the above '<< :s :p :o >> :b :c .' is equivalent to >> > >> > << _:p1 | :s :p :o >> :b :c . >> > >> > The triple term is now a QUAD-TUPLE: the identifier becomes part of the >> triple term also in model and abstract syntax, getting rid of the abstract >> triple term type (the thing syntactically expressed as '<<( :s :p :o )>>' >> in the current proposal - however, we will reuse that syntax, see below). >> This identifier is equivalent to the singleton property itself in the >> approach so named. >> > The statement identifier, refering to an instance/occurrence of the >> abstract statement, is essential to capture the semantics of most use >> cases, not the least LPG uses, where statements (or edges in LPG) of the >> same type can occur multiple times, each with different and not to be mixed >> up sets of annotations. >> > >> > The current WG proposal offers to users the possibility to explicitly >> define an IRI en lieu of but semantically equivalent to the system provided >> bnode, e.g. >> > >> > << :x | :s :p :o >> :b :c . >> > >> > Its purpose is to work around the limitations of line-based >> serializations. We do this too, but in a different way (that’s the TWIST >> hinted at above): an explicitly provided identifier is stored separately >> from and additionally to the system provided bnode. The quad therefore >> conceptually becomes a QUIN-TUPLE - however, stores may choose to just >> store the explicit identifier via an extra statement, like in the mapping >> discussed next. The rationale behind this arrangement will become apparent >> below when we discuss many-to-many relations, sets and graphs. >> > >> > >> > 3) a MAPPING to standard RDF is based on the singleton property >> approach, e.g. >> > >> > :s :p :o . >> > :s _:p1 :o . >> > _:p1 rdf12:singletonOf :p ; # _:p1 is a singleton property of _:p >> > rdf12:id :x ; # :x is a user-provided identifier >> refering to _:p1 >> > :b :c . >> > >> > This should work well through the whole installed base and stack of >> RDF/RDFS/OWL/etc, at least in principle (issues e.g. with missing predicate >> indexes notwithstanding). [0] have found that singleton properties have >> quite favorable properties w.r.t. reasoning (and even more so if, in >> contrast to those authors, one interprets singleton annotations not as >> constraints but as additional detail). >> > >> > >> > I claim that so far all this is pretty straightforward and covers the >> vast majority of real world usage. It is cleaner and more concise than the >> current proposal in that it doesn’t separate assertion from annotation, it >> saves that extra un-asserted triple in storage and it makes the shorthand >> annotation syntax superfluous. >> > >> > >> > Some details are important to understand: >> > >> > - rdf12:singletonOf rdfs:subPropertyOf rdf:type . >> > This reflects the intuition that each singleton is unique, an intuition >> that is better expressed as instantiation than as subclassing. >> > >> > - the verbosity and optimization troubles of singelton properties, as >> evidenced in the mapping, only occur in environments that don’t support >> RDF-star triple terms (otherwise what would we need RDF-star for ;-). >> > >> > - the mapping loses the strong connection between a statement and its >> annotation. Just as with the current WG proposal it is possible to have the >> same statement asserted and, e.g. after merging a different source, >> annotated but assumedly un-asserted. The latter information will get lost, >> making the whole concept of un-asserted assertions brittle and unreliable. >> The current WG proposal always has this problem, this proposal here only >> when mapping to standard RDF. >> > >> > - the main difference between our proposal and the singleton property >> approach is that we reverse access: we put the un-annotated statement in >> the foreground (by means of the triple term syntax), both in the user >> facing syntax and at the implementation level, whereas in the singleton >> property approach it has to be entailed from the annotated singleton >> statement. This makes our proposal much more straightforward to use and >> implement. >> > >> > >> > >> > EXTENSIONS >> > ========== >> > >> > The current WG proposal tries to cover more ground than just statement >> annotation, most notably annotating un-asserted assertions, but also other >> stuff that depending on perspecticve seems like low-hanging fruit, >> especially annotating sets of statements and referentially opaque statement >> annotations. We argue that those are orthogonal demands and should be >> implemented in a way that doesn’t complicate the above very simple basic >> arrangement. It seems however that it is possible to achieve this with >> modest effort. >> > >> > >> > UNASSERTED ASSERTIONS >> > >> > We re-use the syntax of abstract triple terms from the current WG >> proposal to encode unasserted assertions, as the concept of abstract triple >> terms is obsolete in our approach. Like triple terms they are four-tuples, >> i.e. they always have an identifier implicitly provided by the system as a >> bnode. As the use case is rather niche we consider the introduction of an >> unasserted assertion in model and abstarct syntax overkill, but instead >> advocate to implement the syntax as syntactic sugar for standard >> reification, e.g. >> > >> > <<( :s :p :o )>> :b :c . >> > >> > in standard RDF maps to standard reification: >> > >> > _:p2 rdf:type :rdf:Statement ; >> > rdf:subject :s ; >> > rdf:predicate :p ; >> > rdf:object :o ; >> > :b :c . >> > >> > The same for explicitly named un-asserted assertions like e.g. '<<( :x | >> :s :p :o )>> :b :c .' >> > >> > _:p2 rdf:type :rdf:Statement ; >> > rdf:subject :s ; >> > rdf:predicate :p ; >> > rdf:object :o ; >> > rdf12:id :x ; >> > :b :c . >> > >> > >> > REFERENTIAL OPACITY >> > >> > Referential opacity has come up again lately and although I’m pretty >> wary of the concept I can see a way in which its introduction will probably >> not harm RDF in general. Most of all I like to see it as an orthogonal >> concern that should not be entagled with annotations or un-asserted >> assertions as such. Therefore I take up the idea of introducing yet another >> syntax (by Enrico IIRC, in a recent telco), e.g. >> > >> > <<" :s :p :o ">> :b :c . >> > >> > Again this may be implemented as a new term type in model and abstract >> syntax, or we may follow Antoine Zimmermanns proposal for an RDF literal >> datatype. The latter can be employed to define referential opacity as >> syntactic sugar and map to standard RDF maps as follows: >> > >> > :s :p :o . >> > :s _:p3 :o . >> > _:p3 rdf12:singletonOf :p ; >> > :b :c ; >> > rdf12:ofArtefact ":s :p :o"^^rdf:ttl . >> > >> > The RDF literal datatype documents precisely the syntactic >> representation of the statement. This is a very un-intrusive approach to >> referential opacity and IMHO won’t get in the way of standard RDF >> procedures. Of course it doesn’t prevent undesirable entailments from being >> made, as the approach to referential opacity taken by the CG proposal does, >> but at least it allows to track them back to the original source and treat >> them accordingly. Bnodes might either not be handled or be covered more >> fully than in teh CG report proposal by allowing artefacts to be concise >> bounded descriptions. E.g. an artefact ":s :p _:b1 . _:b1 :d :e , :f >> ."^^rdf:ttl would give a full account of the meaning of _:b1 at the time >> the artefact was created. >> > >> > >> > REFERENTIALLY OPAQUE UNASSERTED ASSERTIONS >> > >> > Again, referential opacity and unasserted assertions are orthogonal >> concerns, and therefore yet another syntax is introduced to combine the >> two, e.g. >> > >> > <<(" :s :p :o ")>> :b :c . >> > >> > Following the above proposals this is mapped to standard RDF by adding >> the literal representation to the reification quad, e.g. >> > >> > _:p4 rdf:type :rdf:Statement ; >> > rdf:subject :s ; >> > rdf:predicate :p ; >> > rdf:object :o ; >> > rdf12:hasArtefact ":s :p :o"^^rdf:ttl ; >> > :b :c . >> > >> > Considering the mindboggling level of disambiguation that this >> arrangement provides the complexity isn’t too bad IMHO ;-) >> > >> > >> > MANY-TO-MANY, SETS, GRAPHS >> > >> > Like the current WG proposal this approach doesn’t rule out many-to-many >> relations, e.g. >> > >> > << :x | :s :p :o >> :b :c . >> > << :x | :u :v :w >> :b :c . >> > >> > We might even consider to introduce a supporting syntax, aka GRAPH >> TERMS, e.g. >> > >> > >> > << :s :p :o . >> > :u :v :w >> :b :c . >> > >> > or, explicitly named >> > >> > << :x | :s :p :o . >> > :u :v :w >> :b :c . >> > >> > I don’t want to push the envelope too far (given the constraints imposed >> by the charter, the controverses around the topic, etc) but it’s good to >> see that this is syntactically straightforward - it isn’t with the >> shorthand annotation syntax of the WG proposal. >> > >> > Anyway, employing the mapping to standard RDF from CORE above, we get a >> straightforward definition of the meaning of many-to-many annotations (no >> matter if they come as singleton terms or as hypothetical graph terms), >> e.g. mapping the above many-to-many relation to >> > >> > :s :p :o . >> > :u :v :w . >> > :s _:p5 :o . >> > _:p5 rdf12:singletonOf :p ; >> > rdf12:id :x ; >> > :b :c . >> > :u _:v1 :w . >> > _:v1 rdf12:singletonOf :v ; >> > rdf12:id :x ; >> > :b :c . >> > >> > This establishes a FOR-EACH semantics: annotations of the graph term are >> annotating each triple, not the graph (or set of triples if one prefers >> that slightly looser wording) itself. The same is true for annpotations on >> :x: they too are mapped to all statements so named, e.g. >> > >> > << :x | :s :p :o . >> > :u :v :w >> :b :c . >> > :x :d :e . >> > >> > is mapped to >> > >> > :s :p :o . >> > :u :v :w . >> > :s _:p5 :o . >> > _:p5 rdf12:singletonOf :p ; >> > rdf12:id :x ; >> > :b :c ; >> > :d :e . >> > :u _:v1 :w . >> > _:v1 rdf12:singletonOf :v ; >> > rdf12:id :x ; >> > :b :c ; >> > :d :e . >> > >> > >> > To annotate the set of triples itself one would have to create an >> explicit reference via the identifier :x, e.g. >> > >> > :x rdf12:asObject [ # or "rdf12:asGraph", "rdf12:asSet" ... >> > :f :g >> > ] . >> > >> > A possible use case might be to express that a set of statements >> together describe a situation, postulate a theory, etc. >> > This arrangement can also be used to annotate singleton statements as >> objects of there own right (not as annotations to the predicate). >> Semantically this is probably closer to reification than to n-ary >> relations, but I’m not really sure myself what to make of it. In any case >> it is more expressive than the current WG proposal which provides no means >> to differentiate between the object and its content (httpRange-14 raising >> its ugly head again, I guess) >> > >> > >> > However, this arrangement has repercussions on the interpretation of >> annotations on :x (or _:x) in x-to-one cases, i.e. when single statements >> are annotated, because it can only mean that also those annotations refer >> to the statement as a whole, not its n-ary property. This is a departure >> from the current state which leaves this question open - and sure to cause >> some irritation. >> > >> > >> > TBC… I’m leaving it at this for reasons of time, but also to solicit >> some general comments. The details most certainly need some more tweaking, >> as so far all proposals did. The means to explicitly name an occurrence >> were introduced to overcome the limitations of serialization, but they do >> open the door to many-to-many relations, and that comes in handy when >> discussing sets and graphs. However it mixes orthogonal concerns, so may >> have unintended consequences. I expect this arrangement to be >> controversial, and maybe buggy. Comments welcome! >> > >> > Best, >> > Thomas >> > >> > [0] https://link.springer.com/chapter/10.1007/978-3-319-58068-5_39 >> > >> > >> > > On 2. May 2024, at 16:00, Peter F. Patel-Schneider < >> pfpschneider@gmail.com> wrote: >> > > >> > > The singleton property approach has benefits and downsides. The >> quoted triple approach has benefits and downsides. >> > > >> > > One very big advantage of the singleton property approach is that it >> is (barely) possible to use it with any RDF system, even RDF systems that >> have no optimizations. A big disadvantage of the quoted triple approach is >> that it requires new syntax, new semantics, and new implementations. >> > > >> > > One cannot successfully argue that just because the singleton property >> approach may require more triples that it is inherently worse than the >> quoted triple approach. RDF implementations can be tuned to the singleton >> property approach, providing special data structures for singleton >> properties and special code to optimize SPARQL queries for the singleton >> property approach. >> > > >> > > One possible way to do this is to use a special approach for singleton >> properties where the internal name of the blank node encodes the parent >> property. This could result in minimal or even no storage overhead for >> singleton properties. Of course the implementation effort to make this >> completely transparent would be significant, but then so is the effort to >> make a performative implementation of quoted triples. >> > > >> > > I note that in this approach the singleton property triples would look >> very much like multiple edges, i.e., this could be considered to be a >> space-efficient implementation of RDFn. >> > > >> > > peter >> > > >> > > >> > > On 4/30/24 15:46, Thompson, Bryan wrote: >> > >> Your proposal would require two statements on top of the original SPO >> statement before you should begin to make assertions about the original SPO >> statement? >> > >> Anything based on the singleton property approach will have quite an >> impact on database statistics. The number of used predicates would jump >> from millions (for open linked data) to the cardinality of the statements >> about which statements are being made (e.g., billions, 10s of billions, >> etc.). @Williams, Gregory <mailto:ngregwil@amazon.com> or @Schmidt, >> Michael <mailto:schmdtm@amazon.com> can comment on this, but this >> certainly places a new burden on common techniques for extracting >> statistics from a graph. >> > >> Note that there is really no reason to rely on the P position in your >> proposal. You could use S since it already allows blank nodes. You then >> hang the Subject of the original asserted SPO on the statement about that >> unique subject. (Or you could use O, which might be kinder for database >> statistics since they tend to focus on SP* analysis.) >> > >> _:si :statementInstanceHasSubject :s . >> > >> _:si :p :o . >> > >> :s :p :o. >> > >> I have been impressed in the past with the space and time overhead >> which arises out of various modeling decisions around possible statements >> about statements treatments. I would recommend carefully considering that >> impact. Another 2 triples makes a huge difference when all statements >> carry annotations, as they do in some domains. For example, consider the >> relatively common case in which you have a graph consisting of a topology >> and edge weights. This is very common - lots of graphs are simply edges >> and their weights. As I understand it, your proposal would have 3 times >> the data volume to model the topology (some set of edges) in a manner which >> would permit associating edge weights with the edges in that topology. And >> the database would need to chase a long chain to obtain those edge weights >> in a correct manner: :s :p :o. => :s _:pi :o => _:pi rdfs:subPropertyOf :p >> . => _:pi :hasWeight 1.0. The cost of chasing that chain would make >> applications relying on edge weights very expensive in both time and >> space. I can't see that as being responsive to such use cases. To be >> efficient, there needs to be a close association between an edge and the >> properties of that edge. Their resolution needs to be very efficient. >> > >> Also note that this singleton property proposal would not support >> alignment in the data (interoperability in the data) with LPG edge >> properties. So it would fail to offer a unification path for the common >> use cases of RDF and LPG. >> > >> Thanks, >> > >> Bryan >> > >> >> ------------------------------------------------------------------------------ >> > >> *From:* Peter F. Patel-Schneider <pfpschneider@gmail.com> >> > >> *Sent:* Tuesday, April 30, 2024 10:40:18 AM >> > >> *To:* public-rdf-star-wg@w3.org >> > >> *Subject:* RE: [EXTERNAL] The singleton property option >> > >> CAUTION: This email originated from outside of the organization. Do >> not click links or open attachments unless you can confirm the sender and >> know the content is safe. >> > >> I think that this is far too strong. The singleton property >> approach has >> > >> problems, but not to this extent. >> > >> For any statement that does not require annotation, the singleton >> property >> > >> approach does not require any changes at all, i.e., just use >> > >> :s :p :o . >> > >> For a statement that does require annotation, the singleton property >> requires >> > >> two or three triples, one to make the blank node a subproperty of the >> desired >> > >> property, one to state the relationship using the blank node, and, if >> the RDF >> > >> system does not implement RDFS semantics, one to make the statement >> using the >> > >> regular property, i.e., >> > >> _:pi rdfs:subPropertyOf :p . >> > >> :s _:pi :o . >> > >> :s :p :o. >> > >> The added storage for this might be less than that needed for >> efficient >> > >> processing of quoted triples, particularly if the third statement is >> not needed. >> > >> There is no need to change modelling if the statement is annotated >> after the fact. >> > >> peter >> > >> On 4/30/24 12:26, Thompson, Bryan wrote: >> > >>> The singleton property approach undermines the direct use of >> predicates in >> > >>> statements and forces a second hop for any use case to determine the >> actual >> > >>> predicate used. It also requires that the "statement" is modeled >> differently >> > >>> in advance, thus increasing the space requirements even if no >> statements about >> > >>> statements are used. >> > >>> >> > >>> >> > >>> This is not efficient. >> > >>> >> > >>> >> > >>> Effectively, the singleton property model says that the RDF triple >> is wrong. >> > >>> It says that you should model using (S ID O) and then model the >> predicate and >> > >>> other information as statements about that ID. This is not the RDF >> model. >> > >>> >> > >>> >> > >>> The approach with Statements about Statements should IMHO be built >> on (S P O >> > >>> ID). That is, there is a unique identifier for the SPO and you make >> > >>> statements about that statement ID. >> > >>> >> > >>> >> > >>> Bryan >> > >>> >> > >>> >> ------------------------------------------------------------------------------ >> > >>> *From:* Thomas Lörtsch <tl@rat.io> >> > >>> *Sent:* Tuesday, April 30, 2024 12:02:21 AM >> > >>> *To:* public-rdf-star-wg@w3.org; Thompson, Bryan; Niklas Lindström; >> RDF-star >> > >>> Working Group >> > >>> *Subject:* RE: [EXTERNAL] The singleton property option >> > >>> >> > >>> *CAUTION*: This email originated from outside of the organization. >> Do not >> > >>> click links or open attachments unless you can confirm the sender >> and know the >> > >>> content is safe. >> > >>> >> > >>> >> > >>> Brian, >> > >>> >> > >>> Niklas combines the RDF-star syntax with the semantics of Singleton >> > >>> Properties. AFAIK no implementations of or papers on Singleton >> Properties have >> > >>> done that. This combination doesn't even require an index on >> properties. >> > >>> >> > >>> This combination is nearer to the original RDR approach than >> anything else >> > >>> discussed by CG and WG. It is IMO a very neat idea and deserves a >> closer look. >> > >>> >> > >>> Thomas >> > >>> >> > >>> >> > >>> >> > >>> Am 29. April 2024 19:06:37 MESZ schrieb "Thompson, Bryan" < >> bryant@amazon.com>: >> > >>> >> > >>> The singleton property approach has many downsides and is >> pragmatically >> > >>> unworkable. There is a good reason people are not happy with >> this approach. >> > >>> >> > >>> >> > >>> Bryan >> > >>> >> > >>> >> ------------------------------------------------------------------------------ >> > >>> *From:* Niklas Lindström <lindstream@gmail.com> >> > >>> *Sent:* Friday, April 26, 2024 2:08:41 PM >> > >>> *To:* RDF-star Working Group >> > >>> *Subject:* [EXTERNAL] The singleton property option >> > >>> CAUTION: This email originated from outside of the >> organization. Do not >> > >>> click links or open attachments unless you can confirm the >> sender and know >> > >>> the content is safe. >> > >>> >> > >>> >> > >>> >> > >>> For completeness (and perhaps to widen the perspective), here >> is the >> > >>> singleton property option I briefly mentioned on the semantics >> call >> > >>> (and alluded to in [1]). Also see [2] for the original; this is >> just a >> > >>> quick strawman adaptation for the benefit of the LPG >> perspective. >> > >>> >> > >>> It extends RDF 1.1 differently; no triple terms, no opacity, >> just: >> > >>> >> > >>> 1. Allow bnodes as predicates (blank predicates). >> > >>> 2. Define rdf:singletonPropertyOf for linking those to the >> property >> > >>> they represent instances/occurrences/edges of. >> > >>> >> > >>> 3. Well-formedness conditions: >> > >>> 3.1 Bnode predicates are only to be used once; with one s and o >> > >>> (similar to list cons nodes, who are "single purposed"). >> > >>> 3.2 The rdf:singletonPropertyOf is semantically functional >> (exactly >> > >>> like rdf:first and rdf:rest). >> > >>> >> > >>> 4. For optimization, implementations can put triples with blank >> > >>> predicates in a dedicated table (using edgename as unique key), >> > >>> relying on well-formedness for cohesion. Such a table is >> completed in >> > >>> two steps: 1) the singleton assertion inserts s and o for >> edgename; 2) >> > >>> the rdf:singletonPropertyOf assertion inserts p for edgename. If >> > >>> well-formedness is broken, all optimization bets are off. >> Perhaps a >> > >>> dedicated skolemization scheme can be employed for some more >> control >> > >>> and/or "unstarring". >> > >>> >> > >>> 5. RDF-star syntax obviously needs no naming syntax; naming >> these >> > >>> would break well-formedness. >> > >>> 6. What these *mean* of course needs a good definition (property >> > >>> specializations, edge type instances or similar). Are they >> asserted? >> > >>> Sure. Do they assert something using their >> rdf:singletonPropertyOf >> > >>> property as predicate? No. (Could they? Well, they can be >> declared >> > >>> ("inline") to *also* be subPropertyOf the same property, and >> through >> > >>> entailment that would happen.) >> > >>> 7. Reifiers become a usage pattern (informative) as suggested >> from the >> > >>> property edge perspective. Any desired :reifiedBy or :partOf >> relation >> > >>> can link predicate singletons to one or more "reifiers". >> > >>> >> > >>> Basic example: >> > >>> >> > >>> << :s :p :o >> :source <stream662be7ba> ; >> > >>> :timestampMills 1714153402 . >> > >>> >> > >>> Expands to: >> > >>> >> > >>> :s _:e1 :o . >> > >>> _:e1 rdf:singletonPropertyOf :p ; >> > >>> :source <stream662be7ba> ; >> > >>> :timestampMills 1714153402 . >> > >>> >> > >>> Annotation syntax: >> > >>> >> > >>> :s :p :o {| :reifiedBy <#reifier> |} . >> > >>> >> > >>> Expands to: >> > >>> >> > >>> :s :p :o . >> > >>> :s _:e1 :o . >> > >>> _:e1 rdf:singletonPropertyOf :p ; >> > >>> :reifiedBy <#reifier> . >> > >>> >> > >>> Possible singleton property entailment?: >> > >>> >> > >>> _:e1 a rdf:SingletonProperty; >> > >>> rdf:subject :s ; >> > >>> rdf:prediate :p ; >> > >>> rdf:object :o . >> > >>> >> > >>> Will entailment break well-formedness if (accidentally?) *put >> back* >> > >>> into a regular graph? Of course, just as RDF lists are "broken" >> > >>> whenever that happens (as in look terrible when serialized, >> make no >> > >>> sense when queried, etc.). >> > >>> >> > >>> Best regards, >> > >>> Niklas >> > >>> >> > >>> [1]: >> > >>> < >> https://lists.w3.org/Archives/Public/public-rdf-star-wg/2024Apr/0158.html >> > >>> < >> https://lists.w3.org/Archives/Public/public-rdf-star-wg/2024Apr/0158.html >> > >> < >> https://lists.w3.org/Archives/Public/public-rdf-star-wg/2024Apr/0158.html >> >>> >> > >>> [2]: <https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4350149/ >> > >>> <https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4350149/ >> > >> <https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4350149/>>> >> > >>> >> > > >> > >> > >> >>
Received on Thursday, 16 May 2024 09:06:13 UTC