- From: Kurt Cagle <kurt.cagle@gmail.com>
- Date: Thu, 23 May 2024 12:46:57 -0700
- To: "ddooss@wp.pl" <ddooss@wp.pl>
- Cc: Thomas Lörtsch <tl@rat.io>, RDF-star Working Group <public-rdf-star-wg@w3.org>, "Peter F. Patel-Schneider" <pfpschneider@gmail.com>
- Message-ID: <CALm0LSF7CYnLyMF+x=cXRHC8NDGkCAR=GqPP1nSCH-cJyP=3eg@mail.gmail.com>
Dominik, Thanks, I will incorporate the Cyper coding as well. *Kurt Cagle* Editor in Chief The Cagle Report kurt.cagle@gmail.com 443-837-8725 <http://voice.google.com/calls?a=nc,%2B14438378725> On Thu, May 23, 2024 at 12:33 PM ddooss@wp.pl <ddooss@wp.pl> wrote: > Dear Kurt > > Thank you for sharing your document on the proposed enhancements to RDF > reification and LPG harmonization. Your approach to addressing these issues > separately while utilizing a similar notation is intriguing and seems quite > promising. > > The named node expressions, in particular, provide a clear method for > transforming typically ephemeral blank nodes into actionable, referencable > elements. > > I would appreciate a formal presentation of these concepts in our next > meeting. A detailed exposition will help ensure that everyone understands > the intricacies of your proposal and will facilitate a thorough comparison > with other existing proposals. I definitely think that we need to have > formal definitions, as well as the semantics of your proposal. > > Regarding translating these ideas into a DDL/DML language like Cypher, > could you provide an example that mirrors the LPG scenario described? > Demonstrating how these RDF constructs could be represented in Cypher would > aid in evaluating their practical applicability in a graph database context. > > Best regards, > Dominik > > Dnia 23 maja 2024 19:24 Kurt Cagle <kurt.cagle@gmail.com> napisał(a): > > I've attached a document that covers YET ANOTHER proposal (more properly a > recommendation I've made before). > > There are two issues that we seem to be rehashing here. The first is the > question of reificational notation, while the second has to do with LPG > harmonization. My contention is that these are different issues, though we > can use similar notation for both. > > *Reification* > > A named reification is simply a set of statements: > > :r rdf:subject :s; rdf:predicate :p; rdf:object :o . > > This is not a triple. It is three statements about the state that a triple > can be in. It does not introduce a triple into the system,it makes no > assertions about the truthiness or even, by itself existence of that > triple. It is simply a statement about the components that a triple might > have. You cannot reason with it directly, though you can use other > processes (SPARQL, SHACL, etc.) to construct or verify the existence of > triples for which these assertions are true. Properly speaking, the above > itself should probably be qualified: > > :r rdf:subject :s; rdf:predicate :p; rdf:object :o ; a rdf:Reification . > > The notation << :r | :s :p :o >> makes the above statement more compact, > but the reification can apply to any triples within a system, or none at > all, regardless of the values. > > *Named Node Expressions* > > I propose, in the attached, that we use a similar nomenclature for what > I'm turning named node expressions, to whit: > > [ ?nn | :p1 :o1 ; :p2 :o2 ] > > where ?nn is replaced by a formal (not blank) IRI. > > This is a Turtle (not RDF) syntactical amendment. The above takes what > would ordinarily be a blank node and replaces it with a named node: > > For instance: > > :liz :hasMarriage [ :marriage 1 | :to :Ricard, :start "1965" ; :end "1975" > ]. > > which expands to: > > :liz :hasMarriage :marriage 1 . > :marriage 1 :to :Richard . > :marriage 1 :start "1965" . > :marriage 1 :end "1975" . > > Why is this important? Because the blank node is a pointer to a data > structure, but use of the [] notation makes it impossible to reference that > data structure from within Turtle. By adding in a named node as the > referencing node, you gain that ability, and it is a key ability for > modeling. > > For instance, I can use the expression: > > :liz :hasMarriage [ :marriage 1 | :start "1965" ; :end "1975"; :to > :richard ], [ :marriage 2 | :start "1975" ; :end "1985"; :to :john]. > > This is semantically equivalent to the JSON > > {"liz":{"hasMarriage":[{"marriage1":{"start":"1965", > "end":"1975","to":"richard"}},"marriage1":{"start":"1965", > "end":"1975","to":"richard"}}]}} > > The same thing can be done with both predicate-positioned named node > expressions and subject-oriented ones. > > This addresses the LPG equivalency relationship, and does so without ever > touching reifications. > > Note that this also highlights an important point. Blank nodes are useful > because they are unique and system-assigned. However, they are not > referenceable. The Turtle notation: > > :liz :hasMarriage _:b1, _:b2 . > _:b1 :start "1965" ; :end "1975"; :to :richard . > _:b2 :start "1975" ; :end "1985"; :to :john . > > is simply a preprocessor directive to replace the "named" nodes with > anonymous IRIs in the final indexing. You still have to make _:b1 and _:b2 > unique, or the data structures disintegrate. > > Anyway, I ask the chair for time during our next meeting to discuss this > proposal. > > *Kurt Cagle* > Editor in Chief > The Cagle Report > kurt.cagle@gmail.com > 443-837-8725 <http://voice.google.com/calls?a=nc,%2B14438378725> > > > On Wed, May 15, 2024 at 5:32 AM Thomas Lörtsch <tl@rat.io> wrote: > > YET ANOTHER GRAND UNIFYING PROPOSAL > =================================== > > What appeared as the way forward last winter is getting more and more > convoluted as the details are discussed. I agree with Niklas that this is a > result of the choice for Option 3, but it also is sign of a deeper problem: > we might still don’t work with the right primitives and still don’t have a > solid understanding of the problem we’re dealing with. I’m well aware that > everybody is tired and wants to be done with all this, but it seems to me > that we should change course, again. I’ll discuss some background first, > but then make a pretty concrete proposal of how to attack the problem by > combining the syntax of RDF-star with the semantics of singleton > properties. IMHO it has some very concrete advantages: less triples, less > confusing indirections, and more intuitive semantics. > > > BACKGROUND > ========== > > I see two main use cases for statement annotation: > > - n-ary relations > with a primary topic and secondary, qualifying attributes > -> that can be interpreted as INSTANTIATION > > - metadata annotations > that are orthogonal to the topic of the statement > -> that can be interpreted as REIFICATION > > The metadata use case (quite often characterized as provenance) is strong > in RDF land with its focus on integration of data from heterogeneous > sources. In LPG land much more emphasis is put on structuring the graph > into easy to navigate main relations and their less important details (and > attributed objects, but that’s another topic). Of course that is just a > very rough characterization, and overlaps in both directions are common. > > The metadata use case is well captured by REIFICATION because reification > stays clear of the annotated statement itself (lets keep in mind that > reification is a general concept and don’t associate it with the syntactic > verbosity its implementation in RDF for a moment). There is an air gap > between the statement and its reification that ensures that the original > statement is unencumbered and unchanged by the annotation. This is good for > the metadata use case but it is not easy to understand as recent mail > exchanges on the list between Olaf, Niklas, Bryan and Peter have shown > (again) and the indirection can cause irritating and unfortunate effects. > > In an n-ary relation the main relation can be understood as an > INSTANTIATION of the type of relation it represents, with each instance > having its own secondary attributes as qualifications. Instantiation is a > concept that is well understood and maps nicely to everyday > conceptualizations like "a car" (engine, four wheels, etc) and "my car" > (again engine, four wheels, etc, but also a sedan, blue, old, etc). > Instantiation is what drives the semantics of the singleton property > approach. > > Of course, the distinction is more one of tendencies than of a hard > separation: reification can represent n-ary relations and instantiation via > n-ary relations can represent metadata, one person’s data is another > person’s metadata, etc. However, in both cases that comes at a certain cost > in intuitiveness and naturalness. If used wrongly, subtle breaks can be > introduced that may lead to surprising and undesirable results. > > The problem with the singleton property approach as proposed by Nguyen et > al is that it tries to achieve its goal without a change to the syntax of > RDF. It lacks the boldness of RDF-star to introduce a new term type into > RDF. This makes it verbose, hard to optimize and requiring to entail the > primary relation as if it was an additional detail, an afterthought even. > On the other hand RDF-star was intially a syntax without a well-defined > semantics, or model theory even, and this WG still struggles to make it all > work out. This here is an attempt to re-use the singleton property > approach as the semantic underpinning of the RDF-star syntax, or, put the > other way, to augment the singleton property approach with the RDF-star > syntax, thereby getting rid of its verbosity. So let’s get to it. > > > CORE > ==== > > 1) a return to RDR and pre-CG RDF*: EACH TRIPLE TERM IS ASSERTED, e.g. > > << :s :p :o >> :b :c . > > asserts ' :s :p :o ' and annotates it in one go. This gets rid of the need > for an annotation/shorthand syntax and it safes an extra triple to actually > assert the assertion. It captures the predominant intuition: saying > something and adding detail to it. At this point it doesn’t matter much if > that detail is metadata or qualifying detail. What matters is that both are > solidly connected, not separated (and prone to mixups and misunderstandings > through overlapping multi-edge situations). > A query for { :s :p :o } in SPARQL-star on the above example must retrieve > the statement ':s :p :o' from the triple term << :s :p :o >>, etc - "Turtle > with holes". > This means that in common scenarios there is zero overhead because of > singleton property verbosity and entailments, unasserted assertions, etc. > The main use case is very straightforward to use (and implement, I reckon). > > > 2) the RDR/RDF* proposal is extended with TRIPLE TERM IDENTIFIERS not > unlike the current WG proposal, but with a twist: user-provided identifiers > are handled differently (more on that below). Just as in the current WG > proposal a bnode identifier is provided by the system for every triple > term, e.g. the above '<< :s :p :o >> :b :c .' is equivalent to > > << _:p1 | :s :p :o >> :b :c . > > The triple term is now a QUAD-TUPLE: the identifier becomes part of the > triple term also in model and abstract syntax, getting rid of the abstract > triple term type (the thing syntactically expressed as '<<( :s :p :o )>>' > in the current proposal - however, we will reuse that syntax, see below). > This identifier is equivalent to the singleton property itself in the > approach so named. > The statement identifier, refering to an instance/occurrence of the > abstract statement, is essential to capture the semantics of most use > cases, not the least LPG uses, where statements (or edges in LPG) of the > same type can occur multiple times, each with different and not to be mixed > up sets of annotations. > > The current WG proposal offers to users the possibility to explicitly > define an IRI en lieu of but semantically equivalent to the system provided > bnode, e.g. > > << :x | :s :p :o >> :b :c . > > Its purpose is to work around the limitations of line-based > serializations. We do this too, but in a different way (that’s the TWIST > hinted at above): an explicitly provided identifier is stored separately > from and additionally to the system provided bnode. The quad therefore > conceptually becomes a QUIN-TUPLE - however, stores may choose to just > store the explicit identifier via an extra statement, like in the mapping > discussed next. The rationale behind this arrangement will become apparent > below when we discuss many-to-many relations, sets and graphs. > > > 3) a MAPPING to standard RDF is based on the singleton property approach, > e.g. > > :s :p :o . > :s _:p1 :o . > _:p1 rdf12:singletonOf :p ; # _:p1 is a singleton property of _:p > rdf12:id :x ; # :x is a user-provided identifier > refering to _:p1 > :b :c . > > This should work well through the whole installed base and stack of > RDF/RDFS/OWL/etc, at least in principle (issues e.g. with missing predicate > indexes notwithstanding). [0] have found that singleton properties have > quite favorable properties w.r.t. reasoning (and even more so if, in > contrast to those authors, one interprets singleton annotations not as > constraints but as additional detail). > > > I claim that so far all this is pretty straightforward and covers the vast > majority of real world usage. It is cleaner and more concise than the > current proposal in that it doesn’t separate assertion from annotation, it > saves that extra un-asserted triple in storage and it makes the shorthand > annotation syntax superfluous. > > > Some details are important to understand: > > - rdf12:singletonOf rdfs:subPropertyOf rdf:type . > This reflects the intuition that each singleton is unique, an intuition > that is better expressed as instantiation than as subclassing. > > - the verbosity and optimization troubles of singelton properties, as > evidenced in the mapping, only occur in environments that don’t support > RDF-star triple terms (otherwise what would we need RDF-star for ;-). > > - the mapping loses the strong connection between a statement and its > annotation. Just as with the current WG proposal it is possible to have the > same statement asserted and, e.g. after merging a different source, > annotated but assumedly un-asserted. The latter information will get lost, > making the whole concept of un-asserted assertions brittle and unreliable. > The current WG proposal always has this problem, this proposal here only > when mapping to standard RDF. > > - the main difference between our proposal and the singleton property > approach is that we reverse access: we put the un-annotated statement in > the foreground (by means of the triple term syntax), both in the user > facing syntax and at the implementation level, whereas in the singleton > property approach it has to be entailed from the annotated singleton > statement. This makes our proposal much more straightforward to use and > implement. > > > > EXTENSIONS > ========== > > The current WG proposal tries to cover more ground than just statement > annotation, most notably annotating un-asserted assertions, but also other > stuff that depending on perspecticve seems like low-hanging fruit, > especially annotating sets of statements and referentially opaque statement > annotations. We argue that those are orthogonal demands and should be > implemented in a way that doesn’t complicate the above very simple basic > arrangement. It seems however that it is possible to achieve this with > modest effort. > > > UNASSERTED ASSERTIONS > > We re-use the syntax of abstract triple terms from the current WG proposal > to encode unasserted assertions, as the concept of abstract triple terms is > obsolete in our approach. Like triple terms they are four-tuples, i.e. they > always have an identifier implicitly provided by the system as a bnode. As > the use case is rather niche we consider the introduction of an unasserted > assertion in model and abstarct syntax overkill, but instead advocate to > implement the syntax as syntactic sugar for standard reification, e.g. > > <<( :s :p :o )>> :b :c . > > in standard RDF maps to standard reification: > > _:p2 rdf:type :rdf:Statement ; > rdf:subject :s ; > rdf:predicate :p ; > rdf:object :o ; > :b :c . > > The same for explicitly named un-asserted assertions like e.g. '<<( :x | > :s :p :o )>> :b :c .' > > _:p2 rdf:type :rdf:Statement ; > rdf:subject :s ; > rdf:predicate :p ; > rdf:object :o ; > rdf12:id :x ; > :b :c . > > > REFERENTIAL OPACITY > > Referential opacity has come up again lately and although I’m pretty wary > of the concept I can see a way in which its introduction will probably not > harm RDF in general. Most of all I like to see it as an orthogonal concern > that should not be entagled with annotations or un-asserted assertions as > such. Therefore I take up the idea of introducing yet another syntax (by > Enrico IIRC, in a recent telco), e.g. > > <<" :s :p :o ">> :b :c . > > Again this may be implemented as a new term type in model and abstract > syntax, or we may follow Antoine Zimmermanns proposal for an RDF literal > datatype. The latter can be employed to define referential opacity as > syntactic sugar and map to standard RDF maps as follows: > > :s :p :o . > :s _:p3 :o . > _:p3 rdf12:singletonOf :p ; > :b :c ; > rdf12:ofArtefact ":s :p :o"^^rdf:ttl . > > The RDF literal datatype documents precisely the syntactic representation > of the statement. This is a very un-intrusive approach to referential > opacity and IMHO won’t get in the way of standard RDF procedures. Of course > it doesn’t prevent undesirable entailments from being made, as the approach > to referential opacity taken by the CG proposal does, but at least it > allows to track them back to the original source and treat them > accordingly. Bnodes might either not be handled or be covered more fully > than in teh CG report proposal by allowing artefacts to be concise bounded > descriptions. E.g. an artefact ":s :p _:b1 . _:b1 :d :e , :f ."^^rdf:ttl > would give a full account of the meaning of _:b1 at the time the artefact > was created. > > > REFERENTIALLY OPAQUE UNASSERTED ASSERTIONS > > Again, referential opacity and unasserted assertions are orthogonal > concerns, and therefore yet another syntax is introduced to combine the > two, e.g. > > <<(" :s :p :o ")>> :b :c . > > Following the above proposals this is mapped to standard RDF by adding the > literal representation to the reification quad, e.g. > > _:p4 rdf:type :rdf:Statement ; > rdf:subject :s ; > rdf:predicate :p ; > rdf:object :o ; > rdf12:hasArtefact ":s :p :o"^^rdf:ttl ; > :b :c . > > Considering the mindboggling level of disambiguation that this arrangement > provides the complexity isn’t too bad IMHO ;-) > > > MANY-TO-MANY, SETS, GRAPHS > > Like the current WG proposal this approach doesn’t rule out many-to-many > relations, e.g. > > << :x | :s :p :o >> :b :c . > << :x | :u :v :w >> :b :c . > > We might even consider to introduce a supporting syntax, aka GRAPH TERMS, > e.g. > > > << :s :p :o . > :u :v :w >> :b :c . > > or, explicitly named > > << :x | :s :p :o . > :u :v :w >> :b :c . > > I don’t want to push the envelope too far (given the constraints imposed > by the charter, the controverses around the topic, etc) but it’s good to > see that this is syntactically straightforward - it isn’t with the > shorthand annotation syntax of the WG proposal. > > Anyway, employing the mapping to standard RDF from CORE above, we get a > straightforward definition of the meaning of many-to-many annotations (no > matter if they come as singleton terms or as hypothetical graph terms), > e.g. mapping the above many-to-many relation to > > :s :p :o . > :u :v :w . > :s _:p5 :o . > _:p5 rdf12:singletonOf :p ; > rdf12:id :x ; > :b :c . > :u _:v1 :w . > _:v1 rdf12:singletonOf :v ; > rdf12:id :x ; > :b :c . > > This establishes a FOR-EACH semantics: annotations of the graph term are > annotating each triple, not the graph (or set of triples if one prefers > that slightly looser wording) itself. The same is true for annpotations on > :x: they too are mapped to all statements so named, e.g. > > << :x | :s :p :o . > :u :v :w >> :b :c . > :x :d :e . > > is mapped to > > :s :p :o . > :u :v :w . > :s _:p5 :o . > _:p5 rdf12:singletonOf :p ; > rdf12:id :x ; > :b :c ; > :d :e . > :u _:v1 :w . > _:v1 rdf12:singletonOf :v ; > rdf12:id :x ; > :b :c ; > :d :e . > > > To annotate the set of triples itself one would have to create an explicit > reference via the identifier :x, e.g. > > :x rdf12:asObject [ # or "rdf12:asGraph", "rdf12:asSet" ... > :f :g > ] . > > A possible use case might be to express that a set of statements together > describe a situation, postulate a theory, etc. > This arrangement can also be used to annotate singleton statements as > objects of there own right (not as annotations to the predicate). > Semantically this is probably closer to reification than to n-ary > relations, but I’m not really sure myself what to make of it. In any case > it is more expressive than the current WG proposal which provides no means > to differentiate between the object and its content (httpRange-14 raising > its ugly head again, I guess) > > > However, this arrangement has repercussions on the interpretation of > annotations on :x (or _:x) in x-to-one cases, i.e. when single statements > are annotated, because it can only mean that also those annotations refer > to the statement as a whole, not its n-ary property. This is a departure > from the current state which leaves this question open - and sure to cause > some irritation. > > > TBC… I’m leaving it at this for reasons of time, but also to solicit some > general comments. The details most certainly need some more tweaking, as so > far all proposals did. The means to explicitly name an occurrence were > introduced to overcome the limitations of serialization, but they do open > the door to many-to-many relations, and that comes in handy when discussing > sets and graphs. However it mixes orthogonal concerns, so may have > unintended consequences. I expect this arrangement to be controversial, and > maybe buggy. Comments welcome! > > Best, > Thomas > > [0] https://link.springer.com/chapter/10.1007/978-3-319-58068-5_39 > > > > On 2. May 2024, at 16:00, Peter F. Patel-Schneider < > pfpschneider@gmail.com> wrote: > > > > The singleton property approach has benefits and downsides. The quoted > triple approach has benefits and downsides. > > > > One very big advantage of the singleton property approach is that it is > (barely) possible to use it with any RDF system, even RDF systems that have > no optimizations. A big disadvantage of the quoted triple approach is that > it requires new syntax, new semantics, and new implementations. > > > > One cannot successfully argue that just because the singleton property > approach may require more triples that it is inherently worse than the > quoted triple approach. RDF implementations can be tuned to the singleton > property approach, providing special data structures for singleton > properties and special code to optimize SPARQL queries for the singleton > property approach. > > > > One possible way to do this is to use a special approach for singleton > properties where the internal name of the blank node encodes the parent > property. This could result in minimal or even no storage overhead for > singleton properties. Of course the implementation effort to make this > completely transparent would be significant, but then so is the effort to > make a performative implementation of quoted triples. > > > > I note that in this approach the singleton property triples would look > very much like multiple edges, i.e., this could be considered to be a > space-efficient implementation of RDFn. > > > > peter > > > > > > On 4/30/24 15:46, Thompson, Bryan wrote: > >> Your proposal would require two statements on top of the original SPO > statement before you should begin to make assertions about the original SPO > statement? > >> Anything based on the singleton property approach will have quite an > impact on database statistics. The number of used predicates would jump > from millions (for open linked data) to the cardinality of the statements > about which statements are being made (e.g., billions, 10s of billions, > etc.). @Williams, Gregory <mailto:ngregwil@amazon.com> or @Schmidt, > Michael <mailto:schmdtm@amazon.com> can comment on this, but this > certainly places a new burden on common techniques for extracting > statistics from a graph. > >> Note that there is really no reason to rely on the P position in your > proposal. You could use S since it already allows blank nodes. You then > hang the Subject of the original asserted SPO on the statement about that > unique subject. (Or you could use O, which might be kinder for database > statistics since they tend to focus on SP* analysis.) > >> _:si :statementInstanceHasSubject :s . > >> _:si :p :o . > >> :s :p :o. > >> I have been impressed in the past with the space and time overhead > which arises out of various modeling decisions around possible statements > about statements treatments. I would recommend carefully considering that > impact. Another 2 triples makes a huge difference when all statements > carry annotations, as they do in some domains. For example, consider the > relatively common case in which you have a graph consisting of a topology > and edge weights. This is very common - lots of graphs are simply edges > and their weights. As I understand it, your proposal would have 3 times > the data volume to model the topology (some set of edges) in a manner which > would permit associating edge weights with the edges in that topology. And > the database would need to chase a long chain to obtain those edge weights > in a correct manner: :s :p :o. => :s _:pi :o => _:pi rdfs:subPropertyOf :p > . => _:pi :hasWeight 1.0. The cost of chasing that chain would make > applications relying on edge weights very expensive in both time and > space. I can't see that as being responsive to such use cases. To be > efficient, there needs to be a close association between an edge and the > properties of that edge. Their resolution needs to be very efficient. > >> Also note that this singleton property proposal would not support > alignment in the data (interoperability in the data) with LPG edge > properties. So it would fail to offer a unification path for the common > use cases of RDF and LPG. > >> Thanks, > >> Bryan > >> > ------------------------------------------------------------------------------ > >> *From:* Peter F. Patel-Schneider <pfpschneider@gmail.com> > >> *Sent:* Tuesday, April 30, 2024 10:40:18 AM > >> *To:* public-rdf-star-wg@w3.org > >> *Subject:* RE: [EXTERNAL] The singleton property option > >> CAUTION: This email originated from outside of the organization. Do not > click links or open attachments unless you can confirm the sender and know > the content is safe. > >> I think that this is far too strong. The singleton property approach > has > >> problems, but not to this extent. > >> For any statement that does not require annotation, the singleton > property > >> approach does not require any changes at all, i.e., just use > >> :s :p :o . > >> For a statement that does require annotation, the singleton property > requires > >> two or three triples, one to make the blank node a subproperty of the > desired > >> property, one to state the relationship using the blank node, and, if > the RDF > >> system does not implement RDFS semantics, one to make the statement > using the > >> regular property, i.e., > >> _:pi rdfs:subPropertyOf :p . > >> :s _:pi :o . > >> :s :p :o. > >> The added storage for this might be less than that needed for efficient > >> processing of quoted triples, particularly if the third statement is > not needed. > >> There is no need to change modelling if the statement is annotated > after the fact. > >> peter > >> On 4/30/24 12:26, Thompson, Bryan wrote: > >>> The singleton property approach undermines the direct use of > predicates in > >>> statements and forces a second hop for any use case to determine the > actual > >>> predicate used. It also requires that the "statement" is modeled > differently > >>> in advance, thus increasing the space requirements even if no > statements about > >>> statements are used. > >>> > >>> > >>> This is not efficient. > >>> > >>> > >>> Effectively, the singleton property model says that the RDF triple is > wrong. > >>> It says that you should model using (S ID O) and then model the > predicate and > >>> other information as statements about that ID. This is not the RDF > model. > >>> > >>> > >>> The approach with Statements about Statements should IMHO be built on > (S P O > >>> ID). That is, there is a unique identifier for the SPO and you make > >>> statements about that statement ID. > >>> > >>> > >>> Bryan > >>> > >>> > ------------------------------------------------------------------------------ > >>> *From:* Thomas Lörtsch <tl@rat.io> > >>> *Sent:* Tuesday, April 30, 2024 12:02:21 AM > >>> *To:* public-rdf-star-wg@w3.org; Thompson, Bryan; Niklas Lindström; > RDF-star > >>> Working Group > >>> *Subject:* RE: [EXTERNAL] The singleton property option > >>> > >>> *CAUTION*: This email originated from outside of the organization. Do > not > >>> click links or open attachments unless you can confirm the sender and > know the > >>> content is safe. > >>> > >>> > >>> Brian, > >>> > >>> Niklas combines the RDF-star syntax with the semantics of Singleton > >>> Properties. AFAIK no implementations of or papers on Singleton > Properties have > >>> done that. This combination doesn't even require an index on > properties. > >>> > >>> This combination is nearer to the original RDR approach than anything > else > >>> discussed by CG and WG. It is IMO a very neat idea and deserves a > closer look. > >>> > >>> Thomas > >>> > >>> > >>> > >>> Am 29. April 2024 19:06:37 MESZ schrieb "Thompson, Bryan" < > bryant@amazon.com>: > >>> > >>> The singleton property approach has many downsides and is > pragmatically > >>> unworkable. There is a good reason people are not happy with > this approach. > >>> > >>> > >>> Bryan > >>> > >>> > ------------------------------------------------------------------------------ > >>> *From:* Niklas Lindström <lindstream@gmail.com> > >>> *Sent:* Friday, April 26, 2024 2:08:41 PM > >>> *To:* RDF-star Working Group > >>> *Subject:* [EXTERNAL] The singleton property option > >>> CAUTION: This email originated from outside of the organization. > Do not > >>> click links or open attachments unless you can confirm the sender > and know > >>> the content is safe. > >>> > >>> > >>> > >>> For completeness (and perhaps to widen the perspective), here is > the > >>> singleton property option I briefly mentioned on the semantics > call > >>> (and alluded to in [1]). Also see [2] for the original; this is > just a > >>> quick strawman adaptation for the benefit of the LPG perspective. > >>> > >>> It extends RDF 1.1 differently; no triple terms, no opacity, just: > >>> > >>> 1. Allow bnodes as predicates (blank predicates). > >>> 2. Define rdf:singletonPropertyOf for linking those to the > property > >>> they represent instances/occurrences/edges of. > >>> > >>> 3. Well-formedness conditions: > >>> 3.1 Bnode predicates are only to be used once; with one s and o > >>> (similar to list cons nodes, who are "single purposed"). > >>> 3.2 The rdf:singletonPropertyOf is semantically functional > (exactly > >>> like rdf:first and rdf:rest). > >>> > >>> 4. For optimization, implementations can put triples with blank > >>> predicates in a dedicated table (using edgename as unique key), > >>> relying on well-formedness for cohesion. Such a table is > completed in > >>> two steps: 1) the singleton assertion inserts s and o for > edgename; 2) > >>> the rdf:singletonPropertyOf assertion inserts p for edgename. If > >>> well-formedness is broken, all optimization bets are off. Perhaps > a > >>> dedicated skolemization scheme can be employed for some more > control > >>> and/or "unstarring". > >>> > >>> 5. RDF-star syntax obviously needs no naming syntax; naming these > >>> would break well-formedness. > >>> 6. What these *mean* of course needs a good definition (property > >>> specializations, edge type instances or similar). Are they > asserted? > >>> Sure. Do they assert something using their rdf:singletonPropertyOf > >>> property as predicate? No. (Could they? Well, they can be declared > >>> ("inline") to *also* be subPropertyOf the same property, and > through > >>> entailment that would happen.) > >>> 7. Reifiers become a usage pattern (informative) as suggested > from the > >>> property edge perspective. Any desired :reifiedBy or :partOf > relation > >>> can link predicate singletons to one or more "reifiers". > >>> > >>> Basic example: > >>> > >>> << :s :p :o >> :source <stream662be7ba> ; > >>> :timestampMills 1714153402 . > >>> > >>> Expands to: > >>> > >>> :s _:e1 :o . > >>> _:e1 rdf:singletonPropertyOf :p ; > >>> :source <stream662be7ba> ; > >>> :timestampMills 1714153402 . > >>> > >>> Annotation syntax: > >>> > >>> :s :p :o {| :reifiedBy <#reifier> |} . > >>> > >>> Expands to: > >>> > >>> :s :p :o . > >>> :s _:e1 :o . > >>> _:e1 rdf:singletonPropertyOf :p ; > >>> :reifiedBy <#reifier> . > >>> > >>> Possible singleton property entailment?: > >>> > >>> _:e1 a rdf:SingletonProperty; > >>> rdf:subject :s ; > >>> rdf:prediate :p ; > >>> rdf:object :o . > >>> > >>> Will entailment break well-formedness if (accidentally?) *put > back* > >>> into a regular graph? Of course, just as RDF lists are "broken" > >>> whenever that happens (as in look terrible when serialized, make > no > >>> sense when queried, etc.). > >>> > >>> Best regards, > >>> Niklas > >>> > >>> [1]: > >>> < > https://lists.w3.org/Archives/Public/public-rdf-star-wg/2024Apr/0158.html > >>> < > https://lists.w3.org/Archives/Public/public-rdf-star-wg/2024Apr/0158.html > >> < > https://lists.w3.org/Archives/Public/public-rdf-star-wg/2024Apr/0158.html > >>> > >>> [2]: <https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4350149/ > >>> <https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4350149/ > >> <https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4350149/>>> > >>> > > > > >
Received on Thursday, 23 May 2024 19:47:30 UTC