RE: RDF-star vs Wikidata for modelling Richard Burton

Isn't this example just a modelling issue? 

Multiple marriages between the same two people are nevertheless still multiple marriages. 
So if the history was described in a marriage-centric way rather than a person-centric way, then everything would be 'easy'. 

-----Original Message-----
From: Peter Patel-Schneider <pfpschneider@gmail.com> 
Sent: Saturday, 4 December, 2021 01:42
To: thomas lörtsch <tl@rat.io>; public-rdf-star@w3.org
Subject: RDF-star vs Wikidata for modelling Richard Burton

Although there is quite a bit about the Wikidata (actually Wikibase) data model that I disagree with, I don't think that it is fair to say that it is horrible.

As far as I can tell from
https://www.mediawiki.org/wiki/Wikibase/DataModel, there is no ordering of the statements for an entity in Wikibase nor is there any ordering of the statements for a property of an entity in Wikibase.  The ordering that one sees on Wikidata pages is simply an artifact of the display.  Although there may be some information in Wikidata that drives this ordering it is not representationally significant and can change without affecting the meaning of the data in Wikidata.  (Well, at least the information about Richard Burton, would not change simply because ordering in the display of his marriage information changes.)

Where Wikibase is different from RDF* is that there is no guarantee that there is only one statement (Wikibase's rough analogue of a
triple) with a given entity (subject), mainSnak property (predicate), and mainSnak value (object).  This can be seen in Richard Burton's spouses where there are two statements with entity Richard Burton, mainSnak property spouse, and value Elizabeth Taylor.  My understanding is that having no auxiliary information (such as start and end times) associated with these two statements would be something that should be somehow fixed up, but as there are different start and end times for these two statements then everything is in order.

So this use case (marrying the same person more than once) works very well in Wikibase, but at the cost of potentially having what might be called duplicate triples.  Users of WIkidata should be aware of this possibility and arrange to do whatever their right thing is when they encounter this situation.

RDF* does not handle this nearly as well.  There is discussion on this very point in https://github.com/w3c/rdf-star/issues/36


Note that I'm not saying that the best way to model serial-monagamy- with-possibility-of-remarriage-to-the-same-person is via spouse statements.  I'm just saying that Wikibase can handle decorated spouse triples much better than RDF* can.  This use case provides an example where uniqueness hurts for more than just beliefs or provenance and shows that the :occurence solution has big problems.

peter


On Fri, 2021-12-03 at 12:22 +0100, thomas lörtsch wrote:
> 
> 
> Am 3. Dezember 2021 11:40:10 MEZ schrieb Pierre-Antoine Champin
> <pierre-antoine.champin@ercim.eu>:
> > On 03/12/2021 09:23, Dan Brickley wrote:
> > 
> > > Yes, nice and concrete!
> > > 
> > > What if Alice were Elizabeth Taylor, and Bob were Richard Burton
> > 
> > great minds think alike... here is the example I recently proposed 
> > for addition in the spec:
> > 
> > https://pr-preview.s3.amazonaws.com/w3c/rdf-star/pull/225.html#marri

> > ed-example
> 
> 
> You use a new property :occurrence instead of :occurrenceOf in the 
> example before that example. Is there a reason for that?
> 
> Also you use the abbreviated syntax. It might be helpful to use the 
> un- abbreviated syntax in all examples except one section where the 
> abbreviated syntax is explained. Your example does however show that 
> the abbreviated syntax doesn't make anything better w.r.t. the 
> wikidata usecase.
> 
> The way wikidata models this is arguably even more horrible. It is an 
> ordered list with marriages of Richard Burton. Each marriage is a list 
> entry which could be a blank node, etc. Elisabeth Taylor presumably 
> has her own list of marriages and nothing but ardent querying connects 
> the two lists I suppose.
> 
> 
> > and that was before reading Ora's use-cases... ;)
> 
> The wikidata use case has been part of the RDF* use cases for a year 
> now, or two?
> 
> 
> Maybe it would be helpful to name this problem properly - not 
> "wikidata usecase" but rather "multiset problem", or "multiset use case"?
> 
> 
> Best,
> Thomas
> 
> 
> > > Wikidata's data model records their two marriages as edge 
> > > annotations, on the
> > > https://m.wikidata.org/wiki/Property:P26 relationship linking them.
> > > 
> > > Eg. https://m.wikidata.org/wiki/Q34851

> > > https://m.wikidata.org/wiki/Q151973

> > > 
> > > From a Schema.org point of view, being able to express more of 
> > > what Wikidata can do seems attractive for interop.
> > > 
> > > Dan
> > > 
> > > In Burton's entry we have :
> > > spouse
> > > 
> > > Elizabeth Taylor
> > > start time 15 March 1964
> > > end time 26 June 1974
> > > series ordinal 2
> > > 1 reference
> > > The Peerage person ID p33443.htm#i334430 retrieved 7 August 2020
> > > 
> > > Sybil Christopher
> > > start time 5 February 1949
> > > February 1949
> > > end time 5 December 1963
> > > series ordinal 1
> > > 1 reference
> > > The Peerage person ID p33443.htm#i334430 retrieved 7 August 2020
> > > 
> > > Suzy Miller
> > > end time 1982
> > > start time 21 August 1976
> > > series ordinal 4
> > > 0 references
> > > 
> > > Elizabeth Taylor
> > > start time 10 October 1975
> > > end time 29 July 1976
> > > series ordinal 3
> > > 0 references
> > > 
> > > Sally Burton
> > > start time 3 July 1983
> > > end time 5 August 1984
> > > series ordinal 5
> > > 0 references
> > > 
> > > 
> > > On Thu, 2 Dec 2021, 21:12 Jos De Roo, <josderoo@gmail.com> wrote:
> > > 
> > >     Hi Ora,
> > > 
> > >     Very nice use cases and to me it looks quite natural to 
> > > express
> > >     them as
> > > 
> > >     :Bob :isMarriedTo :Alice .
> > >     [] :repr << :Bob :isMarriedTo :Alice >>; :since 2020 ; :source
> > >     :NYTimes .
> > >     [] :repr << :Bob :isMarriedTo :Alice >>; :since 2021 ; :source
> > >     :WashingtonPost .
> > > 
> > >     :M1 :pipe :M2 .
> > >     [] :repr << :M1 :pipe :M2 >>; :size "DN 100"; :schedule "30" .
> > >     [] :repr << :M1 :pipe :M2 >>; :size "DN 125"; :schedule "10" .
> > > 
> > > 
> > >     Kind regards,
> > >     Jos
> > > 
> > >     -- https://josd.github.io

> > >     <https://josd.github.io>
> > > 
> > > 
> > >     On Thu, Dec 2, 2021 at 7:44 PM Lassila, Ora <ora@amazon.com>
> > > wrote:
> > > 
> > >         Folks,
> > > 
> > >         Attached is a document that outlines a couple of uses 
> > > cases
> > >         (variants of one modeling pattern ,really) we would like 
> > > to
> > >         submit for consideration by the upcoming RDF-star Working
> > >         Group. I am submitting these now just in case this turns 
> > > out
> > >         to be relevant to how the charter gets written. Comments 
> > > are
> > >         welcome, and I am happy to discuss these use cases 
> > > whenever.
> > > 
> > >         Regards,
> > > 
> > >         Ora
> > > 
> > >         --
> > > 
> > >         Dr. Ora Lassila
> > > 
> > >         Principal Graph Technologist, Amazon Neptune
> > > 
> > >         Amazon Web Services
> > > 
> > >         ora@amazon.com
> > > 
> 

Received on Sunday, 5 December 2021 21:47:10 UTC