RE: Naming Conventions for URIs from Hans Teijgeler on 2015-08-22 (semantic-web@w3.org from August 2015)

From: Hans Teijgeler <hans.teijgeler@quicknet.nl>
Date: Sat, 22 Aug 2015 19:38:40 +0200
To: "'David Booth'" <david@dbooth.org>
Cc: "'Paul Houle'" <ontology2@gmail.com>, <semantic-web@w3.org>, "'Discussion list for the Wikidata project.'" <wikidata-l@lists.wikimedia.org>
Message-ID: <C9F504C0504C474381B88D07FD5DF9E7@HansPC>
Hi David,

"Slightly modified" data are modified data (one can't be slightly pregnant
either), and that means different information that shall not have the same
URI if you want the history recorded.
We deprecate the old information and make the next version effective on that
same dateTime.
The URI for Alice's address will indeed be the same:  v:SOURCE ex:alice ,
what made you think that my proposal would lead to a different URI? It will
lead to another URI for modifed information about Alice's address.
We use this method to put on record the lifecycle information of a process
plant (e.g. an oil refinery, chemical plant), with tens of thousands of
objects in changing design and configurations.
But for the same token this method can be used for Patient Records, real
estate, air planes, etc.
Just have a peek in <http://15926.org>. But I won't try to convert you. I
rest my case.

Regards,
Hans

PS We don't have blank nodes whatsoever. 
Besides, to your "multiple motivations for deriving URIs from natural keys
and properties" this: an old wisdom in data modeling is that you shall not
hide information in IDs.

Hans Teijgeler,
Laanweg 28,
1871 BJ Schoorl,
Netherlands
15926.org <http://15926.org>

-----Original Message-----
From: David Booth [mailto:david@dbooth.org] 
Sent: vrijdag 21 augustus 2015 21:58
To: Hans Teijgeler; 'Paul Houle'; semantic-web@w3.org; 'Discussion list for
the Wikidata project.'
Subject: Re: Naming Conventions for URIs

Hi Hans,

You can (with high probability) generate unique URIs that way, but part of
the reason for generating them in a predictable way is ensure that if the
same or only slightly modified data is regenerated later, the URI for
Alice's address will still be the same.  This is helpful because you might
already have other data that uses that URI, and you might merge new and old
data.  When merging, it is helpful if the node URIs match. 
Otherwise you wind up with (effectively) duplicate triples. 
Incidentally, this is also one of the reasons that blank nodes cause
problems: they also result in (effectively) duplicate triples.

In short, I think there are multiple motivations for deriving URIs from
natural keys and properties:

  - They can be mnemonic and hackable, which makes them friendlier for data
browsing and debugging.

  - They are predictable, which is better for data merging.

  - They are SPARQL friendly (as compared with blank nodes).

David Booth

On 08/21/2015 03:18 PM, Hans Teijgeler wrote:
> David,
>
> What about using Defining N-ary Relations on the Semantic Web 
> <http://www.w3.org/TR/swbp-n-aryRelations/> Use Case 3 ?
>
> @prefix : <http://example/mine/address/> .
> @prefix ex: <http://example/other/> .
> @prefix v: <https://tools.ietf.org/html/rfc6350#section-10.2.6>
>
> :T78F7E2EB48C044CCB5474C072F5573DB rdf:type :SnailMailAddress, 
> owl:Thing; # T+GUID
>      v:SOURCE ex:alice ;
>      v:FN "Alice Doe" ;
>      v:ADR "45 Park St, Shadyville, USA" .
>
> Something like that.
>
> Regards, Hans
>
> Hans Teijgeler,
> Laanweg 28,
> 1871 BJ Schoorl,
> Netherlands
> 15926.org <http://15926.org>
>
> -----Original Message-----
> From: David Booth [mailto:david@dbooth.org]
> Sent: vrijdag 21 augustus 2015 17:31
> To: Paul Houle; semantic-web@w3.org; Discussion list for the Wikidata 
> project.
> Subject: Re: Naming Conventions for URIs
>
> On 08/20/2015 11:36 AM, Paul Houle wrote:
> [ . . . ]
>  > The production for a QName  cannot begin with a number so it is not  
> > correct to write something like  >  > dbpedia:100  >  > or expect to 
> have the full URI squashed to that.  This kind of gotcha  > will drive 
> newbies nuts,  and the realization of RDF as a universal  > solvent 
> requires squashing many of them.
>
> I agree.  And although (as Andy pointed out) this particular issue has 
> been fixed in SPARQL 1.1 and Turtle 1.1, last I checked not all tools 
> had been upgraded to those specs, so it still remains a problem in 
> practice.  Personally, as a workaround when converting a natural key 
> to a URI, I always prefix the key with an alpha string that suggests 
> what it is.  For example instead of the above I might write:
>
> dbpedia:dbp_100
>
> Another place that URI creation and management causes unnecessary 
> headache is when you want to mint a new URI that is relative to an 
> existing URI that you don't control.  For example, suppose someone 
> else gives me a URI for Alice:
>
>     <http://example/other/alice> a foaf:Person .
>
> My software wants to add some triples about Alice's address, such as:
>
>     <http://example/other/alice> v:address _:b
>     _:b v:street "Park St" .
>     _:b v:city "Shadyville" .
>
> For SPARQL and other reasons it would be better to use a URI like 
> <http://example/other/alice/address> instead of a blank node _:b for 
> Alice's address, such as:
>
> <http://example/other/alice>
> v:address <http://example/other/alice/address> .
>     <http://example/other/alice/address> v:street "Park St" .
>     <http://example/other/alice/address> v:city "Shadyville" .
>
> But since I do not control Alice's URI, I cannot safely mint a URI 
> that is relative to Alice's URI without the consent of the original 
> URI's owner.  (Otherwise I would be URI squatting, which is bad 
> practice and
> risky.)
>
> Intuitively I would like the new URI to be somehow derived from 
> Alice's URI.  I could mint a URI from my own URI space 
> <http://example/mine/>, by concatenating Alice's URI (after escaping) onto
mine, such as:
>
>     <http://example/mine/address#http://example/other/alice>
>
> but now I don't have an easy PREFIXed way to write that URI in Turtle 
> or SPARQL.  :(  Furthermore, I have to be careful to properly escape 
> the original URI before concatenating it, because it might contain 
> characters that are not allowed in a fragment identifier.
>
> In my experience URI allocation and management is one of the most 
> annoying practical aspects of working with RDF, as compared with JSON, 
> for example, where one can just put blinders on and ignore the need 
> for global identification.  It would be great if some PhD student or 
> other creative person could figure out a good solution to reduce this 
> pain and make RDF easier for a wider audience to use.  (New 
> conventions?  A new RDF serialization?  Extend RDF?)  JSON-LD is a 
> good step, but still doesn't solve the problem.
>
> David Booth
>
Received on Saturday, 22 August 2015 17:39:11 UTC