Re: Notes from today's meeting from Alasdair J G Gray on 2013-06-04 (public-semweb-lifesci@w3.org from June 2013)

From: Alasdair J G Gray <Alasdair.Gray@manchester.ac.uk>
Date: Tue, 4 Jun 2013 10:06:25 +0100
To: Amrapali J Zaveri <amrapali.j.zaveri@gmail.com>
Cc: Michel Dumontier <michel.dumontier@gmail.com>, Jerven Bolleman <me@jerven.eu>, "public-semweb-lifesci@w3.org" <public-semweb-lifesci@w3.org>
Message-Id: <AA659346-197C-4979-AF51-DD18E0DAF86B@manchester.ac.uk>

thanks for the link. Looks really useful. I'll have a proper look at it later in the day.

Alasdair

On 3 Jun 2013, at 18:15, Amrapali J Zaveri <amrapali.j.zaveri@gmail.com> wrote:

> Hi all,
> 
> I just wanted to point you to this checklist: http://goo.gl/K642I, which might be helpful.
> 
> Thanks.
> Regards,
> Amrapali Zaveri
> http://aksw.org/AmrapaliZaveri
> 
> 
> On Mon, Jun 3, 2013 at 6:51 PM, Michel Dumontier <michel.dumontier@gmail.com> wrote:
> Hi Jerven,
> 
> 
> On Mon, Jun 3, 2013 at 6:30 PM, Jerven Bolleman <me@jerven.eu> wrote:
> Hi All,
> 
> I wanted to discuss one more thing that has been decided in an earlier meeting.
> And that is the choice for dcterms:created or pav:createdOn.
> As a large data provider I want to only share the date that I published the data on.
> i.e. dcterms:issued. Could we change the must to include issued next to created or createdOn.
> 
> This is also a crucial date for the general public while created is not. (e.g. for patent court cases date of publication is critical, the day the file was internally ready is not)
> 
> 
> under the availability section, we have yet to discuss "issued".  From a provenance perspective, "created" is primary metadata, and may coincide with issued for some cases.
> 
>  
> Also continuing the discussion on e-mail that started on the call.
> We should have a clear definition of data item if we are going to record information about such things. e.g. baseURI, what happens if we have 2 data item types in a single dataset?
> 
> 
> ultimately, what i want is to :
> i) to validate the syntax of identifier in some dataset or cross reference (legacy, RDF)
> ii) to compose a URI from a preferred or alternative prefix and an identifier   (legacy to RDF)
> iii) to decompose a URI to a preferred prefix and identifier pair (RDF to legacy)
> iv) to translate one URI pattern to another URI pattern (RDF)
> 
>  
> About void:inDataset I personally don't like it. I suspect it would cost me a 13% growth in triple size for negligible benefits. This also means that the dataset description starts to affect the data. Although I could only present this in the rest / linked data interface and not in the sparql endpoint. I am worried that I can not put it into the FTP data dump rdf. As the data item concept does not map 1:1 on a set of triples that are atomic. 
> 
> 
> i'm not sure that i completely understand your objection. the primary use of void:inDataset is to link data items to the dataset description, and as such supports linked data applications without looking at the graph for a potential, but un-guaranteed provenance description. Using void:inDataset is normal practice in the RDF / linked data community. It would be strange to not include it in any RDF dataset if you have the dataset description.
> 
> http://www.w3.org/TR/void/#backlinks
> 
>  
> e.g. someone can use just the UniProtKB sequences. Once they did that is it still the same dataset that I published it as? I don't think so. Which means uniprot end users need to be careful to remove more triples. Which why I disagree with alasdair's call for MUST.
> 
> 
> if one wanted to know which version/issue of uniprot that the sequences came from, it would be necessary to provide access to the dataset description. if the void:inDataset predicate is used, the user need not even retrieve that to store locally, as you should provide resolution services to those dataset descriptions.
> 
> m.
>  
> Regards,
> Jerven
> 
> Regards,
> Jerven
> 
> 
> Regards,
> Jerven
> 
> 
> 
> 
> 
> 
> 
> On Mon, Jun 3, 2013 at 6:11 PM, Alasdair J G Gray <Alasdair.Gray@manchester.ac.uk> wrote:
> Hi All,
> 
> Seems that fuse threw everyone out at the crucial moment. We will pick up the void:inDataset discussion next week as well as addressing the provenance section.
> 
> Attached are the notes I made during today's call.
> 
> Alasdair
> 
> 
> 
> Dr Alasdair J G Gray
> Research Associate
> Alasdair.Gray@manchester.ac.uk
> +44 161 275 0145
> 
> http://www.cs.man.ac.uk/~graya/
> 
> Please consider the environment before printing this email.
> 
> 
> 
> 
> 
> -- 
> Jerven Bolleman
> me@jerven.eu
> 
> 
> 
> -- 
> Michel Dumontier
> Associate Professor of Bioinformatics, Carleton University
> Chair, W3C Semantic Web for Health Care and the Life Sciences Interest Group
> http://dumontierlab.com
> 

Dr Alasdair J G Gray
Research Associate
Alasdair.Gray@manchester.ac.uk
+44 161 275 0145

http://www.cs.man.ac.uk/~graya/

Please consider the environment before printing this email.

Received on Tuesday, 4 June 2013 09:06:36 UTC