
Re: [ANN] nature.com/ontologies - July 2015 Release

From: Hugh Glaser <hugh@glasers.org>
Date: Fri, 7 Aug 2015 12:48:32 +0100
Cc: "public-lod@w3.org" <public-lod@w3.org>
Message-Id: <389CE2F7-5858-4144-A952-7EEE23296C1D@glasers.org>
To: "Hammond, Tony" <Tony.Hammond@Macmillan.com>

Hi Tony,
Great stuff!
So I start exploring, looking for more fodder for sameAs.org … :-)

It may be that my questions are too specific for the list - feel free to go off-list in response, and then we can summarise.
And there is rather a lot here, I’m afraid.

Some possible problemettes I hit:
The example at http://www.nature.com/ontologies/datasets/articles/#data_example
might be confusing for people (and was awkward when I tried to parse it with rapper),
since quite a few prefixes are not declared: most notably one of yours, npg, but also the usual suspects (xsd, dc, bibo, foaf and also prism).
There is also a missing foaf:homepage that causes a syntax error,
and some semi-colons are missing from the last few lines.
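
(For illustration only: a rough sketch of how undeclared prefixes like these could be spotted before handing an example to a real parser. It is a regex heuristic, not a Turtle parser, and the function name and the sample snippet are made up for this sketch rather than taken from the nature.com page.)

```python
import re

# "@prefix foo: <...>" (Turtle) or "PREFIX foo: <...>" (SPARQL-style) declarations.
DECL = re.compile(r'^\s*(?:@prefix|PREFIX)\s+([A-Za-z][\w-]*)?:',
                  re.IGNORECASE | re.MULTILINE)
# Prefixed names such as foaf:homepage (prefix, colon, local name).
USE = re.compile(r'(?<![\w<:/])([A-Za-z][\w-]*):[A-Za-z_][\w.-]*')


def undeclared_prefixes(turtle_text):
    """Return the sorted prefixes that are used but never declared (heuristic)."""
    # Blank out <...> IRIs and "..." literals so their contents are not
    # mistaken for prefixed names.
    stripped = re.sub(r'<[^<>\s]*>', ' ', turtle_text)
    stripped = re.sub(r'"(?:[^"\\]|\\.)*"', ' ', stripped)
    declared = {m.group(1) or '' for m in DECL.finditer(turtle_text)}
    used = {m.group(1) for m in USE.finditer(stripped)}
    return sorted(used - declared)


# Hypothetical snippet: only dc is declared.
sample = '''
@prefix dc: <http://purl.org/dc/elements/1.1/> .
<http://example.org/a1> dc:title "A title: with colon" ;
    foaf:homepage <http://example.org/home> ;
    ex:pages "12"^^xsd:integer .
'''
print(undeclared_prefixes(sample))
```

A proper parser (rapper, for instance) remains the real test; this only gives a quick list of which declarations to add.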

A slightly more challenging problem is that the URI for that example doesn’t resolve.
It expands to http://ns.nature.com/articles/nrg3870 (I assume it is a namespace problem.)

But I managed to find a resolving URI: http://dx.doi.org/10.1038/246015a0 (from http://www.nature.com/ontologies/mappings/articles-dbpedia/)
And successfully got some RDF :-)
Looking at the owl:sameAs triples in there, I then start to worry - they are urn:, doi: and info: URIs.
This is fine for Semantic Web publishing, but means (in my opinion) that it is not Linked Data (violating principle 2) - all URIs for Things have to be http: for that. So you could use another predicate, but owl:sameAs seems wrong.
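
(A minimal sketch of the kind of check meant here, assuming the triples have already been parsed into (subject, predicate, object) string tuples. The function name is hypothetical, the object URIs in the example are illustrative rather than copied from the actual RDF, and accepting https: alongside http: is an assumption of the sketch.)

```python
from urllib.parse import urlsplit

OWL_SAMEAS = "http://www.w3.org/2002/07/owl#sameAs"


def non_http_sameas(triples):
    """Return the owl:sameAs triples whose object is not an http(s) URI."""
    flagged = []
    for s, p, o in triples:
        if p == OWL_SAMEAS and urlsplit(o).scheme.lower() not in ("http", "https"):
            flagged.append((s, p, o))
    return flagged


# Illustrative data only.
article = "http://dx.doi.org/10.1038/246015a0"
triples = [
    (article, OWL_SAMEAS, "doi:10.1038/246015a0"),
    (article, OWL_SAMEAS, "info:doi/10.1038/246015a0"),
    (article, OWL_SAMEAS, "http://example.org/articles/246015a0"),
]
for _, _, obj in non_http_sameas(triples):
    print(obj)
```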

Having found http://www.nature.com/ontologies/mappings/, I excitedly went off to http://www.nature.com/ontologies/mappings/articles-dbpedia/ and downloaded the files.
However, I found that the file contains only triples with foaf:topic and cito:isCitedBy - no mappings between your URIs and DBpedia, MeSH, etc., which is what I was expecting. It seems to me that this is more of a “links” file than a “mappings” file.
Even more frustrating, many URIs that I tried, such as http://dx.doi.org/10.1038/ng1285, don’t resolve (give an invalid doi: message), so I wouldn’t be able to use them in any case.
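
(A sketch of how such a DOI URI might be requested as RDF, using only the Python standard library. The helper name is hypothetical, and the text/turtle media type is an assumption about what the resolver will serve; the request is only built here, not sent.)

```python
import urllib.parse
import urllib.request


def rdf_request(doi, media_type="text/turtle"):
    """Build (but do not send) a request asking the DOI resolver for RDF."""
    url = "http://dx.doi.org/" + urllib.parse.quote(doi, safe="/")
    return urllib.request.Request(url, headers={"Accept": media_type})


req = rdf_request("10.1038/246015a0")
print(req.full_url, req.get_header("Accept"))

# To actually fetch (needs network access; an unregistered DOI should
# surface as an HTTP error):
#     with urllib.request.urlopen(req) as resp:
#         print(resp.read().decode("utf-8"))
```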

So… :-)
Apart from any fixing you may want to do, and maybe sprinkling around some example Linked Data URIs for People and Publications (it really was quite a challenge to find any RDF!), I suspect that you do have some mappings between your URIs and DBpedia URIs, for example.
Is there any chance you would like to send me (or link to a file) any Linked Data owl:sameAs triples that I could add to sameAs.org, please?
And also, do you have an interesting owl:differentFrom dataset that I could add to differentFrom.org? This might be your regression test, where you have done mappings that you later found were wrong.

Sorry if I have made some basic errors in failing to find things.
Very best
Hugh

> On 28 Jul 2015, at 12:41, Hammond, Tony <Tony.Hammond@Macmillan.com> wrote:
> 
> Hi:
> 
> As promised in a couple posts to the list earlier this year [1,2] we have now resumed dataset publishing (following on from the 2012 releases on data.nature.com [3,4]) and have added a new snapshot of bibliography metadata for nature.com articles to the Nature.com Ontologies portal:
> 
>     http://nature.com/ontologies/
>  
> Specifically, as announced yesterday [5], we have released 170 years of bib data for all nature.com articles and contributors over the period 1845-2015. We expect to release new snapshots periodically.
> 
> The release notes are copied below.
> 
> Tony
> 
> 
> ==
> Release:
> 
> * We've resumed publishing of datasets. We're now making available complete instance datasets for articles (1.2 m) and contributors (2.7 m). These datasets are linked to the DOI and ORCID datasets. (These datasets replace the historic datasets from 2012.)
> 
> * We've now added our core and domain models to GitHub projects and brought them under version control: public-npg-core-ontology and public-npg-domain-ontology
> 
> * We've improved our documentation. A whole new Technical Notes section has been added. Some material from the homepage (e.g. Background, Licenses, Namespaces) has been moved there, and new material has been added (e.g. Annotations, Mappings, Naming Policy, Versions).
> 
> * We've improved our data mappings. The Subjects Ontology is now 100% mapped to DBpedia. See Mappings.
> 
> * We've added a reference in our Links section to our new colleagues at Springer and their LOD for Conferences in Computer Science.
> ==
>   
> 
> [1] https://lists.w3.org/Archives/Public/public-lod/2015Apr/0005.html
> [2] https://lists.w3.org/Archives/Public/public-lod/2015May/0002.html
> [3] https://lists.w3.org/Archives/Public/public-lod/2012Apr/0061.html
> [4] https://lists.w3.org/Archives/Public/public-lod/2012Jul/0130.html
> [5] https://twitter.com/tonyhammond/status/625641560676409345
> 

-- 
Hugh Glaser
   20 Portchester Rise
   Eastleigh
   SO50 4QS
Mobile: +44 75 9533 4155, Home: +44 23 8061 5652
Received on Friday, 7 August 2015 11:49:20 UTC
