W3C home > Mailing lists > Public > public-openannotation@w3.org > August 2012

Re: named entity annotation / certainty / provenance

From: James Chartrand <jc.chartrand@mac.com>
Date: Thu, 16 Aug 2012 10:44:16 -0400
Cc: public-openannotation@w3.org
Message-id: <74AF66AE-C6C6-4762-849E-DBA0C748F43F@mac.com>
To: Paolo Ciccarese <paolo.ciccarese@gmail.com>
Thank you Paolo.  I did find this example of asserting certainty (confidenceValue) in the PROV examples:

from http://www.w3.org/TR/prov-o/#Source:

@prefix rdfs: <http://www.w3.org/2000/01/rdf-schema#> .
@prefix xsd:  <http://www.w3.org/2001/XMLSchema#> .
@prefix owl:  <http://www.w3.org/2002/07/owl#> .
@prefix prov: <http://www.w3.org/ns/prov#> .
@prefix :     <http://example.com/> .

:myPost 
   a prov:Entity;
   prov:hadOriginalSource :donQuixote;
   prov:qualifiedSource [
      a prov:Source;
      prov:entity :donQuixote;
      :confidenceValue "6"^^xsd:integer;
      rdfs:comment """Not sure if Don Quixote was the original source, 
                      so asserting a confidence value of 6 out of 10.""";
   ];
.

:donQuixote a prov:Entity.

james

On 2012-08-16, at 10:06 AM, Paolo Ciccarese <paolo.ciccarese@gmail.com> wrote:

> Hi James,
> 
> On Thu, Aug 16, 2012 at 9:28 AM, James Chartrand <jc.chartrand@mac.com> wrote:
> We're annotating named entity references (to people, places, events, organizations) in text like so:
> 
> <AnnoA> a oa:Annotation ;
>    oa:hasTarget <urn:uuid:CB37E17E-B1EF-48CE-B098-0CE8FAA86A6D> ;
>    oax:hasSemanticTag <http://dbpedia.org/page/MrJones> ;
>    oa:annotator <http://somewhere.ca/people/JamesChartrand> ;
>    oa:annotated "2012-08-15T12:10:54Z" ;
>    oa:generator <someSoftwareURI> ;
>    oa:generated "2012-08-15T12:10:54Z" ;
>    oa:modelVersion <http://openannotation.org/spec/core/20120509> .
> 
>   <urn:uuid:CB37E17E-B1EF-48CE-B098-0CE8FAA86A6D> a oa:SpecificResource ;
>     oa:hasSource <http://somewhere.ca/work/345345> ;
>     oa:hasSelector <urn:uuid:2ADFF7EE-AB19-4BA3-94EC-55EE0BA645C2> .
> 
>   <urn:uuid:2ADFF7EE-AB19-4BA3-94EC-55EE0BA645C2> a oax:TextOffsetSelector ;
>     oax:offset 244 ;
>     oax:range 7 .
> 
>   <http://somewhere.ca/work/345345> a dctypes:Text .
> 
>  Three questions:
> 
> 1.  Does this seem right?
> 
> 
> At first glance I would say yes.
>  
> 2.  We'd like to assert the confidence with which we're annotating -- how certain we are that this reference is to the given person.  Are there any recommendations for asserting certainty?
> 
> I would suggest to create your own property or mechanism and then contribute it back to the group. That is certainly something others are doing. In my project I have a score property (score that is coming from text mining algorithms) attached to the Annotation node. It would be interesting to collect the different use cases on this topic on the wiki.
>  
> 
> 3.  Should the oa provenance predicates be used in preference to the PROV ontology (http://www.w3.org/TR/prov-o/) or can PROV be used interchangeably?  Is PROV recommended for provenance predicates not covered by OA?  Any plans to move to PROV entirely?
> 
> We worked out some mappings 
> oa:annotator subPropertyOf prov:wasAttributedTo
> oa:annotated subPropertyOf prov:generateAtTime
> oa:generator subPropertyOf prov:wasAttributedTo
> oa:generated subPropertyOf prov:generatedAtTime.
> 
> I believe we will include/publish them at one point. However, as you can see, the expressiveness of the prov properties is not the same.
>  
> Best,
> Paolo
> 
Received on Thursday, 16 August 2012 14:44:44 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Thursday, 16 August 2012 14:44:44 GMT