W3C home > Mailing lists > Public > public-openannotation@w3.org > August 2012

Re: named entity annotation / certainty / provenance

From: Paolo Ciccarese <paolo.ciccarese@gmail.com>
Date: Thu, 16 Aug 2012 10:51:34 -0400
Message-ID: <CAFPX2kB7dik-S2OYf2V01buMRN12Bf9wTneyCUZEt+QMt38rfg@mail.gmail.com>
To: James Chartrand <jc.chartrand@mac.com>
Cc: public-openannotation@w3.org
Ah, interesting example!

Regarding the confidence they used an 'example' namespace as far as I see.
I guess my recommendation is still valid. Do what you need to do for your
use case and, if you feel that can be good also for others, please
contribute it back to the list. Personally I always needed to record a
single score but I am sure there are counter examples.

Thank you Sebastian for the feedback, I updated the example in the Cookbook
accordingly:
http://www.w3.org/community/openannotation/wiki/SE_Semantically_Tagging_an_Image

Paolo

On Thu, Aug 16, 2012 at 10:44 AM, James Chartrand <jc.chartrand@mac.com>wrote:

> Thank you Paolo.  I did find this example of asserting certainty
> (confidenceValue) in the PROV examples:
>
> from http://www.w3.org/TR/prov-o/#Source:
>
> @prefix rdfs: <http://www.w3.org/2000/01/rdf-schema#> .
> @prefix xsd:  <http://www.w3.org/2001/XMLSchema#> .
> @prefix owl:  <http://www.w3.org/2002/07/owl#> .
> @prefix prov: <http://www.w3.org/ns/prov#> .
> @prefix :     <http://example.com/> .
>
> :myPost
>    a prov:Entity;
>    prov:hadOriginalSource :donQuixote;
>    prov:qualifiedSource [
>       a prov:Source;
>       prov:entity :donQuixote;
>       :confidenceValue "6"^^xsd:integer;
>       rdfs:comment """Not sure if Don Quixote was the original source,
>                       so asserting a confidence value of 6 out of 10.""";
>    ];
> .
>
> :donQuixote a prov:Entity.
>
> james
>
> On 2012-08-16, at 10:06 AM, Paolo Ciccarese <paolo.ciccarese@gmail.com>
> wrote:
>
> > Hi James,
> >
> > On Thu, Aug 16, 2012 at 9:28 AM, James Chartrand <jc.chartrand@mac.com>
> wrote:
> > We're annotating named entity references (to people, places, events,
> organizations) in text like so:
> >
> > <AnnoA> a oa:Annotation ;
> >    oa:hasTarget <urn:uuid:CB37E17E-B1EF-48CE-B098-0CE8FAA86A6D> ;
> >    oax:hasSemanticTag <http://dbpedia.org/page/MrJones> ;
> >    oa:annotator <http://somewhere.ca/people/JamesChartrand> ;
> >    oa:annotated "2012-08-15T12:10:54Z" ;
> >    oa:generator <someSoftwareURI> ;
> >    oa:generated "2012-08-15T12:10:54Z" ;
> >    oa:modelVersion <http://openannotation.org/spec/core/20120509> .
> >
> >   <urn:uuid:CB37E17E-B1EF-48CE-B098-0CE8FAA86A6D> a oa:SpecificResource ;
> >     oa:hasSource <http://somewhere.ca/work/345345> ;
> >     oa:hasSelector <urn:uuid:2ADFF7EE-AB19-4BA3-94EC-55EE0BA645C2> .
> >
> >   <urn:uuid:2ADFF7EE-AB19-4BA3-94EC-55EE0BA645C2> a
> oax:TextOffsetSelector ;
> >     oax:offset 244 ;
> >     oax:range 7 .
> >
> >   <http://somewhere.ca/work/345345> a dctypes:Text .
> >
> >  Three questions:
> >
> > 1.  Does this seem right?
> >
> >
> > At first glance I would say yes.
> >
> > 2.  We'd like to assert the confidence with which we're annotating --
> how certain we are that this reference is to the given person.  Are there
> any recommendations for asserting certainty?
> >
> > I would suggest to create your own property or mechanism and then
> contribute it back to the group. That is certainly something others are
> doing. In my project I have a score property (score that is coming from
> text mining algorithms) attached to the Annotation node. It would be
> interesting to collect the different use cases on this topic on the wiki.
> >
> >
> > 3.  Should the oa provenance predicates be used in preference to the
> PROV ontology (http://www.w3.org/TR/prov-o/) or can PROV be used
> interchangeably?  Is PROV recommended for provenance predicates not covered
> by OA?  Any plans to move to PROV entirely?
> >
> > We worked out some mappings
> > oa:annotator subPropertyOf prov:wasAttributedTo
> > oa:annotated subPropertyOf prov:generateAtTime
> > oa:generator subPropertyOf prov:wasAttributedTo
> > oa:generated subPropertyOf prov:generatedAtTime.
> >
> > I believe we will include/publish them at one point. However, as you can
> see, the expressiveness of the prov properties is not the same.
> >
> > Best,
> > Paolo
> >
>
>


-- 
Dr. Paolo Ciccarese
http://www.paolociccarese.info/
Biomedical Informatics Research & Development
Instructor of Neurology at Harvard Medical School
Assistant in Neuroscience at Mass General Hospital
+1-857-366-1524 (mobile)   +1-617-768-8744 (office)

CONFIDENTIALITY NOTICE: This message is intended only for the addressee(s),
may contain information that is considered
to be sensitive or confidential and may not be forwarded or disclosed to
any other party without the permission of the sender.
If you have received this message in error, please notify the sender
immediately.
Received on Thursday, 16 August 2012 14:51:55 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Thursday, 16 August 2012 14:51:56 GMT