- From: Ted Thibodeau Jr <tthibodeau@openlinksw.com>
- Date: Wed, 22 Feb 2012 14:55:18 -0500
- To: Jimmy O'Regan <joregan@gmail.com>
- Cc: dbpedia-discussion <dbpedia-discussion@lists.sourceforge.net>, RDF WG <public-rdf-wg@w3.org>
- Message-Id: <7B0FA4CF-318B-46D8-9B1E-B7B809F1CCE3@openlinksw.com>
On Feb 8, 2012, at 03:28 PM, Jimmy O'Regan wrote:
> On 8 February 2012 19:23, Ted Thibodeau Jr <tthibodeau@openlinksw.com> wrote:
>> <http://dbpedia.org/resource/Academy_Award_for_Best_Art_Direction>
>> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type>
>> <http://www.w3.org/2002/07/owl#Thing>
>> <http://en.wikipedia.org/wiki/Academy_Award_for_Best_Art_Direction#absolute-line=1>
>> .
>>
>>
>> This appears to suggest that the { ?s ?p ?o } triplet was extracted
>> from the resource at the URI in the ?c position -- but the fragment
>> identifier breaks that suggestion, as the above triple simply
>> doesn't come from line 1 of either the Wikipedia markup source --
>>
>> {{Infobox award
>
> That's the line it was generated from. The mapping for this template
> (http://mappings.dbpedia.org/index.php/Mapping:Infobox_award) has a
> 'map to class' for
> http://mappings.dbpedia.org/index.php/OntologyClass:Award, which in
> turn has rdfs:subClassOf owl:Thing
Ah! So "absolute-line=1" is meant to refer to the line *of the
DBpedia mapping* which caused the triple to be generated?
It may surprise you to hear that none of us in the RDF-WG were able
to figure that out from context.
But... Line 1 of that mapping is just --
{{TemplateMapping
Line *2* holds --
| mapToClass = Award
-- so it seems that may need at least a little adjustment.
It further seems to me that there are at least three factors which
provide context for any triple produced by the DBpedia extractors,
all of which should somehow be made available through the fourth
position of the N-quads dump --
1. URI of source document
2. URI of mapping rules
3. timestamp that mapping rules were applied to the source
document, which resulted in generation of the triple (or
rather, its enclosing graph)
(The timestamp in #3 might be sufficient to nail down the revisions
of #1 and #2, or it might not... If not, then at least two more
factors must be made available through the fourth position.)
Melding all these factors into a single string or URI would be ugly
at best, so perhaps there should be an "extraction ontology" which
is used to describe the RDF Graphs produced by the extractors.
I would suggest that the fourth column of the N-quads dumps should
hold a DBpedia URI, perhaps something like --
<http://dbpedia.org/graph/Academy_Award_for_Best_Art_Direction/
20120222Z125218.123456#this>
This URI identifies the RDF Graph (a/k/a "G-snap") produced by the
mapping against the source document, by a combination of the source
document's wikiword and the timestamp of the graph's production. The
RDF Graph can then itself be described with sourceURI, mappingURI,
timeStamp, etc. -- whatever other metadata may make sense.
Users could then
- get one or more complete RDF Graphs, as produced on chosen date(s),
associated with a given wikiword -- whether current or historic;
- compare these RDF Graphs over time
- compare the results of different mappings against the same source
document (wikiword), as each extraction should produce a differently
timestamped RDF Graph
What do you think?
Ted
--
A: Yes. http://www.guckes.net/faq/attribution.html
| Q: Are you sure?
| | A: Because it reverses the logical flow of conversation.
| | | Q: Why is top posting frowned upon?
Ted Thibodeau, Jr. // voice +1-781-273-0900 x32
Evangelism & Support // mailto:tthibodeau@openlinksw.com
// http://twitter.com/TallTed
OpenLink Software, Inc. // http://www.openlinksw.com/
10 Burlington Mall Road, Suite 265, Burlington MA 01803
Weblog -- http://www.openlinksw.com/blogs/
LinkedIn -- http://www.linkedin.com/company/openlink-software/
Twitter -- http://twitter.com/OpenLink
Google+ -- http://plus.google.com/100570109519069333827/
Facebook -- http://www.facebook.com/OpenLinkSoftware
Universal Data Access, Integration, and Management Technology Providers
Attachments
- application/pkcs7-signature attachment: smime.p7s
Received on Wednesday, 22 February 2012 19:55:44 UTC