W3C home > Mailing lists > Public > public-rdf-wg@w3.org > February 2012

intended semantics of fourth column values in DBpedia N-quads (ACTION-144)

From: Ted Thibodeau Jr <tthibodeau@openlinksw.com>
Date: Wed, 8 Feb 2012 14:23:58 -0500
Cc: RDF WG <public-rdf-wg@w3.org>
To: dbpedia-discussion <dbpedia-discussion@lists.sourceforge.net>
Message-Id: <2AC13DAF-AFF3-4CA1-91A9-33A8AE799E2F@openlinksw.com>
All --

A question has come up on the W3C RDF Working Group mailing list --

   <http://lists.w3.org/Archives/Public/public-rdf-wg/2012Feb/0042.html>

-- and in recent WG conference calls --

   <http://www.w3.org/2011/rdf-wg/meeting/2012-02-08>

-- about the semantics of fourth column values in the N-quads 
DBpedia downloads, e.g. --

   http://wiki.dbpedia.org/Downloads37

-- which are arguably one of the largest current uses of N-quads.


The downloads page says "the N-Quads version contains additional 
provenance information for each statement", but does not explain 
what that "additional provenance information" might be meant to
convey or represent.

A sample line from a randomly chosen dump file --
   <http://downloads.dbpedia.org/preview.php?file=3.7_sl_en_sl_instance_types_en.nq.bz2>

-- reads as follows (broken across successive lines for email) --


   <http://dbpedia.org/resource/Academy_Award_for_Best_Art_Direction> 
   <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> 
   <http://www.w3.org/2002/07/owl#Thing> 
   <http://en.wikipedia.org/wiki/Academy_Award_for_Best_Art_Direction#absolute-line=1> 
   .


This appears to suggest that the { ?s ?p ?o } triplet was extracted 
from the resource at the URI in the ?c position -- but the fragment 
identifier breaks that suggestion, as the above triple simply 
doesn't come from line 1 of either the Wikipedia markup source --

   {{Infobox award

-- or the HTML representation --

   <!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" 
   "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd">

-- of the named resource --

   <http://en.wikipedia.org/wiki/Academy_Award_for_Best_Art_Direction>


Any insight would be helpful.

Regards,

Ted



--
A: Yes.                      http://www.guckes.net/faq/attribution.html
| Q: Are you sure?
| | A: Because it reverses the logical flow of conversation.
| | | Q: Why is top posting frowned upon?

Ted Thibodeau, Jr.           //               voice +1-781-273-0900 x32
Evangelism & Support         //        mailto:tthibodeau@openlinksw.com
                             //              http://twitter.com/TallTed
OpenLink Software, Inc.      //              http://www.openlinksw.com/
         10 Burlington Mall Road, Suite 265, Burlington MA 01803
     Weblog   -- http://www.openlinksw.com/blogs/
     LinkedIn -- http://www.linkedin.com/company/openlink-software/
     Twitter  -- http://twitter.com/OpenLink
     Google+  -- http://plus.google.com/100570109519069333827/
     Facebook -- http://www.facebook.com/OpenLinkSoftware
Universal Data Access, Integration, and Management Technology Providers







Received on Wednesday, 8 February 2012 19:29:22 UTC

This archive was generated by hypermail 2.3.1 : Tuesday, 6 January 2015 22:02:03 UTC