Different results of different RDFa extractors

From: Sébastien Laborie <Sebastien.Laborie@inrialpes.fr>
Date: Thu, 27 Mar 2008 13:44:38 +0100
To: public-rdf-in-xhtml-tf@w3.org
Message-Id: <3E448A17-79EB-4696-AD03-1A1DF75FBF2C@inrialpes.fr>
Cc: Faisal.Alkhateeb@inrialpes.fr
Hi everyone,

We have tested the RDFa Distiller (http://www.w3.org/2007/08/pyRdfa/)
and two Java extractors, RDFa Extractor and SweetWiki, on the
following XHTML+RDFa web page:
http://www.inrialpes.fr/exmo/people/laborie/SPARQLMM/PageWeb-CapitalEurope/index.xhtml
This web page has been validated with the W3C XHTML+RDFa validator.

Here are the results for each one:

****************RDFa Distiller*****************
<http://www.inrialpes.fr/exmo/people/laborie/SPARQLMM/PageWeb-CapitalEurope/Amsterdam> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://www.inrialpes.fr/exmo/people/laborie/SPARQLMM/city>.
<http://www.inrialpes.fr/exmo/people/laborie/SPARQLMM/PageWeb-CapitalEurope/Amsterdam> <http://www.inrialpes.fr/exmo/people/laborie/SPARQLMM/name> "Amsterdam"@en.
<http://www.inrialpes.fr/exmo/people/laborie/SPARQLMM/PageWeb-CapitalEurope/Amsterdam> <http://xmlns.com/foaf/0.1/depiction> <http://www.inrialpes.fr/exmo/people/laborie/SPARQLMM/PageWeb-CapitalEurope/Amsterdam.jpg>.
<http://www.inrialpes.fr/exmo/people/laborie/SPARQLMM/PageWeb-CapitalEurope/index.xhtml> <http://www.w3.org/1999/xhtml/vocab#stylesheet> <http://www.inrialpes.fr/exmo/people/laborie/SPARQLMM/PageWeb-CapitalEurope/emx_nav_left.css>.
<http://www.inrialpes.fr/exmo/people/laborie/SPARQLMM/PageWeb-CapitalEurope/BigBenLondon.jpg> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://xmlns.com/foaf/0.1/Image>.
<http://www.inrialpes.fr/exmo/people/laborie/SPARQLMM/PageWeb-CapitalEurope/BigBenLondon.jpg> <http://purl.org/dc/elements/1.1/format> "image/jpeg"@en.
<http://www.inrialpes.fr/exmo/people/laborie/SPARQLMM/PageWeb-CapitalEurope/Rome.jpg> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://xmlns.com/foaf/0.1/Image>.
<http://www.inrialpes.fr/exmo/people/laborie/SPARQLMM/PageWeb-CapitalEurope/Rome.jpg> <http://purl.org/dc/elements/1.1/format> "image/jpeg"@en.
<http://www.inrialpes.fr/exmo/people/laborie/SPARQLMM/PageWeb-CapitalEurope/Roma> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://www.inrialpes.fr/exmo/people/laborie/SPARQLMM/city>.
<http://www.inrialpes.fr/exmo/people/laborie/SPARQLMM/PageWeb-CapitalEurope/Roma> <http://www.inrialpes.fr/exmo/people/laborie/SPARQLMM/name> "Roma"@en.
<http://www.inrialpes.fr/exmo/people/laborie/SPARQLMM/PageWeb-CapitalEurope/Roma> <http://xmlns.com/foaf/0.1/depiction> <http://www.inrialpes.fr/exmo/people/laborie/SPARQLMM/PageWeb-CapitalEurope/Rome.jpg>.
<http://www.inrialpes.fr/exmo/people/laborie/SPARQLMM/PageWeb-CapitalEurope/Amsterdam.jpg> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://xmlns.com/foaf/0.1/Image>.
<http://www.inrialpes.fr/exmo/people/laborie/SPARQLMM/PageWeb-CapitalEurope/Amsterdam.jpg> <http://purl.org/dc/elements/1.1/format> "image/jpeg"@en.
<http://www.inrialpes.fr/exmo/people/laborie/SPARQLMM/PageWeb-CapitalEurope/London> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://www.inrialpes.fr/exmo/people/laborie/SPARQLMM/city>.
<http://www.inrialpes.fr/exmo/people/laborie/SPARQLMM/PageWeb-CapitalEurope/London> <http://www.inrialpes.fr/exmo/people/laborie/SPARQLMM/name> "London"@en.
<http://www.inrialpes.fr/exmo/people/laborie/SPARQLMM/PageWeb-CapitalEurope/London> <http://xmlns.com/foaf/0.1/depiction> <http://www.inrialpes.fr/exmo/people/laborie/SPARQLMM/PageWeb-CapitalEurope/BigBenLondon.jpg>.
**********************************************

****************RDFa Extractor*****************
Parsing HTML file result :
http://www.inrialpes.fr/exmo/people/laborie/SPARQLMM/PageWeb-CapitalEurope/index.xhtml seems to be well-formed.

===========================================================

Using BASE URI: http://www.inrialpes.fr/exmo/people/laborie/SPARQLMM/PageWeb-CapitalEurope/index.xhtml

Resulting RDF graph :
<> <ex:name> "London" .
<> <ex:name> "Amsterdam" .
<> <dc:format> "image/jpeg" .
<> <foaf:depiction> <> .
<> <ex:name> "Roma" .
_:Ahead1206620838409 <stylesheet> <> .
*********************************************

**************** SweetWiki*****************
<http://www.inrialpes.fr/exmo/people/laborie/SPARQLMM/PageWeb-CapitalEurope/index.xhtml> http://www.w3.org/1999/xhtml <http://www.inrialpes.fr/exmo/people/laborie/SPARQLMM/PageWeb-CapitalEurope/emx_nav_left.css> .
<http://www.inrialpes.fr/exmo/people/laborie/SPARQLMM/PageWeb-CapitalEurope/index.xhtml> http://www.inrialpes.fr/exmo/people/laborie/SPARQLMM/name "Roma"^^http://www.w3.org/2000/01/rdf-schema#Literal@en .
<http://www.inrialpes.fr/exmo/people/laborie/SPARQLMM/PageWeb-CapitalEurope/index.xhtml> http://xmlns.com/foaf/0.1/depiction _:#N10086 .
_:#N10086 http://purl.org/dc/elements/1.1/format "image/jpeg"^^http://www.w3.org/2000/01/rdf-schema#Literal@en .
<http://www.inrialpes.fr/exmo/people/laborie/SPARQLMM/PageWeb-CapitalEurope/index.xhtml> http://www.w3.org/1999/02/22-rdf-syntax-ns#type <http://www.w3.org/1999/xhtml> .
<http://www.inrialpes.fr/exmo/people/laborie/SPARQLMM/PageWeb-CapitalEurope/index.xhtml> http://www.inrialpes.fr/exmo/people/laborie/SPARQLMM/name "Amsterdam"^^http://www.w3.org/2000/01/rdf-schema#Literal@en .
<http://www.inrialpes.fr/exmo/people/laborie/SPARQLMM/PageWeb-CapitalEurope/index.xhtml> http://xmlns.com/foaf/0.1/depiction _:#N100C7 .
_:#N100C7 http://purl.org/dc/elements/1.1/format "image/jpeg"^^http://www.w3.org/2000/01/rdf-schema#Literal@en .
<http://www.inrialpes.fr/exmo/people/laborie/SPARQLMM/PageWeb-CapitalEurope/index.xhtml> http://www.w3.org/1999/02/22-rdf-syntax-ns#type <http://www.w3.org/1999/xhtml> .
<http://www.inrialpes.fr/exmo/people/laborie/SPARQLMM/PageWeb-CapitalEurope/index.xhtml> http://www.inrialpes.fr/exmo/people/laborie/SPARQLMM/name "London"^^http://www.w3.org/2000/01/rdf-schema#Literal@en .
<http://www.inrialpes.fr/exmo/people/laborie/SPARQLMM/PageWeb-CapitalEurope/index.xhtml> http://xmlns.com/foaf/0.1/depiction _:#N10108 .
_:#N10108 http://purl.org/dc/elements/1.1/format "image/jpeg"^^http://www.w3.org/2000/01/rdf-schema#Literal@en .
<http://www.inrialpes.fr/exmo/people/laborie/SPARQLMM/PageWeb-CapitalEurope/index.xhtml> http://www.w3.org/1999/02/22-rdf-syntax-ns#type <http://www.w3.org/1999/xhtml> .
********************************************

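The three extractors give different results for the same page. In
particular, the RDFa Distiller gives each city and each image its own
subject URI, whereas RDFa Extractor uses an empty subject (<>) and
SweetWiki attaches the statements to the page itself or to blank
nodes. Why do the results differ, and which one is correct?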
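
To make the disagreement concrete, here is a minimal sketch in Python
(not part of any of the tools tested) that compares the "name"
statements reported by the RDFa Distiller and by SweetWiki. The
variable and function names are our own assumptions, and the literals
are reduced to their lexical form, so language tags and datatypes are
ignored on purpose.

```python
# Illustrative comparison of the "name" triples from two of the outputs.
# All names below (BASE, PAGE, subjects, only_in) are hypothetical helpers.
BASE = "http://www.inrialpes.fr/exmo/people/laborie/SPARQLMM/"
PAGE = BASE + "PageWeb-CapitalEurope/"
NAME = BASE + "name"
CITIES = ("Amsterdam", "Roma", "London")

# RDFa Distiller: one subject URI per city.
distiller = {(PAGE + city, NAME, city) for city in CITIES}

# SweetWiki: every name is attached to the page itself.
sweetwiki = {(PAGE + "index.xhtml", NAME, city) for city in CITIES}


def subjects(triples):
    """Return the set of distinct subjects used in a set of triples."""
    return {s for s, _, _ in triples}


def only_in(a, b):
    """Return the triples present in a but missing from b, sorted."""
    return sorted(a - b)


if __name__ == "__main__":
    print("Distiller subjects:", len(subjects(distiller)))
    print("SweetWiki subjects:", len(subjects(sweetwiki)))
    for triple in only_in(distiller, sweetwiki):
        print("only in Distiller:", triple)
```

With the data above, the two sets share no triple at all, because the
subjects never match even though the predicates and literal values do.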

best regards,

Sébastien Laborie and Faisal Alkhateeb

--
INRIA Rhône-Alpes
http://www.inrialpes.fr/exmo/people/laborie/





Received on Thursday, 27 March 2008 16:36:08 UTC