Re: enhance GRDDL service to show which versions it's getting, please?

Dan Connolly wrote:
> Dom,
> 
> I'm trying to use the GRDDL service to debug a transformation:
> 
> http://www.w3.org/2007/08/grddl/?docAddr=http://www.w3.org/2000/10/swap/pim/usps 
> http://www.w3.org/2007/08/grddl/
> 
> It seems like the service isn't getting the latest version
> of http://www.w3.org/2000/07/hs78/html2rdfs , but it's really
> hard to tell.
> 
> In the diagnostic comments at the bottom, can you have it
> report the HTTP last-modified times of everything it fetches?

Raptor doesn't make that kind of info available presently.
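
Until something like that exists, a rough client-side check is possible. The
sketch below is an assumption, not anything Raptor or the GRDDL service does:
it sends an HTTP HEAD request for each URL and reads back the Last-Modified
header. The names last_modified, report and the opener parameter are made up
for illustration. It shows what the server says now, not which copy the
service actually fetched.

```python
import urllib.request


def last_modified(url, opener=urllib.request.urlopen, timeout=10):
    """Return the Last-Modified header for url, or None if none is sent.

    opener is a hypothetical hook so the network call can be swapped out;
    by default it is urllib.request.urlopen.
    """
    request = urllib.request.Request(url, method="HEAD")
    with opener(request, timeout=timeout) as response:
        return response.headers.get("Last-Modified")


def report(urls, opener=urllib.request.urlopen):
    """Map each URL to its Last-Modified value (None when absent)."""
    return {url: last_modified(url, opener=opener) for url in urls}
```

Calling report() with the source document and the two transformation URIs
that show up in the trace (http://www.w3.org/2003/g/pgdata and
http://www.w3.org/2000/07/hs78/html2rdfs) would give one timestamp per fetch.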

> p.s. this is an experiment in using public-grddl-comments
> to track GRDDL service support requests.

I also checked it out on Triplr
  http://triplr.org/turtle/www.w3.org/2000/10/swap/pim/usps
which I know is running the latest Raptor release, 1.4.16; the GRDDL service
is running a pre-release.

There were a lot of messages thrown away during GRDDL processing.  I've been
considering turning error messages into at least warnings rather than just
discarding them.  That would make them available for applications.
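
To make the idea concrete, here is a minimal sketch of what "warnings instead
of discarding" could look like from the application side. It is purely
illustrative: Message, MessageLog and the downgrade flag are hypothetical
names, and nothing here reflects Raptor's actual API.

```python
from dataclasses import dataclass, field


@dataclass
class Message:
    level: str   # "warning" or "error"
    source: str  # e.g. the XSLT URI that produced the message
    text: str


@dataclass
class MessageLog:
    messages: list = field(default_factory=list)

    def report(self, level, source, text, downgrade=True):
        """Record a message instead of dropping it.

        With downgrade=True, an "error" is stored as a "warning" so the
        caller can inspect it without treating the whole parse as failed.
        """
        if downgrade and level == "error":
            level = "warning"
        self.messages.append(Message(level, source, text))

    def warnings(self):
        """Return only the messages stored at warning level."""
        return [m for m in self.messages if m.level == "warning"]
```

An application would then read log.warnings() after parsing rather than
losing the messages entirely.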

Anyway, on the command line in debug mode:

$ utils/rapper -c -i grddl --trace http://www.w3.org/2000/10/swap/pim/usps
rapper: Parsing URI http://www.w3.org/2000/10/swap/pim/usps with parser grddl
rapper: Serializing with serializer ntriples
rapper: Processing URI http://www.w3.org/2000/10/swap/pim/usps
raptor_grddl.c:1354:raptor_grddl_parse_chunk: Parser 0x1407fa0: Found
document with type 'text/html; charset=iso-8859-1' is HTML
raptor_grddl.c:1387:raptor_grddl_parse_chunk: Parser 0x1407fa0: URI
http://www.w3.org/2000/10/swap/pim/usps: processing 5977 bytes of content
raptor_grddl.c:1395:raptor_grddl_parse_chunk: Parser 0x1407fa0: Creating an
XML parser
raptor_grddl.c:1484:raptor_grddl_parse_chunk: Parser 0x1407fa0: Parsing as XML
raptor_grddl.c:1486:raptor_grddl_parse_chunk: Parser 0x1407fa0: Parsing as
XML succeeded
raptor_grddl.c:1533:raptor_grddl_parse_chunk: Parser 0x1407fa0: Running
XInclude processing on URI 'http://www.w3.org/2000/10/swap/pim/usps'
raptor_grddl.c:1550:raptor_grddl_parse_chunk: Parser 0x1407fa0: XML Include
processing returned 6053 bytes document
raptor_grddl.c:1556:raptor_grddl_parse_chunk: Parser 0x1407fa0: Running
top-level GRDDL on URI 'http://www.w3.org/2000/10/swap/pim/usps'
raptor_grddl.c:1575:raptor_grddl_parse_chunk: Parser 0x1407fa0: Root
namespace URI is http://www.w3.org/1999/xhtml
raptor_grddl.c:1590:raptor_grddl_parse_chunk: Parser 0x1407fa0: Ignoring
GRDDL for namespace URI 'http://www.w3.org/1999/xhtml'
raptor_grddl.c:1129:raptor_grddl_run_xpath_match: Got list match URI
'http://www.w3.org/2003/g/data-view'
raptor_grddl.c:1653:raptor_grddl_parse_chunk: Parser 0x1407fa0: Found 1
<head profile> URIs in URI 'http://www.w3.org/2000/10/swap/pim/usps'
raptor_grddl.c:1668:raptor_grddl_parse_chunk: Ignoring <head profile> of URI
http://www.w3.org/2000/10/swap/pim/usps: URI http://www.w3.org/2003/g/data-view
raptor_grddl.c:1154:raptor_grddl_run_xpath_match: Got single match URI
'http://www.w3.org/2003/g/pgdata'
raptor_grddl.c:1154:raptor_grddl_run_xpath_match: Got single match URI
'http://www.w3.org/2000/07/hs78/html2rdfs'
raptor_grddl.c:502:raptor_grddl_add_transform_xml_context: Adding new
document transformation XSLT URI http://www.w3.org/2003/g/pgdata with base
URI http://www.w3.org/2000/10/swap/pim/usps
raptor_grddl.c:502:raptor_grddl_add_transform_xml_context: Adding new
document transformation XSLT URI http://www.w3.org/2000/07/hs78/html2rdfs
with base URI http://www.w3.org/2000/10/swap/pim/usps
raptor_grddl.c:944:raptor_grddl_run_grddl_transform_uri: Running GRDDL
transform with XSLT URI http://www.w3.org/2003/g/pgdata and base URI
http://www.w3.org/2000/10/swap/pim/usps
rapper: Processing URI http://www.w3.org/2003/g/pgdata
raptor_grddl.c:655:raptor_grddl_run_grddl_transform_doc: Running GRDDL
transform with XSLT URI 'http://www.w3.org/2003/g/pgdata' with doc base URI
'http://www.w3.org/2000/10/swap/pim/usps'
raptor_grddl.c:721:raptor_grddl_run_grddl_transform_doc: XSLT returned 495
bytes document method xml media type NULL
raptor_grddl.c:759:raptor_grddl_run_grddl_transform_doc: Parser 0x1407fa0:
Guessed parser rdfxml from mime type 'application/rdf+xml' and content
raptor_grddl.c:596:raptor_grddl_ensure_internal_parser: Parser 0x1407fa0:
Allocating new internal rdfxml parser.
raptor_grddl.c:944:raptor_grddl_run_grddl_transform_uri: Running GRDDL
transform with XSLT URI http://www.w3.org/2000/07/hs78/html2rdfs and base
URI http://www.w3.org/2000/10/swap/pim/usps
rapper: Processing URI http://www.w3.org/2000/07/hs78/html2rdfs
raptor_grddl.c:655:raptor_grddl_run_grddl_transform_doc: Running GRDDL
transform with XSLT URI 'http://www.w3.org/2000/07/hs78/html2rdfs' with doc
base URI 'http://www.w3.org/2000/10/swap/pim/usps'
rapper: Error - URI http://www.w3.org/2000/07/hs78/html2rdfs:1 - libxslt
error: no id attribute to go with profile on head; use id='' to denote the
empty prefix.
rapper: Error - URI http://www.w3.org/2000/07/hs78/html2rdfs:1 - libxslt
error: no link relationship prefix found:
rapper: Error - URI http://www.w3.org/2000/07/hs78/html2rdfs:1 - libxslt
error: no classes link
rapper: Error - URI http://www.w3.org/2000/07/hs78/html2rdfs:1 - libxslt
error: no id attribute on classes link; use id='' to denote the empty prefix.
rapper: Error - URI http://www.w3.org/2000/07/hs78/html2rdfs:1 - libxslt
error: class identifier 'MailingLocation' != label 'Mailing Location'
rapper: Error - URI http://www.w3.org/2000/07/hs78/html2rdfs:1 - libxslt
error: class identifier 'MailingLocation' != label 'Mailing Location'
rapper: Error - URI http://www.w3.org/2000/07/hs78/html2rdfs:1 - libxslt
error: property ID (deliveryAddress) different from property label (delivery
address)
rapper: Error - URI http://www.w3.org/2000/07/hs78/html2rdfs:1 - libxslt
error: property ID (zipCode) different from property label (zip code)
rapper: Error - URI http://www.w3.org/2000/07/hs78/html2rdfs:1 - libxslt
error: property ID (cityName) different from property label (city name)
rapper: Error - URI http://www.w3.org/2000/07/hs78/html2rdfs:1 - libxslt
error: property ID (stateAbbr) different from property label (state
abbreviation)
rapper: Error - URI http://www.w3.org/2000/07/hs78/html2rdfs:1 - libxslt
error: property ID (returnTo) different from property label (return to)
rapper: Error - URI http://www.w3.org/2000/07/hs78/html2rdfs:1 - libxslt
error: property ID (returnAt) different from property label (return at)
raptor_grddl.c:721:raptor_grddl_run_grddl_transform_doc: XSLT returned 6323
bytes document method xml media type NULL
raptor_grddl.c:759:raptor_grddl_run_grddl_transform_doc: Parser 0x1407fa0:
Guessed parser rdfxml from mime type 'application/rdf+xml' and content
rapper: Failed to parse URI http://www.w3.org/2000/10/swap/pim/usps grddl
content
rapper: Parsing returned 85 triples


i.e. this transform is failing
  doc:  http://www.w3.org/2000/10/swap/pim/usps
  xslt: http://www.w3.org/2000/07/hs78/html2rdfs
which you can try with
  $ xsltproc http://www.w3.org/2000/07/hs78/html2rdfs \
      http://www.w3.org/2000/10/swap/pim/usps
or your favourite XSLT engine.

Here "failing" means the stylesheet wrote xsl:message output to the
console/error log. I don't think libxslt can distinguish those messages from
real errors.
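
To look at the messages separately from the transform result, the xsltproc
run can be wrapped so that its error stream is captured. This is a sketch
under two assumptions: that xsltproc is on the PATH, and that it writes
xsl:message output to stderr. run_transform and the runner parameter are
made-up names for illustration.

```python
import subprocess


def run_transform(xslt_url, doc_url, runner=subprocess.run):
    """Run xsltproc and return (exit_code, output, messages).

    runner is a hypothetical hook for substituting the process call;
    by default it is subprocess.run.
    """
    completed = runner(
        ["xsltproc", xslt_url, doc_url],
        capture_output=True,  # keep stdout and stderr apart
        text=True,
        check=False,          # do not raise on a non-zero exit status
    )
    # Keep the non-blank stderr lines; these are assumed to be the messages.
    messages = [line for line in completed.stderr.splitlines() if line.strip()]
    return completed.returncode, completed.stdout, messages
```

The exit status is returned alongside the messages so the two can be
compared; what xsltproc actually reports for this stylesheet is not shown
here.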

Dave

Received on Friday, 5 October 2007 01:20:33 UTC