- From: Dan Brickley <danbri@danbri.org>
- Date: Fri, 25 Jan 2008 18:30:57 +0000
- To: Bijan Parsia <bparsia@cs.man.ac.uk>
- Cc: "Ralph R. Swick" <swick@w3.org>, editor@content-wire.com, semantic-web@w3c.org, Dominique Hazael-Massieux <dom@w3.org>
+cc: Dom Bijan Parsia wrote: > One thing that might be helpful...and doesn't even require W3Cness!...to > put together an alternative interface, e.g., exhibit or j/mspace or... > > A simple scrape of TR plus all the prior versions would not only make it > easy to find the latest, but see prior versions etc. > > I don't have the time, etc. to do the whole task, but I'd certainly > help/host any efforts in this direction. I, personally, would benefit > from such a service :) That's an interesting idea. No scraping needed though. Anyone who manages to read all the way to the bottom of http://www.w3.org/TR/ and sees the RDF logo there, and decides to click it, is rewarded with a link to http://www.w3.org/2002/01/tr-automation/ """Automating the publication of Technical Reports Abstract This document presents the "TR Automation" project; this project, based on the use of Semantic Web tools and technologies, has allowed to streamline the publication paper trail of W3C Technical Reports, to maintain an RDF-formalized index of these specifications and to create a number of tools using these newly available data.""" There's an RDF version here, so no need to scrape. http://www.w3.org/2002/01/tr-automation/tr.rdf This seems to be fresh: HEAD /2002/01/tr-automation/tr.rdf HTTP/1.1 Host: foo HTTP/1.1 200 OK Date: Fri, 25 Jan 2008 18:08:16 GMT Server: Apache/2 Last-Modified: Thu, 24 Jan 2008 14:39:20 GMT ETag: "44478ce9cf600" Accept-Ranges: bytes Content-Length: 540774 Cache-Control: max-age=21600 Expires: Sat, 26 Jan 2008 00:08:16 GMT P3P: policyref="http://www.w3.org/2001/05/P3P/p3p.xml" Content-Type: application/rdf+xml; qs=0.9 This gives 7k triples, FlipFlop:~ danbri$ rapper --count http://www.w3.org/2002/01/tr-automation/tr.rdf rapper: Parsing URI http://www.w3.org/2002/01/tr-automation/tr.rdf rapper: Parsing returned 7675 triples Quick SPARQL experiment: FlipFlop:tr-automation danbri$ more recent.rq PREFIX doc: <http://www.w3.org/2000/10/swap/pim/doc#> PREFIX dc: <http://purl.org/dc/elements/1.1/> SELECT DISTINCT ?title ?manifestation ?date FROM <tr.rdf> WHERE { ?manifestation doc:versionOf ?work; dc:date ?date; dc:title ?title. FILTER ( regex( ?date, "200[8]")). } ORDER BY ?date Running this with Roqet, a Redland utility: FlipFlop:tr-automation danbri$ roqet recent.rq roqet: Querying from file recent.rq roqet: Query has a variable bindings result result: [title=string("XHTML Access Module"), manifestation=uri<http://www.w3.org/TR/2008/WD-xhtml-access-20080107>, date=string("2008-01-07")] result: [title=string("OWL 1.1 Web Ontology Language: Mapping to RDF Graphs"), manifestation=uri<http://www.w3.org/TR/2008/WD-owl11-mapping-to-rdf-20080108/>, date=string("2008-01-08")] result: [title=string("OWL 1.1 Web Ontology Language: Model-Theoretic Semantics"), manifestation=uri<http://www.w3.org/TR/2008/WD-owl11-semantics-20080108/>, date=string("2008-01-08")] result: [title=string("OWL 1.1 Web Ontology Language: Structural Specification and Functional-Style Syntax"), manifestation=uri<http://www.w3.org/TR/2008/WD-owl11-syntax-20080108/>, date=string("2008-01-08")] result: [title=string("SMIL Timesheets 1.0"), manifestation=uri<http://www.w3.org/TR/2008/WD-timesheets-20080110/>, date=string("2008-01-10")] result: [title=string("Service Modeling Language, Version 1.1"), manifestation=uri<http://www.w3.org/TR/2008/WD-sml-20080114/>, date=string("2008-01-14")] result: [title=string("Service Modeling Language Interchange Format Version 1.1"), manifestation=uri<http://www.w3.org/TR/2008/WD-sml-if-20080114/>, date=string("2008-01-14")] result: [title=string("Synchronized Multimedia Integration Language (SMIL 3.0)"), manifestation=uri<http://www.w3.org/TR/2008/CR-SMIL3-20080115/>, date=string("2008-01-15")] result: [title=string("SPARQL Query Results XML Format"), manifestation=uri<http://www.w3.org/TR/2008/REC-rdf-sparql-XMLres-20080115/>, date=string("2008-01-15")] result: [title=string("SPARQL Protocol for RDF"), manifestation=uri<http://www.w3.org/TR/2008/REC-rdf-sparql-protocol-20080115/>, date=string("2008-01-15")] result: [title=string("SPARQL Query Language for RDF"), manifestation=uri<http://www.w3.org/TR/2008/REC-rdf-sparql-query-20080115/>, date=string("2008-01-15")] result: [title=string("HTML 5"), manifestation=uri<http://www.w3.org/TR/2008/WD-html5-20080122/>, date=string("2008-01-22")] result: [title=string("HTML 5 differences from HTML 4"), manifestation=uri<http://www.w3.org/TR/2008/WD-html5-diff-20080122/>, date=string("2008-01-22")] result: [title=string("Relationship Between Mobile Web Best Practices 1.0 and Web Content Accessibility Guidelines"), manifestation=uri<http://www.w3.org/TR/2008/WD-mwbp-wcag-20080122/>, date=string("2008-01-22")] roqet: Query returned 14 results These dates as strings not datatyped, so for this experiment I just asked it what W3C had been up to in 2008, expressed as a regex. A real app would probably have to rummage around a bit more to find latest version, I'm not sure what can be done in a single pass of SPARQL. The data also has author information at all, but unfortunately no identifying properties of the authors currently. If anyone's going to play with this, do look around the project page at http://www.w3.org/2002/01/tr-automation/ for tools and ideas. And of course if you do make a fancy pretty alternate interface, please take care to make it clear that the page is your effort not W3C's, and that it itself might not be up to date. Otherwise we could end up back where we started here :) cheers, Dan ps. on a related note, http://tirania.org/blog/archive/2008/Jan-24-1.html describes a similar problem with software downloads and users finding the old version by accident...
Received on Friday, 25 January 2008 18:31:20 UTC