refining sitemap.py to support typed links Re: [HTML]

On Tue, 2005-08-02 at 22:03 +0100, Dan Brickley wrote:
> As discussed in today's RDF-in-XHTML taskforce call, I've made some
> progress towards my action on a Note describing the XHTML2
> link types in RDF. See work-in-progress c/o:
>    
>     XHTML link types in RDF
>     W3C Editor's Working Draft 2 August 2005
>     http://www.w3.org/2005/05/hrel/
>     $Id: Overview.html,v 1.52 2005/08/02 20:54:23 danbri Exp $

That reminds me of...

[[
sitemap -- crawl web site, record titles and links

design sketch:

1. content = urlopen.urlopen(startAddr)
2. xmlDoc = xmlparse(content)
3. neighbors = xpath.eval(xmlDoc, "//a/@href"); neighbors = map(lambda x: uripath.join(x, startAddr), neighbors)
4. title = xpath.eval(xmlDoc, "//title")
5. kb.addTriple(startAddr, dc.title, title)
6. for n in neighbors: kb.addTriple(startAddr, dc:relation, n)
7. queue unseen neighbors, recur
]]
 -- http://www.w3.org/2000/10/swap/util/sitemap.py

Let's enhance it to use the rel attribute to specialize dc:relation.

As to:

TODO:
  -- circles/arrows diagram of results

I have done that a few times, but haven't published the results.
I have a graphic stylesheet...
  http://www.w3.org/2001/02pd/sitemap-style.n3
and there's a Makefile stanza in...
  http://www.w3.org/2001/02pd/Makefile

Let's put together a little article on putting it all together, shall we?

Future/related work:
 -- google sitemap protocol
 -- tag siteData-36 issue




-- 
Dan Connolly, W3C http://www.w3.org/People/Connolly/
D3C2 887B 0F92 6005 C541  0875 0F91 96DE 6E52 C29E

Received on Monday, 8 August 2005 16:43:05 UTC