- From: Danny Ayers <danny666@virgilio.it>
- Date: Sun, 26 Oct 2003 10:33:18 +0100
- To: <atom-syntax@imc.org>, "Sam Ruby" <rubys@apache.org>
- Cc: "Www-Rdf-Interest" <www-rdf-interest@w3.org>
(cc'd to rdf-interest) Ken MacLeod: > > Sam Ruby is blogging his experiments with XPath and XQuery over his > > accumulated store of XML. I expect as part of his experimentation > > he'll be adding more islands of information to that store as time goes > > along. My bet is that he's going to be writing a lot more supporting > > code and queries to "tie together" those islands of information than > > someone using an RDF store would have to do. Sab Ruby: > IMHO, the most signficant source of metadata in my accumulated store of > information (only recently beeing converted over to XML) is within the > hypertext links within the content. My bet is that writing code and > queries to extract that particular information from well formed XHTML > will be a lot easier than trying to mine that information from within > the parse type = "Literal" islands that have been proposed to date. > > Are there any proposals for "RDFizing" this information? It's a very interesting idea. The <a href="... info is likely to be a lot more useful for queries than knowing e.g. where the <p>s occur. The direct approach may be to pull out the links as extra metadata, given something like: <entry rdf:about="http://www.intertwingly.net/blog/1624.html"> <title>Atom in Depth</title> <content> <img src="http://www.intertwingly.net/images/xml03.jpg" class="floatleft" alt="XML 2003" />&nbsp; I'll giving a presentation entitled <a href="http://www.xmlconference.org/xmlusa/2003/friday.asp#22">Atom in Depth</a> on Friday, December 12th at the <a href="http://www.xmlconference.org/xmlusa/">2003 XML Conference</a> in Philadelphia. </content> ... </entry> it would be possible to extract something like <entry rdf:about="http://www.intertwingly.net/blog/1624.html"> <embeds> <link rdf:about=""http://www.xmlconference.org/xmlusa/2003/friday.asp#22"> <title>Atom in Depth</title> </link> <link rdf:about="http://www.xmlconference.org/xmlusa/"> <title>2003 XML Conference</title> </link> </embeds> </entry> A potentially cool bit is that I think it should be possible to do this on the fly : an RDF query could ask for all links in the content of that entry, and the RDF store could get as far as <content> but then an XPath query could be carried out on that, the results expressed in the RDF model. Why not do it all as XML/XPath? Main reason being that the XML tree structure doesn't match the web's general directed graph structure, probably a more immediate practical problem being that the subtrees aren't very portable across systems (merging isn't straightforward). some related notes : http://dannyayers.com/archives/001981.html Cheers, Danny.
Received on Sunday, 26 October 2003 04:45:53 UTC