Hi, I was a bit frustrated with not having a set of generic URIs I could use for the dc:subject (amongst others) in the RDF instances I produce. So, I've knocked up a quick 100 line PHP script that scrapes the XML export of wikipedia entries, and returns an RDF Class for any wikipedia term you request... For example: http://www.rdfuri.com/wiki/Semantic_Web http://www.rdfuri.com/wiki/John_Kerry http://www.rdfuri.com/wiki/OWL http://www.rdfuri.com/wiki/Owl http://www.rdfuri.com/wiki/Cardiff http://www.rdfuri.com/wiki/Politics (You can use any wikipedia term at the end of the www.rdfuri.com/wiki/ URL). At the moment, it's just scraping for the title (in the different languages) and the categories that have been applied. My two questions are: a) is this at all useful (or is there a better alternative), and b) if it is, what other RDF data could be scraped out of the wiki code for each entry? Thanks, Dan -------------------------------------- Dan Zambonini Box UK Internet Development and Consultancy t: +44 (0)29 2022 8822 f: +44 (0)29 2022 8820 e: dan.zambonini@boxuk.com w: www.boxuk.com --------------------------------------Received on Monday, 23 August 2004 08:22:38 GMT
This archive was generated by hypermail 2.2.0+W3C-0.50 : Monday, 7 December 2009 10:52:08 GMT