- From: Hugh Glaser <hg@ecs.soton.ac.uk>
- Date: Mon, 25 Jan 2010 22:10:44 +0000
- To: Mischa Tuffield <mmt04r@ecs.soton.ac.uk>
- CC: "public-lod@w3.org" <public-lod@w3.org>
Thanks mate. On 25/01/2010 20:01, "Mischa Tuffield" <mmt04r@ecs.soton.ac.uk> wrote: > Hi Hugh, > > The code you posted seems sane to me. It is very similar to the php we use to > make sparql-queries. My only comment being that you probably want to have a > line which closes curl connection after you have used it. > > something like : > > <!-- > curl_close($ch); > --> > > You could also set a User Agent, to identify your self to the website's you > are fetching RDF from, you would do it like so : > > <!-- > curl_setopt($ch, CURLOPT_USERAGENT, "hugh's crawler 0.1"); > --> > > I hope this helps, > > Mischa > > On 25 Jan 2010, at 18:10, Hugh Glaser wrote: > >> OK, herešs some fun for you... >> (Excuse me if it has been discussed before, and just point me at it :-) ) >> >> >> Having struggled through the php manual for cURL, I have come up with the >> following draft for getting an RDF document, given a URI. >> >> $ch = curl_init(); >> curl_setopt($ch, CURLOPT_URL, $_REQUEST['uri']); >> curl_setopt($ch, CURLOPT_RETURNTRANSFER, true); >> curl_setopt($ch, CURLOPT_FOLLOWLOCATION, true); >> curl_setopt($ch, CURLOPT_HTTPHEADER, array("Accept: >> application/rdf+xml, text/n3, text/rdf+n3, text/turtle, application/x-turtle, >> application/turtle, text/plain")); >> $data = curl_exec($ch); >> $info = curl_getinfo($ch); >> >> if ($data === FALSE || $info['http_code'] != 200) { >> >> What does anyone think? >> Išm sure there are a bunch of improvements/corrections. >> >> As a (hopefully) separate issue, the MIME types will probably generate some >> discussion, but it is the PHP I am primarily asking about at the moment. >> >> Best >> Hugh >> > > _________________________________ > Mischa Tuffield > ECS - http://www.ecs.soton.ac.uk/ > Homepage - http://users.ecs.soton.ac.uk/mmt04r/ > Identity - http://id.ecs.soton.ac.uk/person/6914 > WebID - http://mmt.me.uk/foaf.rdf#mischa > > > > >
Received on Monday, 25 January 2010 22:20:33 UTC