- From: Mischa Tuffield <mmt04r@ecs.soton.ac.uk>
- Date: Mon, 25 Jan 2010 20:01:59 +0000
- To: Hugh Glaser <hg@ecs.soton.ac.uk>
- Cc: "public-lod@w3.org" <public-lod@w3.org>
Hi Hugh, The code you posted seems sane to me. It is very similar to the php we use to make sparql-queries. My only comment being that you probably want to have a line which closes curl connection after you have used it. something like : <!-- curl_close($ch); --> You could also set a User Agent, to identify your self to the website's you are fetching RDF from, you would do it like so : <!-- curl_setopt($ch, CURLOPT_USERAGENT, "hugh's crawler 0.1"); --> I hope this helps, Mischa On 25 Jan 2010, at 18:10, Hugh Glaser wrote: > OK, here’s some fun for you... > (Excuse me if it has been discussed before, and just point me at it :-) ) > > > Having struggled through the php manual for cURL, I have come up with the following draft for getting an RDF document, given a URI. > > $ch = curl_init(); > curl_setopt($ch, CURLOPT_URL, $_REQUEST['uri']); > curl_setopt($ch, CURLOPT_RETURNTRANSFER, true); > curl_setopt($ch, CURLOPT_FOLLOWLOCATION, true); > curl_setopt($ch, CURLOPT_HTTPHEADER, array("Accept: application/rdf+xml, text/n3, text/rdf+n3, text/turtle, application/x-turtle, application/turtle, text/plain")); > $data = curl_exec($ch); > $info = curl_getinfo($ch); > > if ($data === FALSE || $info['http_code'] != 200) { > > What does anyone think? > I’m sure there are a bunch of improvements/corrections. > > As a (hopefully) separate issue, the MIME types will probably generate some discussion, but it is the PHP I am primarily asking about at the moment. > > Best > Hugh > _________________________________ Mischa Tuffield ECS - http://www.ecs.soton.ac.uk/ Homepage - http://users.ecs.soton.ac.uk/mmt04r/ Identity - http://id.ecs.soton.ac.uk/person/6914 WebID - http://mmt.me.uk/foaf.rdf#mischa
Received on Tuesday, 26 January 2010 13:26:10 UTC