- From: Hugh Glaser <hg@ecs.soton.ac.uk>
- Date: Mon, 25 Jan 2010 22:10:44 +0000
- To: Mischa Tuffield <mmt04r@ecs.soton.ac.uk>
- CC: "public-lod@w3.org" <public-lod@w3.org>
Thanks mate.
On 25/01/2010 20:01, "Mischa Tuffield" <mmt04r@ecs.soton.ac.uk> wrote:
> Hi Hugh,
>
> The code you posted seems sane to me. It is very similar to the php we use to
> make sparql-queries. My only comment being that you probably want to have a
> line which closes curl connection after you have used it.
>
> something like :
>
> <!--
> curl_close($ch);
> -->
>
> You could also set a User Agent, to identify your self to the website's you
> are fetching RDF from, you would do it like so :
>
> <!--
> curl_setopt($ch, CURLOPT_USERAGENT, "hugh's crawler 0.1");
> -->
>
> I hope this helps,
>
> Mischa
>
> On 25 Jan 2010, at 18:10, Hugh Glaser wrote:
>
>> OK, herešs some fun for you...
>> (Excuse me if it has been discussed before, and just point me at it :-) )
>>
>>
>> Having struggled through the php manual for cURL, I have come up with the
>> following draft for getting an RDF document, given a URI.
>>
>> $ch = curl_init();
>> curl_setopt($ch, CURLOPT_URL, $_REQUEST['uri']);
>> curl_setopt($ch, CURLOPT_RETURNTRANSFER, true);
>> curl_setopt($ch, CURLOPT_FOLLOWLOCATION, true);
>> curl_setopt($ch, CURLOPT_HTTPHEADER, array("Accept:
>> application/rdf+xml, text/n3, text/rdf+n3, text/turtle, application/x-turtle,
>> application/turtle, text/plain"));
>> $data = curl_exec($ch);
>> $info = curl_getinfo($ch);
>>
>> if ($data === FALSE || $info['http_code'] != 200) {
>>
>> What does anyone think?
>> Išm sure there are a bunch of improvements/corrections.
>>
>> As a (hopefully) separate issue, the MIME types will probably generate some
>> discussion, but it is the PHP I am primarily asking about at the moment.
>>
>> Best
>> Hugh
>>
>
> _________________________________
> Mischa Tuffield
> ECS - http://www.ecs.soton.ac.uk/
> Homepage - http://users.ecs.soton.ac.uk/mmt04r/
> Identity - http://id.ecs.soton.ac.uk/person/6914
> WebID - http://mmt.me.uk/foaf.rdf#mischa
>
>
>
>
>
Received on Monday, 25 January 2010 22:20:33 UTC