Re: .htaccess a major bottleneck to Semantic Web adoption / Was: Re: RDFa vs RDF/XML and content negotiation

OK, I'll have a go :-)
Why did I think this would be fun to do on a sunny Sunday morning that has turned into afternoon?
Here are the instructions:

 1.  Create a web-accessible directory, let's say foobar, with all your .rdf, .ttl, .ntriples and .html files in it.
 2.  Copy lodpub.php and path.php into it.
 3.  Access path.php from your web server.
 4.  Follow the instruction to paste that text into .htaccess
 5.  You can remove path.php if you like, it was only there to help you get the .htaccess right.

That should be it.
The above text and files are at

Of course, I expect that you can tell me all sorts of problems/better ways, but I am hoping it works for many.

Some explanation:
We use a different method, and I have tried to extract the essence, and keep the code very simple.
We trap all 404 (File not Found) in the directory, and then any requests coming in for non-existent files will generate a 303 with an extension added, depending on the Accept header.
Note that you probably need the leading "/" followed by the full path from the domain root, otherwise it will just print out the text "lodpub.php";
(That is not what the apache specs seem to say, but it is what seems to happen).
If you get "Additionally, a 404 Not Found error was encountered while trying to use an ErrorDocument to handle the request.", then it means that web server is not finding your ErrorDocument .
Put the file path.php in the same directory and point your browser at it - this will tell you what the path should be.

Note that the httpd.conf (in /etc/httpd/conf) may not let your override, if your admins have tied things down really tight.
Mine says:
    AllowOverride All

Finally, at the moment, note that I think that apache default does not put the correct MIME type on rdf files, but that is a separate issue, and it makes no difference that the 303 happened.


On 05/07/2009 01:52, "Pierre-Antoine Champin" <> wrote:

> Le 03/07/2009 15:14, Danny Ayers a écrit :
>> 2009/7/2 Bill Roberts<>:
>>> I thought I'd give the .htaccess approach a try, to see what's involved in
>>> actually setting it up.  I'm no expert on Apache, but I know the basics of
>>> how it works, I've got full access to a web server and I can read the online
>>> Apache documentation as well as the next person.
>> I've tried similar, even stuff using PURLs - incredibly difficult to
>> get right. (My downtime overrides all, so I'm not even sure if I got
>> it right in the end)
>> I really think we need a (copy&  paste) cheat sheet.
>> Volunteers?
> (raising my hand) :)*
> Here is a quick python script that makes it easier (if not completely
> immediate). It may still requires a one-liner .htaccess, but one that (I
> think) is authorized by most webmasters.
> I guess a PHP version would not even require that .htaccess, but sorry,
> I'm not fluent in PHP ;)
> So, assuming you want to publish a vocabulary with an RDF and an HTML
> description at, you need to:
> 1. Make `myvoc` a directory at the place where your HTTP server will
>     serve it at the desired URI.
> 2. Copy the script in this directory as 'index.cgi' (or 'index.wsgi' if
>     your server as WSGI support).
> 3. In the same directory, put two files named 'index.html' and
>     'index.rdf'
> If it does not work now (it didn't for me),you have to tell your HTTP
> server that the directory index is index.wsgi. In apache, this is done
> by creating (if not present) a `.htaccess` file in the `myvoc`
> diractory, and adding the following line::
>      DirectoryIndex index.cgi
> (or `index.wsgi`, accordingly)
> There is more docs in the script itself. I think the more recipes
> (including for other httpds) we can provide with the script, the more
> useful it will be. So feel free to propose other ones.
>   enjoy
>    pa

Received on Sunday, 5 July 2009 15:18:07 UTC