W3C home > Mailing lists > Public > public-rdfa-wg@w3.org > April 2012

Experimental RDFa extractor in JS

From: Niklas Lindström <lindstream@gmail.com>
Date: Fri, 20 Apr 2012 01:58:02 +0200
Message-ID: <CADjV5jfz9oiP+10mKH7k+2RVJj5bGc8YVObGENCh542-KqsWKg@mail.gmail.com>
To: public-rdfa-wg <public-rdfa-wg@w3.org>
Hi all!

The last couple of days I've been experimenting with a different kind
of approach to implementing an RDFa extractor. The result so far is a
draft with admittedly rather partial coverage. However, I hope some
aspects of it will be of interest even at this stage:

1. It is implemented in pure Javascript. (Well, actually, in some 170
lines of CoffeeScript, but the generated result is the same.)
2. It runs both in the browser and on Node (used with jsdom).
3. It does not produce triples. It directly creates a JSON-LD extract
(corresponding in shape to the RDFa). This is the difference, and the
fun part.

Now, it really doesn't handle anything but the most simple RDFa 1.1.
Possibly all of Lite, plus @datatype, @rel (including hanging),
@inlist, @rev and perhaps one or two more. It only copes with @about
if it's alone, it doesn't handle combinations of @rel and @property,
and so on. I'll strive to make it a lot more compliant given time of

- You can check out the code at: https://github.com/niklasl/rdfa-lab
- Or enjoy the bookmarklet (only tested in Firefox), available at:

(Just add the latter to your bookmarks and apply on any page
containing RDFa. I recommend the JSONView [1] browser add-on for a
good experience.)

I hope you'll enjoy the little things it can do. (For one, using the
resulting JSON-LD directly in a JS application should prove

Best regards,

[1]: http://jsonview.com/
Received on Thursday, 19 April 2012 23:59:01 UTC

This archive was generated by hypermail 2.4.0 : Friday, 17 January 2020 17:05:31 UTC