W3C home > Mailing lists > Public > www-rdf-interest@w3.org > September 2002

Re: Extracting RDF from XHTML table with XSLT

From: Masahide Kanzaki <post@kanzaki.com>
Date: Mon, 23 Sep 2002 15:32:05 +0900
Message-Id: <p04320401b9b4605e74ab@[[192.168.0.4]]>
To: www-rdf-interest@w3.org
Cc: seth@robustai.net, seth@robustai.net

Thank you Seth for your comment.

> >(If an XHTML author uses multi-word <th>, such as "Publishing Place", this
> >approach fails, but there will be some ways to manage these situations)
> >
>May I suggest that there is a fairly obvious transformation of multi
>word phrases to rdf compatable Id's.  Remove punctuation, lower case the
>phrase (so will produce more matches) and substitute '_' for space.
>  Where a phrase contains a quote, substitute '-' for space inside the
>quotation.

Well, I added a translate() function to my sample XSLT so that conversions as

>Publishing Place  --> publishing_place

take place.

All lower case would be better according to the property name convention. I
am not sure, however, if we should do this conversion, because these names
do not exactly match the WordNet vocabulary which is generally defined with
first letter capitalized.

Actually, we can retrieve an RDF schema for class 'Place' even if the name
is lowered as http://xmlns.com/wordnet/1.6/place. But there is no such
class 'place' in the schema.... Does this matter for semantic agent or not ?

Your comments and suggestions are welcome.

Best regards,

Masahide Kanzaki
http://kanzaki.com/info/webwho.rdf
Received on Monday, 23 September 2002 02:47:24 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Monday, 7 December 2009 10:51:56 GMT