Re: Handling RDF with a serach engine category structure

kevin.

When I am using a new RDF file, the first thing that I do is to convert
it to n3, and using tools like grep, cut, sort  to slice it up.

The redland tool has a great and fast tool to process large rdf files
easily
http://www.redland.opensource.ac.uk/

The tool is called rdfdump or "rapper". The idea is that you first dump
into n3, that makes the file simpler.

from there, I would extract the predicates, with a command like 
cut -d\< -f2  file.n3  | sort -u > pred.txt

That should extract the second column (predicate) from the file, 
with the predicates handy, you can explore them individually.

Then when you have an overview of the structure of the file,
and are able to query them using simple greps and cuts, then I would
attempt at writing a program. 

just my two euro cents.

mike

--- Kevin Kwan <50349562@student.cityu.edu.hk> wrote:
> Dear Sir,
> 
> I'm a student in City University in Hong Kong and I'm dealing with a
> task 
> which has to build an internal search engine to classify some
> inputted keywords into different category. I have download and study
> Jena, which from http://www.hpl.hp.com/semweb/download.htm, and I got
> some problem in handling RDF.
> 
> Knowing that the RDF used in Internet search engine is the most
> suitable tool for me to perform the task mentioned above, I've
> download the RDF ( 
> structure.rdf.u8.gz) at the web http://dmoz.org/rdf.html and try to
> study it. 
> However, it's structure is different greatly from the RDF structure
> introduced 
> in Jena, so I have no idea on how to handle it. Would you kindly tell
> me how can I hanlde the structure like this? Looking forward to your
> reply, thx!
> 
> Best wishes,
> Kevin Kwan from Hong Kong
> 
> 


=====
James Michael DuPont
http://introspector.sourceforge.net/

__________________________________________________
Do you Yahoo!?
Yahoo! Shopping - Send Flowers for Valentine's Day
http://shopping.yahoo.com

Received on Tuesday, 18 February 2003 12:06:31 UTC