W3C home > Mailing lists > Public > www-rdf-interest@w3.org > June 2001

RE: search engine

From: Danny Ayers <danny.ayers@btinternet.com>
Date: Sun, 24 Jun 2001 02:06:42 +0100
To: "Sushmita Upadhaya" <sushmita@navneet.net>, <www-rdf-interest@w3.org>
Message-ID: <EBEPLGMHCDOJJJPCFHEFGEPCDKAA.danny.ayers@btinternet.com>
Hi Sushmita,
I'm not sure what you mean by the results stored in the RDF database - the
Open Directory database (the *big* RDF dumps)? What I'd suggest is that you
have a look at the information you are thinking of indexing, and the RDF
model & syntax pages (there are some write-ups that almost get it into human
language) and see how the two might meet. The standard results page is just
how you do the presentation (any of the server-side languages - Python would
probably be the quickest, though Perl would keep the old-timers happy and
Java would make me smile) - getting the information together in the first
place could be harder. What would be interesting is if you could use
standard search engine techniques (page parsing for meta tags, get the links
a la Google, get the meta tags, get the meta tags) and from this make RDF
representations. With a set of pages like a Uni's site, this would make a
very manageable and worthwhile (?) project .

Cheers,
Danny.
---
Danny Ayers
http://www.isacat.net

  -----Original Message-----
  From: www-rdf-interest-request@w3.org
[mailto:www-rdf-interest-request@w3.org]On Behalf Of Sushmita Upadhaya
  Sent: 22 June 2001 08:05
  To: www-rdf-interest@w3.org
  Subject: search engine


  Hi to all the RDF group,

  My name' s sushmita upadhaya, student .
  I' m working on a project to crawl all the university' s HTML pages and
making it searchble by an internal search
  engine.
  I would like more informations about previous projects to understand how
to
  search the results stored in the RDF database, and how to make them not
  hieracly indexed, only showing a standard search results page (as in a
  common search engine).

  Thanks for the help


  sushmita
Received on Saturday, 23 June 2001 21:11:58 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Monday, 7 December 2009 10:51:49 GMT