Re: web to semantic web : an automated approach

>>>You cannot expect information or more precise raw data to produce
meaningful semantic content if it was never produced in a format allowing
for >>>semantic output.

I have big hopes on the NLP systems here. They are pretty advanced these
days and can only improve in the near/far future. Not all data on the
Internet is written in manner that would be difficult for NLP systems to
parse(eg the one using slang etc). Many articles such as ones written on
wikipedia or say times.com or many blogs are of good quality and a large
portion of them should be understandable by NLP systems.



>>>We will have to live with the fact that maybe more than half of all
"content" on the Web will never lend itself to conversion into useful
semantic >>>content

The good thing about web is that many important pieces of information are
duplicated at multiple places. If NLP systems faild to retrieve some
information from one source, there are other sources where it might succedd.
The good thing about symentic web is that we only have to _get information
right only once_ (or more generally speaking, right count should be more
than wrong count).  This is unlike the current search engines which keeps
track of information from _all_ the sources. IMHO if we are able to
understand even quater of all the inforamtion on web, semantic web would be
an unparalled success.

Received on Sunday, 26 October 2008 16:36:26 UTC