W3C home > Mailing lists > Public > public-lod@w3.org > April 2009

Re: numeric data on the web, numeric web search

From: Wolfgang Orthuber <orthuber@kfo-zmk.uni-kiel.de>
Date: Thu, 30 Apr 2009 12:04:24 +0100
Message-ID: <002e01c9c983$6c27b490$a3b35ec2@workstation>
To: <public-lod@w3.org>
Thanks for the information! I have posted
http://www.thenationaldialogue.org/ideas/numeric-web-search
an hope it helps.

Let me know if you have further ideas and suggestions, I am always interested.

Wolfgang

----- Original Message ----- 
From: "Kingsley Idehen" <kidehen@openlinksw.com>
To: "Semantics-ProjectParadigm" <metadataportals@yahoo.com>
Cc: <public-lod@w3.org>; "Wolfgang Orthuber" <orthuber@kfo-zmk.uni-kiel.de>; "semantic-web"
<semantic-web@w3.org>
Sent: Wednesday, April 29, 2009 9:35 PM
Subject: Re: numeric data on the web, numeric web search


> Semantics-ProjectParadigm wrote:
>> Excellent and timely starting point. Will definitely look into sending in ideas about making available data
>> that will help stimulate "green revolution" and generate green jobs.
>>
> Even if the Govts. of the world simply publish XML based structured data, that alone would deliver full
> employment and lots of follow-on  opportunities for the RDFization technology developers  :-)
>
> So as long as the data is structured, there will be huge opportunities for those that grok Linked Data and
> the process of RDFization, esp. those that  generate wrapper/proxy based Linked Data URIs  :-)
>
>
> Kingsley
>>
>> December 2009 in Denmark follow up to Kyoto is coming up. Oceans of raw data waiting to be processed to
>> come up with policies that both address climate change issues AND generate new jobs!
>>
>> Milton Ponson
>> GSM: +297 747 8280
>> Rainbow Warriors Core Foundation
>> PO Box 1154, Oranjestad
>> Aruba, Dutch Caribbean
>> www.rainbowwarriors.net
>> Project Paradigm: A structured approach to bringing the tools for sustainable development to all
>> stakeholders worldwide
>> www.projectparadigm.info
>> NGO-Opensource: Creating ICT tools for NGOs worldwide for Project Paradigm
>> www.ngo-opensource.org
>> MetaPortal: providing online access to web sites and repositories of data and information for sustainable
>> development
>> www.metaportal.info
>> SemanticWebSoftware, part of NGO-Opensource to enable SW technologies in the Metaportal project
>> www.semanticwebsoftware.info
>>
>>
>> --- On *Wed, 4/29/09, Kingsley Idehen /<kidehen@openlinksw.com>/* wrote:
>>
>>
>>     From: Kingsley Idehen <kidehen@openlinksw.com>
>>     Subject: Re: numeric data on the web, numeric web search
>>     To: "Semantics-ProjectParadigm" <metadataportals@yahoo.com>
>>     Cc: public-lod@w3.org, "Wolfgang Orthuber"
>>     <orthuber@kfo-zmk.uni-kiel.de>, "semantic-web" <semantic-web@w3.org>
>>     Date: Wednesday, April 29, 2009, 7:50 PM
>>
>>     Semantics-ProjectParadigm wrote:
>>     > See
>>     http://googleblog.blogspot.com/2009/04/adding-search-power-to-public-data.html.
>>     >
>>     > This is the first intent at making large amounts of data
>>     available in structured formats.
>>     >
>>     > Although it is not linked data in all conceivable formats from
>>     all sources on the web, the fact that the E-Government Act is
>>     forcing US federal agencies public data to make their data more
>>     accessible could be the push required to get linked data
>>     initiatives to the next level.
>>     >
>>     > Time for a Semantic Web/Linked Data lobby in DC to make funding
>>     available to expand to all public domains.\
>>     >
>>
>>     We can start here:
>>
>> http://www.thenationaldialogue.org/ideas/make-collecting-recovery-data-agile-using-semantic-web-technology
>> :-)
>>
>>
>>     Kingsley
>>     >
>>     > Milton Ponson
>>     > GSM: +297 747 8280
>>     > Rainbow Warriors Core Foundation
>>     > PO Box 1154, Oranjestad
>>     > Aruba, Dutch Caribbean
>>     > www.rainbowwarriors.net
>>     > Project Paradigm: A structured approach to bringing the tools
>>     for sustainable development to all stakeholders worldwide
>>     > www..projectparadigm.info
>>     > NGO-Opensource: Creating ICT tools for NGOs worldwide for
>>     Project Paradigm
>>     > www.ngo-opensource.org
>>     > MetaPortal: providing online access to web sites and
>>     repositories of data and information for sustainable development
>>     > www.metaportal.info
>>     > SemanticWebSoftware, part of NGO-Opensource to enable SW
>>     technologies in the Metaportal project
>>     > www.semanticwebsoftware.info
>>     >
>>     >
>>     > --- On *Wed, 4/29/09, Wolfgang Orthuber
>>     /<orthuber@kfo-zmk.uni-kiel.de
>>     </mc/compose?to=orthuber@kfo-zmk.uni-kiel.de>>/* wrote:
>>     >
>>     >
>>     >     From: Wolfgang Orthuber <orthuber@kfo-zmk.uni-kiel.de
>>     </mc/compose?to=orthuber@kfo-zmk.uni-kiel.de>>
>>     >     Subject: numeric data on the web, numeric web search
>>     >     To: public-lod@w3.org </mc/compose?to=public-lod@w3.org>
>>     >     Date: Wednesday, April 29, 2009, 3:25 PM
>>     >
>>     >     Hello!
>>     >          We know that quantifiable objects play a central role
>>     in daily
>>     >     life. Nevertheless up to now quantifiable objects have in
>>     general
>>     >     no well defined globally machine readable and precise
>>     >     representation on the web. The following concept proposes a
>>     simple
>>     >     data structure called "pattern" for such representation of
>>     >     quantifiable objects in general which also allows their
>>     similarity
>>     >     search:
>>     >     --------
>>     >          * Numeric web search *
>>     >          Web search is up to now word based. Additionally language
>>     >     independent similarity search of quantifiable objects is
>>     >     desirable. For well defined numeric representation of
>>     quantifiable
>>     >     objects a simple data structure called "pattern" is proposed,
>>     >     which contains a feature vector (a sequence of numbers) for
>>     >     representation of the object, and a "pattern name" which is
>>     a URI
>>     >     which uniquely identifies the kind of object which is
>>     represented
>>     >     by the feature vector.
>>     >          Pattern:                  Pattern name       +       feature
>>     >     vector          (+ auxilliary data)
>>     >          Patterns with the same pattern name represent the same
>>     kind of
>>     >     object. Because the number of possible pattern names is not
>>     >     limited*, infinitely* many different kinds of quantifiable
>>     objects
>>     >     can be represented by patterns.  (*only physically limited by
>>     >     finite time and energy)
>>     >          So the search terms are not words, but feature vectors
>>     in patterns
>>     >     which allow quantification of similarity. Feature vectors of
>>     >     patterns with the same pattern name are directly comparable
>>     using
>>     >     a given metric. At this similarities of the original
>>     quantifiable
>>     >     objects are mapped to spatial similarities of the feature
>>     vectors.
>>     >     So similarity search is possible by calculating distances:
>>     Objects
>>     >     are the more similar, the smaller the distance between the
>>     feature
>>     >     vectors of the representing patterns is.
>>     >          Due to the multitude of different kinds of quantifiable
>>     objects
>>     >     the work for development of efficient pattern resp. feature
>>     vector
>>     >     definitions for their representation is open ended. Global task
>>     >     sharing has the greatest potential: According to this suggestion
>>     >     every owner of an internet domain name abc.xyz gets the right to
>>     >     define feature vectors of all patterns with names abc.xyz/* (in
>>     >     well defined location abc.xyz/pat/*).
>>     >          Patterns are machine readable, uniformly comparable and
>>     >     searchable. They allow to search with the same search engine not
>>     >     only for text, but also for an increasing number of well-defined
>>     >     quantifiable objects on the web. This bundling of the search
>>     >     activity into one crawler and web database for all quantifiable
>>     >     objects is much more efficient than building and managing a
>>     >     database and a crawler for every kind of object.
>>     >          Numeric similarity search could be efficiently combined
>>     with
>>     >     conventional word based search. Details are described in
>>     >     http://www.orthuber.com/wpa.htm , don't hesitate to ask me
>>     further
>>     >     questions.
>>     >          --------------------
>>     >     It seems clear that introduction of the above conventions would
>>     >     have relevant advantages. Can this get support that we can
>>     step by
>>     >     step realize this?
>>     >          Regards
>>     >          Wolfgang Orthuber   (Mathematician and Orthodontist at
>>     University
>>     >     of Kiel / Germany)
>>     >
>>     >
>>
>>
>>     -- 
>>
>>     Regards,
>>
>>     Kingsley Idehen          Weblog:
>>     http://www.openlinksw.com/blog/~kidehen
>>     <http://www.openlinksw.com/blog/%7Ekidehen>
>>     President & CEO OpenLink Software     Web: http://www.openlinksw.com
>>
>>
>>
>>
>>
>
>
> -- 
>
>
> Regards,
>
> Kingsley Idehen       Weblog: http://www.openlinksw.com/blog/~kidehen
> President & CEO OpenLink Software     Web: http://www.openlinksw.com
>
>
>
>
>
>
Received on Thursday, 30 April 2009 10:02:06 UTC

This archive was generated by hypermail 2.3.1 : Sunday, 31 March 2013 14:24:20 UTC