
Re: Low Quality Data (was before Re: AW: ANN: LOD Cloud - Statistics and compliance with best practices)

From: Kingsley Idehen <kidehen@openlinksw.com>
Date: Mon, 25 Oct 2010 19:39:37 -0400
Message-ID: <4CC61539.5060003@openlinksw.com>
To: Christian Fuerber <c.fuerber@unibw.de>
CC: juanfederico@gmail.com, public-lod@w3.org, martin.hepp@ebusiness-unibw.org
On 10/25/10 3:17 PM, Christian Fuerber wrote:
> Hi Juan,
> I have just published a post on data quality defining sources in response to
> Leigh's question at
> http://www.semanticoverflow.com/questions/1072/quality-indicators-for-linked-data-datasets
> Maybe this additionally helps you to find out what data quality really is.
> Cheers,
> Christian
> ------------------------------------------
> Dipl.-Kfm. Christian Fürber
> Professur für Allgemeine BWL, insbesondere E-Business
> e-business & web science research group
> Universität der Bundeswehr München
> e-mail: c.fuerber@unibw.de
> www:   http://www.unibw.de/ebusiness/
> homepage: http://www.fuerber.com/
> skype: c.fuerber
> twitter: cfuerber


No matter how you cut it, this matter is inherently subjective, as is 
every comment I make below:

We have to understand and accept that heterogeneity is a fact of life 
that is magnified by the Web.

In the real world we coalesce around "world views" and their subjective 

You can never explicitly deem one data space, or the data sets it hosts, 
canonically high or low quality. Of course, said data sets or host data 
spaces may or may not appropriately serve a specific data-driven need 
for a human, humans, agents, or a collection of agents working on 
behalf of humans.

There is nothing wrong with constraints that serve the needs of a 
specific data-driven task; we just can't deem any subjective criteria 
canonical re. data quality in a general sense.

One person's Spam is another person's Ham. Such is the case in the 
real world, and so it shall remain re. the Web of Linked Data. Context 
is king!

IMHO, the beauty of the Web of Linked Data lies in our ability to "agree to 
disagree" without shedding an ounce of blood. Basically, we arrive at 
deeper insights via true exploitation of gestalt -- which doesn't 
require imposition of absolute truth on anyone. Heterogeneity is the 
spice of life. We are inherently imperfect by design.



Kingsley Idehen
President & CEO
OpenLink Software
Web: http://www.openlinksw.com
Weblog: http://www.openlinksw.com/blog/~kidehen
Twitter/Identi.ca: kidehen
Received on Monday, 25 October 2010 23:40:20 UTC
