- From: Ted Thibodeau Jr <tthibodeau@openlinksw.com>
- Date: Fri, 20 Mar 2009 16:38:42 -0400
- To: Kingsley Idehen <kidehen@openlinksw.com>
- Cc: Knud Hinnerk Möller <knud.moeller@deri.org>, giovanni.tummarello@deri.org, Tom Heath <Tom.Heath@talis.com>, public-lod@w3.org
**Knud Hinnerk Möller wrote: >> - not having to worry about serving the data after you have >> produced it * Kingsley Idehen wrote: > Well I may be worried about "sweat and brow" amongst other things > and would like to be attributed via some kind of digital emblem > that serves such purposes i.e. URIs that are bound to a domain > I control. A minor language tweak... "Sweat and brow" should read "sweat of brow" here -- shorthand for "by the sweat of one's brow," meaning "by one's effort, by one's hard work" -- as Kingsley's point is about attribution. If some entity invests time, effort, money, and other resources in building a data-set, which has many URIs coined therein, and another entity then simply absorbs that data and replaces the original URIs with their own -- not with owl:sameAs, but with forward-chaining and potentially then backward-deleting -- then the original entity loses all credit for the "sweat of brow" that went into the original data set's creation. The original creator may not want monetary compensation -- perhaps all they are concerned with is reputation -- but that is equally lost if the URIs they coined are not preserved. Now, the data-set creators do need a namespace in which to coin their URIs. That might be through a service like purlz.net. It might be with a data-set base-URL, like dbpedia.org or bio2rdf.org. It might be through a vanity domain, ted.me (note -- this doesn't exist, but it should if I use it for URI coinage). The namespace *should* be in a real domain which the data-set creator controls, or at least which they realize is going to get "credit" (or blame) for the statements (triples) being made -- and the dog food server *might* be one such, in the cases immediately being discussed, where the URIs are being created explicitly so that data may be hosted there. But if I create a data-set that provides info about people with URIs in the dog food server's name-space, and I want credit for that data, I should mint my own URIs in my own name-space and sameAs-relate them back to the dog food server's name-space. And hopefully, when I then release my data-set under CC:Attribution or similar, my URIs will be preserved even when my data is integrated into the dog food or any other server. (And ... in best practice, I should release *2* data sets. One should be my statements about the people -- e.g., {<name:Bill%20Clinton> a <role:President> .}. The second should be my assertions of data-set inter-linkage -- e.g., {<name:Bill%20Clinton> <owl:sameAs> <http://dbpedia.org/resource/Bill_Clinton> .}. Blending these two into one data set is poor practice, as my descriptions of a person may be accurate (Senator, NY, Democrat), but my sameAs may be wrong Hillary, not Bill). It's easier to correct the interlinks -- or to use someone else's set of interlink assertions -- when they're in their own data-set.) Be seeing you, Ted -- A: Yes. http://www.guckes.net/faq/attribution.html | Q: Are you sure? | | A: Because it reverses the logical flow of conversation. | | | Q: Why is top posting frowned upon? Ted Thibodeau, Jr. // voice +1-781-273-0900 x32 Evangelism & Support // mailto:tthibodeau@openlinksw.com OpenLink Software, Inc. // http://www.openlinksw.com/ http://www.openlinksw.com/weblogs/uda/ OpenLink Blogs http://www.openlinksw.com/weblogs/virtuoso/ http://www.openlinksw.com/blog/~kidehen/ Universal Data Access and Virtual Database Technology Providers
Received on Friday, 20 March 2009 20:40:24 UTC