Re: Potential Home for LOD Data Sets

Juan, All,

On Fri, Dec 5, 2008 at 3:03 AM, Juan Sequeda <juanfederico@gmail.com> wrote:
> There is something that I don't understand and would really appreciate an
> explanation.
>
> I understand this opportunity and need for storing LOD data sets on Amazon
> (scalability, efficiency, etc). But is this the way LOD will be forever? For
> now on? Is all LOD going to be stored here? Who is going to pay for this?

*Amazon* is going to pay for storing and keeping a handy set of
"public linked data sets" ESBs.

Then, if you want to setup some triplestore to host the CIA world
factbook + {you name it}, this will eventually save you the cost of
uploading the data to your AMI: time and bandwidth.

OTOH. Putting some RDF datasets in that repo will only increase
awareness of the vast amount of RDF-enabled data.
And that can't be bad.
And remember these audience is already segmented ( they are
"developers looking for public data!" ) and potentially large.

The Amazon has plenty of fish in it.

Best,
A

>
> Just the way the Web of Documents is, where each individual stores and
> shares their documents however they want (own servers, cloud on Amazon,
> etc), shouldn't the Web of Data be doing the same thing? What am I missing
> here?
>
> Thanks for the insight!
>
> Juan Sequeda, Ph.D Student
>
> Research Assistant
> Dept. of Computer Sciences
> The University of Texas at Austin
> http://www.cs.utexas.edu/~jsequeda
> jsequeda@cs.utexas.edu
>
> http://www.juansequeda.com/
>
> Semantic Web in Austin: http://juansequeda.blogspot.com/
>
>
> On Thu, Dec 4, 2008 at 11:28 PM, रविंदर ठाकुर (ravinder thakur)
> <ravinderthakur@gmail.com> wrote:
>>
>> my suggestion is that lets just collect few 100$  (10$ each ?) and
>> purchase a EC2 machine upload it with _all_ semantic data, run a sparql
>> endpoint on it and keep it running for everyone's use.
>>
>>
>>
>> On Fri, Dec 5, 2008 at 10:50 AM, Kingsley Idehen <kidehen@openlinksw.com>
>> wrote:
>>>
>>> Hugh Glaser wrote:
>>>>
>>>> Thanks for the swift response!
>>>> I'm still puzzled - sorry to be slow.
>>>> http://aws.amazon.com/publicdatasets/#2
>>>> Says:
>>>> Amazon EC2 customers can access this data by creating their own personal
>>>> Amazon EBS volumes, using the public data set snapshots as a starting point.
>>>> They can then access, modify and perform computation on these volumes
>>>> directly using their Amazon EC2 instances and just pay for the compute and
>>>> storage resources that they use.
>>>>  Does this not mean it costs me money on my EC2 account? Or is there
>>>> some other way of accessing the data? Or am I looking at the wrong bit?
>>>>
>>>
>>> Okay, I see what I overlooked: the cost of paying for an AMI that mounts
>>> these EBS volumes, even though Amazon is charging $0.00 for uploading these
>>> huge amounts of data where it would usually charge.
>>>
>>> So to conclude, using the loaded data sets isn't free, but I think we
>>> have to be somewhat appreciative of a value here, right? Amazon is providing
>>> a service that is ultimately pegged to usage (utility model), and the usage
>>> comes down to value associated with that scarce resource called time.
>>>>
>>>> Ie Can you give me a clue how to get at the data without using my credit
>>>> card please? :-)
>>>>
>>>
>>> You can't you will need someone to build an EC2 service for you and eat
>>> the costs on your behalf. Of course such a service isn't impossible in a
>>> "Numerati" [1] economy, but we aren't quite there yet, need the Linked Data
>>> Web in place first :-)
>>>
>>> Links:
>>>
>>> 1. http://tinyurl.com/64gsan
>>>
>>> Kingsley
>>>>
>>>> Best
>>>> Hugh
>>>>
>>>> On 05/12/2008 02:28, "Kingsley Idehen" <kidehen@openlinksw.com> wrote:
>>>>
>>>>
>>>>
>>>> Hugh Glaser wrote:
>>>>
>>>>>
>>>>> Exciting stuff, Kingsley.
>>>>> I'm not quite sure I have worked out how I might use it though.
>>>>> The page says that hosting data is clearly free, but I can't see how to
>>>>> get at it without paying for it as an EC2 customer.
>>>>> Is this right?
>>>>> Cheers
>>>>>
>>>>>
>>>>
>>>> Hugh,
>>>>
>>>> No, shouldn't cost anything if the LOD data sets are hosted in this
>>>> particular location :-)
>>>>
>>>>
>>>> Kingsley
>>>>
>>>>>
>>>>> Hugh
>>>>>
>>>>>
>>>>> On 01/12/2008 15:30, "Kingsley Idehen" <kidehen@openlinksw.com> wrote:
>>>>>
>>>>>
>>>>>
>>>>> All,
>>>>>
>>>>> Please see: <http://aws.amazon.com/publicdatasets/> ; potentially the
>>>>> final destination of all published RDF archives from the LOD cloud.
>>>>>
>>>>> I've already made a request on behalf of LOD, but additional requests
>>>>> from the community will accelerate the general comprehension and
>>>>> awareness at Amazon.
>>>>>
>>>>> Once the data sets are available from Amazon, database constructions
>>>>> costs will be significantly alleviated.
>>>>>
>>>>> We have DBpedia reconstruction down to 1.5 hrs (or less) based on
>>>>> Virtuoso's in-built integration with Amazon S3 for backup and
>>>>> restoration etc..  We could get the reconstruction of the entire LOD
>>>>> cloud down to some interesting numbers once all the data is situated in
>>>>> an Amazon data center.
>>>>>
>>>>>
>>>>> --
>>>>>
>>>>>
>>>>> Regards,
>>>>>
>>>>> Kingsley Idehen       Weblog: http://www.openlinksw.com/blog/~kidehen
>>>>> President & CEO
>>>>> OpenLink Software     Web: http://www.openlinksw.com
>>>>>
>>>>>
>>>>>
>>>>>
>>>>>
>>>>>
>>>>>
>>>>>
>>>>>
>>>>
>>>>
>>>> --
>>>>
>>>>
>>>> Regards,
>>>>
>>>> Kingsley Idehen       Weblog: http://www.openlinksw.com/blog/~kidehen
>>>> President & CEO
>>>> OpenLink Software     Web: http://www.openlinksw.com
>>>>
>>>>
>>>>
>>>>
>>>>
>>>>
>>>>
>>>>
>>>
>>>
>>> --
>>>
>>>
>>> Regards,
>>>
>>> Kingsley Idehen       Weblog: http://www.openlinksw.com/blog/~kidehen
>>> President & CEO OpenLink Software     Web: http://www.openlinksw.com
>>>
>>>
>>>
>>>
>>>
>>
>
>



-- 
Aldo Bucchi
U N I V R Z
Office: +56 2 795 4532
Mobile:+56 9 7623 8653
skype:aldo.bucchi
http://www.univrz.com/
http://aldobucchi.com

PRIVILEGED AND CONFIDENTIAL INFORMATION
This message is only for the use of the individual or entity to which it is
addressed and may contain information that is privileged and confidential. If
you are not the intended recipient, please do not distribute or copy this
communication, by e-mail or otherwise. Instead, please notify us immediately by
return e-mail.
INFORMACIÓN PRIVILEGIADA Y CONFIDENCIAL
Este mensaje está destinado sólo a la persona u organización al cual está
dirigido y podría contener información privilegiada y confidencial. Si usted no
es el destinatario, por favor no distribuya ni copie esta comunicación, por
email o por otra vía. Por el contrario, por favor notifíquenos inmediatamente
vía e-mail.

Received on Friday, 5 December 2008 06:38:04 UTC