W3C home > Mailing lists > Public > public-semweb-lifesci@w3.org > February 2006

Re: Oracle Uniprot RDF data set and benchmarks

From: Susie Stephens <susie.stephens@oracle.com>
Date: Wed, 08 Feb 2006 09:26:52 -0500
Message-ID: <43E9FFAC.8070701@oracle.com>
To: public-semweb-lifesci@w3.org

I will find out more about the Uniprot subgraph that we used for the 
VLDB paper, and see if we can make it available.

However, I really like Eric Jain's offer of providing stable data sets 
of different sizes for benchmarking. It makes sense to me to have an 
independent organization providing the data sets.

Susie






Eric Miller wrote:

>
>
> On Feb 8, 2006, at 6:22 AM, Eric Jain wrote:
>
>>
>> Ian Wilson wrote:
>>
>>> We will thus want to maintain a local copy of this extract (on the  
>>> wiki?) so changes in the graph don't change the benchmarking results.
>>
>>
>> The data in http://www.isb-sib.ch/~ejain/rdf/data/ is indeed  updated 
>> every two weeks, but I could also provide some more stable  data sets 
>> for benchmarking if there is interest, perhaps with 1M,  10M and 100M 
>> triples?
>
>
> I think this would be extremely useful for a variety of communities  
> trying to assess issues of scalability; the more "connected" graphs  
> subsets for testing, the better.
>
> thanks in advance!
>
> -- 
> eric miller                              http://www.w3.org/people/em/
> semantic web activity lead               http://www.w3.org/2001/sw/
> w3c world wide web consortium            http://www.w3.org/
>
>
>
Received on Wednesday, 8 February 2006 14:27:05 GMT

This archive was generated by hypermail 2.3.1 : Tuesday, 26 March 2013 18:00:42 GMT