Re: Size estimates of current LS space

Eric Neumann wrote:
> As per today's Telcon, does any person with genomics knowledge (that 
> includes you too Carole) have estimates for the following numbers:
> 
> 1. How many bio-molecular and organism-anatomical-functional entities 
> and records (broad sense) are currently accessible through the web 
> (excluding LIMS entities, such as samples, for now)?

In theory there are about 45M unique LSIDs in UniProt (27M of which 
represent non-UniProt resources). But note that few of these resources 
follow the rules (such as not changing), and none of these identifiers can 
at the moment be resolved with the official mechanism.


> 2. Does this number grow substantially when it is allowed to include 
> every variant of protein, gene, etc. per species (i.e., not instances of 
> real molecules or organisms)?

The number would increase substantially if you also counted obsolete 
versions (though only few databases have version numbers and support 
retrieval of obsolete versions).

Received on Friday, 4 August 2006 02:18:35 UTC