Re: [WN] Fwd: WordNet Namespace

Jacco van Ossenbruggen wrote:
> 
> Jeremy Carroll wrote:
> 
>> How big is the file if you use hash URIs?
> 
> I'm not sure we mean the same thing by hash URIs.  I was talking about 
> hash URIs as in http://wordnet.princeton.edu/rdf#entity versus non-hash 
> URIs as in http://wordnet.princeton.edu/rdf/entity.  

Me too.


> I do not see how 
> the difference relates to file size.  


Let's suppose we have 150MB of data.

If we have http://wordnet.princeton.edu/rdf#entity then we have one file 
of 150MB. If we want to look up this URI, we have to download 150MB from 
http://wordnet.princeton.edu/rdf and then parse it and find the triples 
concerning http://wordnet.princeton.edu/rdf#entity

If we have http://wordnet.princeton.edu/rdf/entity then we may have say 
50,000 files each of 4KB (notice this is somewhat more in total, say 200MB).

If we want to lookup http://wordnet.princeton.edu/rdf/entity then we 
download 4KB.

The latter is much more practical.

Jeremy




> BTW, the current files are using 
> hash URIs:
> 
> <?xml version='1.0' encoding='UTF-8'?>
> <!DOCTYPE rdf:RDF [
>    <!ENTITY wn 'http://wordnet.princeton.edu/wn#'>
>    <!ENTITY rdf 'http://www.w3.org/1999/02/22-rdf-syntax-ns#'>
>    <!ENTITY rdfs 'http://www.w3.org/2000/01/rdf-schema#'>
> ]>
> 
> <rdf:RDF  xmlns:wn="&wn;" xmlns:rdf="&rdf;" xmlns:rdfs="&rdfs;" 
> xml:lang="en">
> <wn:Word rdf:about="&wn;entity"
>    wn:lexicalForm="entity">
>  <wn:sense rdf:resource="&wn;entity-n-1"/>
> </wn:Word>
> 
> I do not see how you can safe space by changing the URIs.
> 
>> Isn't the size a showstopper for hashes in this application?
>>
>> I had always believed that this was one of the primary examples why 
>> hash URIs were not the only true way.
> 
> I'm missing something here.  Can you elaborate on this?
> 
> Thanks, Jacco
> 
> 

Received on Tuesday, 13 December 2005 14:14:46 UTC