- From: Michael Brunnbauer <brunni@netestate.de>
- Date: Mon, 3 Jan 2011 18:45:59 +0100
- To: William Waites <ww@styx.org>
- Cc: semantic-web@w3.org
re On Fri, Dec 24, 2010 at 11:34:48AM +0100, William Waites wrote: > For rdf-consuming robots it > really is better to look at the native version (via > content-negotiation or requesting ${uri}.rdf). In this case there are > about 3 million distinct graphs and if you crawl blindly you'll also > get another several million cbds for authors and publishers. At that > rate it may take several years for the crawl to finish... For a dataset of this size, we could make use of a dump in nquads format. Would it be possible to use less blank nodes ? A construct like <rdf:Description rdf:nodeID="b11127987"> <skos:notation>Black Swan</skos:notation> <owl:sameAs rdf:resource="http://bibliographica.org/entity/3683b388bab11b8e411049f7774ee2b7"/> ... </rdf:Description> in http://bnb.bibliographica.org/entry/GB98Z7613.rdf seems unnecessary to me. Regards, Michael Brunnbauer -- ++ Michael Brunnbauer ++ netEstate GmbH ++ Geisenhausener Straße 11a ++ 81379 München ++ Tel +49 89 32 19 77 80 ++ Fax +49 89 32 19 77 89 ++ E-Mail brunni@netestate.de ++ http://www.netestate.de/ ++ ++ Sitz: München, HRB Nr.142452 (Handelsregister B München) ++ USt-IdNr. DE221033342 ++ Geschäftsführer: Michael Brunnbauer, Franz Brunnbauer ++ Prokurist: Dipl. Kfm. (Univ.) Markus Hendel
Received on Monday, 3 January 2011 17:46:29 UTC