- From: Michael Brunnbauer <brunni@netestate.de>
- Date: Mon, 3 Jan 2011 18:45:59 +0100
- To: William Waites <ww@styx.org>
- Cc: semantic-web@w3.org
re
On Fri, Dec 24, 2010 at 11:34:48AM +0100, William Waites wrote:
> For rdf-consuming robots it
> really is better to look at the native version (via
> content-negotiation or requesting ${uri}.rdf). In this case there are
> about 3 million distinct graphs and if you crawl blindly you'll also
> get another several million cbds for authors and publishers. At that
> rate it may take several years for the crawl to finish...
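The two access paths mentioned in the quote can be sketched with the Python standard library. This is only an illustration: the helper names are hypothetical, the entry URI without the `.rdf` suffix is an assumption derived from the `${uri}.rdf` convention, and no request is actually sent (passing either object to `urllib.request.urlopen` would do that).

```python
from urllib.request import Request

RDF_XML = "application/rdf+xml"  # registered media type for RDF/XML


def negotiated_request(uri: str) -> Request:
    """Build a request that asks for RDF/XML via the Accept header."""
    return Request(uri, headers={"Accept": RDF_XML})


def suffix_request(uri: str) -> Request:
    """Build a request for the RDF document by appending '.rdf' to the URI."""
    return Request(uri.rstrip("/") + ".rdf")


# Assumed resource URI (the .rdf form of it appears later in this message).
entry = "http://bnb.bibliographica.org/entry/GB98Z7613"

req_a = negotiated_request(entry)  # content negotiation
req_b = suffix_request(entry)      # ${uri}.rdf
```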
For a dataset of this size, we could make use of a dump in N-Quads format.
Would it be possible to use fewer blank nodes? A construct like
<rdf:Description rdf:nodeID="b11127987">
  <skos:notation>Black Swan</skos:notation>
  <owl:sameAs rdf:resource="http://bibliographica.org/entity/3683b388bab11b8e411049f7774ee2b7"/>
  ...
</rdf:Description>
in http://bnb.bibliographica.org/entry/GB98Z7613.rdf seems unnecessary to me.
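One possible reading of this suggestion is that a blank node which is declared owl:sameAs a URI could simply be replaced by that URI. A minimal sketch of such a rewrite, assuming triples are held as plain string tuples with blank nodes written as `_:label` (the function names and this representation are illustrative, not part of any existing tool):

```python
OWL_SAMEAS = "http://www.w3.org/2002/07/owl#sameAs"
SKOS_NOTATION = "http://www.w3.org/2004/02/skos/core#notation"


def is_blank(term: str) -> bool:
    """Blank nodes are written as '_:label' in this sketch."""
    return term.startswith("_:")


def collapse_blank_nodes(triples):
    """Replace each blank node by the URI it is owl:sameAs, if there is one."""
    # Map every blank node to the first URI it is declared owl:sameAs.
    alias = {}
    for s, p, o in triples:
        if p == OWL_SAMEAS and is_blank(s) and not is_blank(o):
            alias.setdefault(s, o)

    result = []
    for s, p, o in triples:
        if p == OWL_SAMEAS and s in alias and o == alias[s]:
            continue  # the sameAs link is redundant after the rewrite
        result.append((alias.get(s, s), p, alias.get(o, o)))
    return result


entity = "http://bibliographica.org/entity/3683b388bab11b8e411049f7774ee2b7"
triples = [
    ("_:b11127987", SKOS_NOTATION, '"Black Swan"'),
    ("_:b11127987", OWL_SAMEAS, entity),
]
collapsed = collapse_blank_nodes(triples)
```

With the two statements from the example above, only one triple remains, and its subject is the entity URI instead of the blank node.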
Regards,
Michael Brunnbauer
--
++ Michael Brunnbauer
++ netEstate GmbH
++ Geisenhausener Straße 11a
++ 81379 München
++ Tel +49 89 32 19 77 80
++ Fax +49 89 32 19 77 89
++ E-Mail brunni@netestate.de
++ http://www.netestate.de/
++
++ Sitz: München, HRB Nr.142452 (Handelsregister B München)
++ USt-IdNr. DE221033342
++ Geschäftsführer: Michael Brunnbauer, Franz Brunnbauer
++ Prokurist: Dipl. Kfm. (Univ.) Markus Hendel
Received on Monday, 3 January 2011 17:46:29 UTC