- From: Hugh Williams <hwilliams@openlinksw.com>
- Date: Wed, 6 Jul 2016 14:14:45 +0100
- To: Jean-Claude Moissinac <jean-claude.moissinac@telecom-paristech.fr>
- Cc: public-lod <public-lod@w3.org>
- Message-Id: <97FE78CA-F44B-4679-BDD8-4A01DF766195@openlinksw.com>
Hi Jean-Claude,
The query "select count(distinct ?r) where { ?r ?p ?l }" is expensive in terms of database resources: servicing it requires building a huge hash table, which is causing it to time out under the settings configured on the instance by whoever maintains it.
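To illustrate the cost difference, here is a minimal Python sketch. It is not how the database engine is actually implemented; the function names and the sample triples are hypothetical. A plain count only needs one running integer, while a distinct count has to remember every subject seen so far, so its memory grows with the number of distinct resources.

```python
from typing import Iterable, Tuple

Triple = Tuple[str, str, str]  # (subject, predicate, object)


def count_rows(triples: Iterable[Triple]) -> int:
    """Analogue of count(?r): a single counter, no per-subject state."""
    total = 0
    for _ in triples:
        total += 1
    return total


def count_distinct_subjects(triples: Iterable[Triple]) -> int:
    """Analogue of count(distinct ?r): keeps a hash set of every subject."""
    seen = set()  # grows with the number of distinct subjects
    for subject, _predicate, _obj in triples:
        seen.add(subject)
    return len(seen)


# Hypothetical sample data, for illustration only.
sample = [
    ("ex:a", "ex:p", "1"),
    ("ex:a", "ex:q", "2"),
    ("ex:b", "ex:p", "3"),
]
rows = count_rows(sample)
distinct = count_distinct_subjects(sample)
```

On a dataset with roughly 185 million matching rows, the set in the second function is the part that becomes expensive.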
On http://dbpedia.org/sparql, the original canonical English DBpedia endpoint that OpenLink Software hosts, we provide preloaded VoID datasets so that they don't have to be queried each time; see http://dbpedia.org/void/Dataset . The French DBpedia instance does not appear to have this, i.e. http://fr.dbpedia.org/void/Dataset
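As a rough sketch of how such precomputed statistics could be read instead of counted, the Python snippet below builds a SPARQL protocol request URL that asks for void:distinctSubjects. The helper name build_void_request is hypothetical, and it is an assumption that the endpoint exposes its VoID description in a graph this query can reach; the snippet only builds the URL and does not contact any server.

```python
from urllib.parse import urlencode

# void:distinctSubjects is a standard VoID vocabulary term; whether a given
# endpoint publishes it in a queryable graph is an assumption.
VOID_QUERY = """
PREFIX void: <http://rdfs.org/ns/void#>
SELECT ?dataset ?subjects WHERE {
  ?dataset a void:Dataset ;
           void:distinctSubjects ?subjects .
}
"""


def build_void_request(endpoint: str, query: str = VOID_QUERY) -> str:
    """Return a GET URL carrying the query in the standard 'query' parameter."""
    return endpoint + "?" + urlencode({"query": query})


url = build_void_request("http://dbpedia.org/sparql")
```

The resulting URL can be fetched with any HTTP client; a lookup like this reads a stored figure rather than scanning all triples.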
Best Regards
Hugh Williams
Professional Services
OpenLink Software, Inc. // http://www.openlinksw.com/
Weblog -- http://www.openlinksw.com/blogs/
LinkedIn -- http://www.linkedin.com/company/openlink-software/
Twitter -- http://twitter.com/OpenLink
Google+ -- http://plus.google.com/100570109519069333827/
Facebook -- http://www.facebook.com/OpenLinkSoftware
Universal Data Access, Integration, and Management Technology Providers
> On 6 Jul 2016, at 12:49, Jean-Claude Moissinac <jean-claude.moissinac@telecom-paristech.fr> wrote:
>
> Hello
>
> In my work, I need to know the number of distinct resources in a dataset.
> For example, with dbpedia-fr, I'm trying
> select count(distinct ?r) where { ?r ?p ?l }
>
> And I'm always getting a timeout error message
> While with
> select count(?r) where { ?r ?p ?l }
> I'm getting
> 185404575
>
> Is it a good way to know about such size?
>
> --
> Jean-Claude Moissinac
>
Received on Wednesday, 6 July 2016 13:15:12 UTC