- From: Lorenz Buehmann <buehmann@informatik.uni-leipzig.de>
- Date: Thu, 12 Jan 2023 08:56:58 +0100
- To: public-lod@w3.org
- Message-ID: <2929e5c1-7542-f88d-9894-b5fba22edc0d@informatik.uni-leipzig.de>
On 12.01.23 01:33, Dan Brickley wrote:
>
> Really cool! :)
>
> If anyone has eg student project possibilities, it would be great to
> see some work on Wikidata SPARQL query portability- eg working through
> the list at query.wikidata.org <http://query.wikidata.org>, which tend
> to look like this:
>
>
> SELECT ?item ?itemLabel
> WHERE
> {
>   ?item wdt:P31 wd:Q146. # Must be a cat
>   SERVICE wikibase:label { bd:serviceParam wikibase:language "[AUTO_LANGUAGE],en". }
>   # Helps get the label in your language, if not, then en language
> }
>
> which won’t work as-is outside of the current Wikidata SPARQL
> Blazegraph endpoint.
>
> Something like this is needed (with a filter for lang too):
>
>
> SELECT ?item ?itemLabel
> WHERE
> {
>   ?item wdt:P31 wd:Q146; rdfs:label ?itemLabel
> }
>
> I don’t recall where the Wikidata sample queries live (github? Wiki
> somewhere) but it would be lovely to hear if they could all run on an
> alternative backend…

There are plenty of those collections; one is a wiki page:

https://www.wikidata.org/wiki/Wikidata:SPARQL_query_service/queries/examples

And yes, there are queries that can't be run in their current form on
other triple stores. In particular, all queries that make SERVICE
requests to Wikidata's internal setup, such as the MWAPI, the label
service, the GAS API (graph traversal), or the geospatial extension,
can't be run outside of Blazegraph. Some of those could be (partially)
rewritten to standard SPARQL, but that might indeed lead to performance
issues.

For the queries with GAS, I doubt this can be replaced completely, not
even with property paths; other triple stores have their own graph
traversal implementations nowadays (e.g. Stardog, GraphDB, Virtuoso).

For the MWAPI, only the entity search feature could be rewritten, but
even then, SPARQL has no standard for efficient full-text search
(yet ... hope for SPARQL 1.2).

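To illustrate the kind of partial rewrite meant above, here is a minimal Python sketch that swaps the label service for plain rdfs:label patterns with a language filter. The names `replace_label_service`, `PREFIXES` and `LABEL_SERVICE` are made up for this example and are not part of any Wikidata tooling. It assumes the SERVICE clause contains no nested braces and that the last `}` in the query closes the outer WHERE group. Unlike the real label service, it does not fall back to other languages or to the Q-id when a label is missing.

```python
import re

# Prefixes the Wikidata endpoint predeclares; other stores may need them spelled out.
PREFIXES = (
    "PREFIX wd: <http://www.wikidata.org/entity/>\n"
    "PREFIX wdt: <http://www.wikidata.org/prop/direct/>\n"
    "PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>\n"
)

# A label SERVICE clause without nested braces, plus an optional trailing dot.
LABEL_SERVICE = re.compile(
    r"SERVICE\s+wikibase:label\s*\{[^{}]*\}\s*\.?", re.IGNORECASE
)


def replace_label_service(query: str, lang: str = "en") -> str:
    """Hypothetical helper: replace the label service with rdfs:label patterns."""
    stripped, count = LABEL_SERVICE.subn("", query)
    if count == 0:
        return query  # no label service used, leave the query untouched

    # Projected variables ending in "Label" are the ones the service would bind.
    head = stripped[: stripped.index("{")]
    label_vars = dict.fromkeys(re.findall(r"\?(\w+)Label\b", head))

    patterns = "".join(
        f'  OPTIONAL {{ ?{v} rdfs:label ?{v}Label . '
        f'FILTER(LANG(?{v}Label) = "{lang}") }}\n'
        for v in label_vars
    )

    # Insert the new patterns on their own lines before the final closing brace,
    # so a trailing "#" comment cannot swallow them.
    end = stripped.rindex("}")
    return PREFIXES + stripped[:end].rstrip() + "\n" + patterns + stripped[end:]


if __name__ == "__main__":
    cats = """SELECT ?item ?itemLabel
WHERE
{
  ?item wdt:P31 wd:Q146. # Must be a cat
  SERVICE wikibase:label { bd:serviceParam wikibase:language "[AUTO_LANGUAGE],en". }
}"""
    print(replace_label_service(cats))
```

The OPTIONAL keeps items without a label in the requested language in the result set, which is closer to what the label service does than a plain join would be. Whether the rewritten query performs acceptably on a given store is a separate question.
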
The spatial extension should be replaceable with GeoSPARQL, and then we
have to hope that the triple stores provide full GeoSPARQL support.

Lorenz

>
> Dan
>
>
> On Wed, 11 Jan 2023 at 15:52, Kingsley Idehen via Wikidata
> <wikidata@lists.wikimedia.org> wrote:
>
> > All,
> >
> > We are pleased to announce immediate availability of a new
> > Virtuoso-hosted Wikidata instance based on the most recent
> > datasets. This instance comprises 17 billion+ RDF triples.
> >
> > Host Machine Info:
> >
> > Item    Value
> > CPU     2x Intel(R) Xeon(R) CPU E5-2630 0 @ 2.30GHz
> > Cores   24
> > Memory  378 GB
> > SSD     4x Crucial M4 SSD 500 GB
> >
> > Cloud related costs for a self-hosted variant, assuming:
> >
> >   * dedicated machine for 1 year without upfront costs
> >   * 128 GiB memory
> >   * 16 cores or more
> >   * 512GB SSD for the database
> >   * 3T outgoing internet traffic (based on our DBpedia statistics)
> >
> > vendor  machine type  memory   vCPUs  monthly machine  monthly disk  monthly network  monthly total
> > Amazon  r5a.4xlarge   128 GiB  16     $479.61          $55.96        $276.48          $812.05
> > Google  e2highmem-16  128 GiB  16     $594.55          $95.74        $255.00          $945.30
> > Azure   D32a          128 GiB  32     $769.16          $38.40        $252.30          $1,060.06
> >
> > SPARQL Query and Full Text Search service endpoints:
> >
> >   * https://wikidata.demo.openlinksw.com/sparql -- SPARQL Query
> >     Services Endpoint
> >   * https://wikidata.demo.openlinksw.com/fct -- Faceted Search &
> >     Browsing
> >
> > Additional Information
> >
> >   * Loading the Wikidata dataset 2022/12 into Virtuoso Open Source
> >     - Announcements - OpenLink Software Community (openlinksw.com)
> >     <https://community.openlinksw.com/t/loading-the-wikidata-dataset-2022-12-into-virtuoso-open-source/3580>
> >
> > Happy New Year!
> >
> > --
> > Regards,
> >
> > Kingsley Idehen
> > Founder & CEO
> > OpenLink Software
> > Home Page: http://www.openlinksw.com
> > Community Support: https://community.openlinksw.com
> > Weblogs (Blogs):
> > Company Blog: https://medium.com/openlink-software-blog
> > Virtuoso Blog: https://medium.com/virtuoso-blog
> > Data Access Drivers Blog: https://medium.com/openlink-odbc-jdbc-ado-net-data-access-drivers
> >
> > Personal Weblogs (Blogs):
> > Medium Blog: https://medium.com/@kidehen
> > Legacy Blogs: http://www.openlinksw.com/blog/~kidehen/
> >               http://kidehen.blogspot.com
> >
> > Profile Pages:
> > Pinterest: https://www.pinterest.com/kidehen/
> > Quora: https://www.quora.com/profile/Kingsley-Uyi-Idehen
> > Twitter: https://twitter.com/kidehen
> > Google+: https://plus.google.com/+KingsleyIdehen/about
> > LinkedIn: http://www.linkedin.com/in/kidehen
> >
> > Web Identities (WebID):
> > Personal: http://kingsley.idehen.net/public_home/kidehen/profile.ttl#i
> >         : http://id.myopenlink.net/DAV/home/KingsleyUyiIdehen/Public/kingsley.ttl#this
> >
> > _______________________________________________
> > Wikidata mailing list -- wikidata@lists.wikimedia.org
> > Public archives at
> > https://lists.wikimedia.org/hyperkitty/list/wikidata@lists.wikimedia.org/message/TI7U5Q6ZBEEPCNSTZ2KYLEXEDO4E4GMG/
> > To unsubscribe send an email to wikidata-leave@lists.wikimedia.org
>
Received on Thursday, 12 January 2023 07:58:00 UTC