- From: Peter DeVries <pete.devries@gmail.com>
- Date: Sat, 18 Sep 2010 15:27:33 -0500
- To: Richard Cyganiak <richard@cyganiak.de>, public-lod@w3.org
- Message-ID: <AANLkTikv5HS7xVHo9xc_U-i_H2i+wyQCUMO-=8cKYLU=@mail.gmail.com>
Thanks Richard,
I have been wondering whether it make sense to subproperty one of the
existing predicates so that there is a distinction between a web page and a
full blown rdf resource.
This would be useful in my work and maybe in others?
I have also been thinking about how to deal with images.
You often want to distinguish between something that is a "confirmed image
of" vs. what someone thinks the image is of.
In terms of species concepts, you want to have a set of representative
images for that concept, but allow others to "tag" their images as
depictions of that concept.
A simple google search for a given scientific name brings up all sorts of
poorly matched images and I suspect that the same might happen with linked
data.
Thanks Again,
- Pete
On Sat, Sep 18, 2010 at 1:42 PM, Richard Cyganiak <richard@cyganiak.de>wrote:
> Peter,
>
>
> On 17 Sep 2010, at 20:48, Peter DeVries wrote:
>
>> I created the SPARQL query below for the TaxonConcept Knowledge Base.
>>
>> It is based on the earlier one posted by Richard Cyganiak.
>>
>> I looked through my RDF for predicates that have in and out links to other
>> data sets.
>>
>> It is not clear to me how to count basic web pages that are not really RDF
>> resources.
>>
>
> I don't think SPARQL has any easy way of distinguishing wether the target
> of a link is “just” a web page or a full-blown RDF resource.
>
>
> Also where in the CKAN description do you differentiate between in links
>> and
>> out links?
>>
>
> An outlink in our parlance is any triple that's hosted on your site where
> the one resource is in your namespace and the other is in someone else's
> namespace. It doesn't matter which resource is in the subject or object
> position.
>
> An inlink is a triple that uses one of your URIs in the subject or object
> position, but is hosted by someone else, in another dataset.
>
> The CKAN record for your dataset only records the outlinks of your dataset.
>
> We find the inlinks by looking at all other CKAN records and see if any of
> them reference your dataset.
>
> Best,
> Richard
>
>
> I am posting the query and results here so others might benefit from them
>> or
>> inform me of something I may be doing incorrectly.
>>
>> Below is the query, after that follows the results as text and I have also
>> attached a .png of the Virtuoso iSPARQL results.
>>
>> - Pete
>> *
>> *
>> PREFIX owl: <http://www.w3.org/2002/07/owl#>
>> PREFIX skos: <http://www.w3.org/2004/02/skos/core#>
>> PREFIX txn: <http://lod.taxonconcept.org/ontology/txn.owl#>
>> PREFIX foaf: <http://xmlns.com/foaf/0.1/>
>> PREFIX umbel: <http://umbel.org/umbel#>
>>
>> SELECT ?domain_s ?domain_o (COUNT(*) AS ?count)
>> WHERE {
>> {
>> SELECT (bif:regexp_substr("http://([^/]*)", STR(?s), 1) AS
>> ?domain_s)
>> (bif:regexp_substr("http://([^/]*)", STR(?o), 1) AS ?domain_o)
>> WHERE {
>> { ?s owl:sameAs ?o }
>> UNION
>> { ?s skos:exactMatch ?o }
>> UNION
>> { ?s skos:broadMatch ?o }
>> UNION
>> { ?s skos:narrowMatch ?o }
>> UNION
>> { ?s skos:relatedMatch ?o }
>> UNION
>> { ?s skos:closeMatch ?o }
>> UNION
>> { ?s txn:speciesConceptHasSpeciesNameString ?o }
>> UNION
>> { ?s txn:speciesNameStringHasSpeciesTaxonConcept ?o }
>> UNION
>> { ?s txn:speciesConceptHasBasionymNameString ?o }
>> UNION
>> { ?s txn:basionymNameStringHasSpeciesTaxonConcept ?o }
>> UNION
>> { ?s txn:hasPDFVersion ?o }
>> UNION
>> { ?s txn:hasAuthorURI ?o }
>> UNION
>> { ?s foaf:page ?o }
>> UNION
>> { ?s foaf:topic ?o }
>> UNION
>> { ?s txn:inDBpediaClade ?o }
>> UNION
>> { ?s txn:occurrenceInContinent ?o }
>> UNION
>> { ?s txn:occurrenceInStateProvince ?o }
>> UNION
>> { ?s txn:occurrenceInCounty ?o }
>> UNION
>> { ?s txn:isExpectedIn ?o }
>> UNION
>> { ?s txn:hasExpectationOf ?o }
>> UNION
>> { ?s txn:isUnknownAboutIn ?o }
>> UNION
>> { ?s txn:hasUnknownExpectationOf ?o }
>> UNION
>> { ?s txn:isUnexpectedIn ?o }
>> UNION
>> { ?s txn:hasUnknownExpectationOf ?o }
>> }
>> }
>> }
>> GROUP BY ?domain_s ?domain_o
>> *
>> *
>> *==============================*
>> *
>> *
>> *domain_s** **domain_o** **count*
>> lod.geospecies.org lod.taxonconcept.org 71757
>> www.uniprot.org lod.taxonconcept.org 23427
>> bio2rdf.org lod.taxonconcept.org 23427
>> dbpedia.org lod.taxonconcept.org 18849
>> eunis.eea.europa.eu lod.taxonconcept.org 2987
>> www.bbc.co.uk lod.taxonconcept.org 318
>> lod.taxonconcept.org lod.geospecies.org 71756
>> lod.taxonconcept.org www.uniprot.org 23427
>> lod.taxonconcept.org bio2rdf.org 23656
>> lod.taxonconcept.org dbpedia.org 95208
>> lod.taxonconcept.org eunis.eea.europa.eu 5974
>> lod.taxonconcept.org www.bbc.co.uk 636
>> rdf.freebase.com lod.taxonconcept.org 119
>> lod.taxonconcept.org 72
>> lod.taxonconcept.org rdf.freebase.com 119
>> lod.taxonconcept.org 24900
>> sw.opencyc.org lod.taxonconcept.org 24
>> lod.taxonconcept.org sw.opencyc.org 24
>> lod.taxonconcept.org gni.globalnames.org 73329
>> gni.globalnames.org lod.taxonconcept.org 73330
>> lod.taxonconcept.org www.americanarachnology.org 1
>> lod.taxonconcept.org assets.geospecies.org 3
>> lod.taxonconcept.org www.itis.gov 42100
>> lod.taxonconcept.org data.gbif.org 1154
>> lod.taxonconcept.org en.wikipedia.org 18849
>> lod.taxonconcept.org species.wikimedia.org 9328
>> lod.taxonconcept.org www.eol.org 579
>> lod.taxonconcept.org www.boldsystems.org 122
>> lod.taxonconcept.org www.catalogueoflife.org 53
>> lod.taxonconcept.org bugguide.net 3297
>> lod.taxonconcept.org lod.taxonconcept.org 287048
>> assets.geospecies.org media.geospecies.org 5
>> lod.taxonconcept.org mushroomobserver.org 5
>> assets.geospecies.org lod.geospecies.org 10
>> assets.geospecies.org lod.taxonconcept.org 1
>> static.flickr.com www.flickr.com 33
>> bugguide.net lod.taxonconcept.org 3297
>> media.geospecies.org lod.taxonconcept.org 19
>> ocs.geospecies.org lod.taxonconcept.org 26
>> media.geospecies.org dbpedia.org 14
>> assets.geospecies.org dbpedia.org 1
>> media.geospecies.org lod.geospecies.org 37
>> mushroomobserver.org lod.taxonconcept.org 5
>> media.geospecies.org media.geospecies.org 29
>> ocs.geospecies.org ocs.geospecies.org 53
>> media.geospecies.org assets.geospecies.org 15
>> media.geospecies.org static.flickr.com 2
>> mushroomobserver.org mushroomobserver.org 3
>> mushroomobserver.org dbpedia.org 1
>> ocs.geospecies.org sws.geonames.org 39
>> lod.taxonconcept.org sws.geonames.org 234792
>> sws.geonames.org lod.taxonconcept.org 128529
>> --
>>
>>
>>
>> ----------------------------------------------------------------
>> Pete DeVries
>> Department of Entomology
>> University of Wisconsin - Madison
>> 445 Russell Laboratories
>> 1630 Linden Drive
>> Madison, WI 53706
>> TaxonConcept Knowledge Base <http://www.taxonconcept.org/> / GeoSpecies
>> Knowledge Base <http://lod.geospecies.org/>
>> About the GeoSpecies Knowledge Base <http://about.geospecies.org/>
>> ------------------------------------------------------------
>> <interlinking_capture.png>
>>
>
>
--
----------------------------------------------------------------
Pete DeVries
Department of Entomology
University of Wisconsin - Madison
445 Russell Laboratories
1630 Linden Drive
Madison, WI 53706
TaxonConcept Knowledge Base <http://www.taxonconcept.org/> / GeoSpecies
Knowledge Base <http://lod.geospecies.org/>
About the GeoSpecies Knowledge Base <http://about.geospecies.org/>
------------------------------------------------------------
Received on Saturday, 18 September 2010 20:28:08 UTC