W3C home > Mailing lists > Public > public-schemaorg@w3.org > September 2016

Re: querying data defined with schema.org

From: Stefan Dietze <stefan.dietze@gmx.de>
Date: Tue, 20 Sep 2016 12:34:27 +0200
To: Ahmet Yıldırım <me@ahmetyildirim.com.tr>, Timothy Holborn <timothy.holborn@gmail.com>
Cc: Raphaël Troncy <raphael.troncy@eurecom.fr>, "schema.org Mailing List" <public-schemaorg@w3.org>
Message-ID: <002adc02-c724-97e0-5d0b-b6e76094d990@gmx.de>
If I understand you correct, what you are looking for is something like 
the Glimmer search engine created by Yahoo Research on top of the markup 
data from the Web Data Commons (webdatacommons.org). An earlier 
announcement about Glimmer from Peter (Mika) is at [1]. Unfortunately 
the actual service [2] seems down since a few days I am afraid. It's of 
course not a search engine across the entire Web, but operates on a 
reasonable crawl, that is the WDC data (extracted from the Common Crawl).

To my knowledge there's no other publicly available service of that sort 
atm, though we are working on some data fusion approaches on 
markup/schema.org data which might result in some public services at 
some point.

Stefan


[1] https://lists.w3.org/Archives/Public/public-vocabs/2013Jun/0094.html
[2] http://glimmer.research.yahoo.com/

On 20.09.2016 12:07, Ahmet Yıldırım wrote:
> Thanks for the answers.
>
> Actually I want to query all the data in all over the world (that
> requires indexing data like google does) using SPARQL or another language.
>
> From the links, what is close as an answer to my question seems
>  https://github.com/schemaorg/schemaorg/wiki/BlazeGraphSPARQLHowto
> But this requires data loaded in advance.
>
> In summary, I want a search engine type of service which allows querying
> all data defined using schema.org <http://schema.org> in all over the
> world. I now this service is a bit hard to manage and implement but just
> taking a chance on this and asking. May be there is a service you know.
>
>
> On Mon, Sep 19, 2016 at 8:37 PM, Timothy Holborn
> <timothy.holborn@gmail.com <mailto:timothy.holborn@gmail.com>> wrote:
>
>
>
>     On Tue., 20 Sep. 2016, 3:19 am Raphaël Troncy,
>     <raphael.troncy@eurecom.fr <mailto:raphael.troncy@eurecom.fr>> wrote:
>
>         Dear Timothy, all,
>
>         > did another search.
>         > https://doremus-anr.github.io/schema-visualizer/
>         <https://doremus-anr.github.io/schema-visualizer/>  is useful.
>
>         Thanks for the spoiler :-) I wanted to make an official
>         advertisement
>         later on ... but well ! For the ones who are interested, the
>         code is at
>         https://github.com/DOREMUS-ANR/schema-visualizer/
>         <https://github.com/DOREMUS-ANR/schema-visualizer/>. This is
>         developed in
>         the context of the French DOREMUS project, http://doremus.org/,
>         which
>         aims to represent and interlink music metadata coming from the main
>         digital libraries in France.
>
>         Our Visualizer demo aims to display rich snippets for *some*
>         schema.org <http://schema.org>
>         types that are useful for us, namely, the Event type (and
>         EventSeries)
>         and the CreativeWork branch in schema.org <http://schema.org>
>         (some sub-types).
>
>         Examples are
>         https://doremus-anr.github.io/schema-visualizer/#?id=57ad9cc4c2ef16159e54e36d
>         <https://doremus-anr.github.io/schema-visualizer/#?id=57ad9cc4c2ef16159e54e36d>
>         or
>         https://doremus-anr.github.io/schema-visualizer/#?id=57ada1d5bd966f0bda6b8a7d
>         <https://doremus-anr.github.io/schema-visualizer/#?id=57ada1d5bd966f0bda6b8a7d>
>
>         > Did a quick fork (and edit) to
>         > make https://webcivics.github.io/jsonld-vis/
>         <https://webcivics.github.io/jsonld-vis/>  in a few minutes...
>         Still
>         > needs some work.
>
>         This seems to be a fork from
>         https://scienceai.github.io/jsonld-vis
>         <https://scienceai.github.io/jsonld-vis> and
>         not from our visualizer, no?
>
>
>     Yup.  I found yours also.  Nice. ;)
>
>     Checkout: https://github.com/solid/solid
>     <https://github.com/solid/solid>
>
>     Might be a starting point to make a read/write solution..
>
>     Tim.h.
>
>
>            Raphaël
>
>         --
>         Raphaël Troncy
>         EURECOM, Campus SophiaTech
>         Data Science Department
>         450 route des Chappes, 06410 Biot, France.
>         e-mail: raphael.troncy@eurecom.fr
>         <mailto:raphael.troncy@eurecom.fr> & raphael.troncy@gmail.com
>         <mailto:raphael.troncy@gmail.com>
>         Tel: +33 (0)4 - 9300 8242 <tel:%2B33%20%280%294%20-%209300%208242>
>         Fax: +33 (0)4 - 9000 8200 <tel:%2B33%20%280%294%20-%209000%208200>
>         Web: http://www.eurecom.fr/~troncy/
>
>
>
>
> --

-- 
Stefan Dietze
Mobile UK: +44 (0)799/9591436
Mobile Germany: +49 (0)179/5939815
E-Mail: stefan.dietze@gmx.de
Web: http://stefandietze.net
Received on Tuesday, 20 September 2016 10:35:01 UTC

This archive was generated by hypermail 2.3.1 : Tuesday, 20 September 2016 10:35:02 UTC