
Use VoID and schema.org [Was: Re: LODAtlas 1.0 release - Browsing Linked Data Catalogs]

From: Jerven Bolleman <me@jerven.eu>
Date: Thu, 4 Oct 2018 09:33:29 +0200
Message-ID: <CAHM_hUMD9K-AZk4V9f5fK_15xr+ztsV++AUxffftFJ29S9bX8Q@mail.gmail.com>
To: emmanuel.pietriga@inria.fr
Cc: ettorerizza@gmail.com, Linking Open Data <public-lod@w3.org>
Dear Emmanuel, Community,

This is a lovely UI, but it is let down by its data. So much good data is
missing or misrepresented because of the bad, wrong and obsolete data in
datahub.io. Datahub approaches do not scale and are not maintainable for
data providers.

Like the LOD diagram, it misses a lot of key datasets produced by
professionals every month, as these professionals don't have time to update
ten forms in different places. However, these people do produce schema.org
markup and/or extensive VoID files.

Why are you not parsing these? How come I need to fill in a form instead of
just submitting a URI where there is schema.org markup or a VoID file? Why
do I need to update your database by hand, again and again?
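
To illustrate what consuming such markup could involve, here is a minimal
sketch, not LODAtlas code. It assumes the dataset page embeds schema.org
metadata as JSON-LD in a script element; the class name, function name and
sample page are hypothetical. It ignores "@graph" wrappers and list-valued
"@type", which a real harvester would need to handle.

```python
import json
from html.parser import HTMLParser


class JsonLdExtractor(HTMLParser):
    """Collect the raw text of <script type="application/ld+json"> blocks."""

    def __init__(self):
        super().__init__()
        self._in_jsonld = False
        self._buffer = []
        self.blocks = []

    def handle_starttag(self, tag, attrs):
        if tag == "script" and dict(attrs).get("type") == "application/ld+json":
            self._in_jsonld = True
            self._buffer = []

    def handle_data(self, data):
        # Script content may arrive in several chunks, so buffer it.
        if self._in_jsonld:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag == "script" and self._in_jsonld:
            self._in_jsonld = False
            self.blocks.append("".join(self._buffer))


def extract_datasets(html):
    """Return every top-level schema.org Dataset description found in the page."""
    parser = JsonLdExtractor()
    parser.feed(html)
    datasets = []
    for block in parser.blocks:
        try:
            doc = json.loads(block)
        except json.JSONDecodeError:
            continue  # skip malformed markup rather than fail the whole page
        items = doc if isinstance(doc, list) else [doc]
        for item in items:
            if isinstance(item, dict) and item.get("@type") == "Dataset":
                datasets.append(item)
    return datasets


# Hypothetical page, standing in for whatever a submitted URI returns.
SAMPLE_PAGE = """
<html><head>
<script type="application/ld+json">
{"@context": "https://schema.org",
 "@type": "Dataset",
 "name": "Example dataset",
 "url": "https://example.org/dataset",
 "dateModified": "2018-10-01"}
</script>
</head><body></body></html>
"""

if __name__ == "__main__":
    for ds in extract_datasets(SAMPLE_PAGE):
        print(ds["name"], ds.get("dateModified"))
```

A catalogue could run something like this on a schedule against each
submitted URI, so that a change to the provider's own markup reaches the
catalogue without anyone filling in a form again.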

Why does the LOD community that produces tools not actually consume LOD to
drive them?

Regards,
Jerven

On Wed, Oct 3, 2018 at 8:34 PM emmanuel.pietriga@inria.fr <
emmanuel.pietriga@inria.fr> wrote:

> [Continuing this thread on the public-lod list only]
>
> This is a good point.
>
> The short answer is: because we bootstrapped LODAtlas with content from
> old.datahub.io (and others), using datasets that were tagged as “lod”.
> Wikidata is in there, but is not tagged as “lod”. Actually, it isn’t tagged
> with anything we had considered in the first place. See [1].
>
> [1] https://old.datahub.io/dataset/wikidata
>
> Longer answer: of course, you are right. It should definitely be in there.
> Like many others, probably. Regarding Wikidata, we’ll add it ourselves.
> For other datasets that belong there: you can use the submission form to
> add them, or point us at them and we’ll see what we can do. We have limited
> resources to handle such requests for now, but we’ll do the best we can.
>
> best regards,
> Emmanuel
> --
> Emmanuel Pietriga
> INRIA - ILDA
> http://pages.saclay.inria.fr/emmanuel.pietriga
>
>
>
> > On 3 Oct 2018, at 19:07, Ettore RIZZA <ettorerizza@gmail.com> wrote:
> >
> > Hello,
> >
> > Thank you very much, this kind of tool interests me a lot. Just a note:
> I see that Wikidata is not there. Aren't you afraid that an atlas of the
> LOD without such a huge dataset would be like an atlas of the world without
> China or Russia?
> >
> > Best regards,
> >
> > Ettore Rizza
> >
> > On Wed, 3 Oct 2018 at 16:45, Hande Gözükan <hande.gozukan@inria.fr>
> wrote:
> > Dear All,
> >
> > We are happy to announce the release of LODAtlas version 1.0 [1]
> developed by team Ilda at Inria [2].
> >
> > LODAtlas is a Web tool that helps users find linked datasets of interest
> through faceted browsing + keyword & URI search on the datasets' metadata
> and their schema-level content. The tool provides a set of interactive
> visualization widgets that help compare datasets along different criteria
> (number of triples, last update, interlinking with other datasets in the
> LOD cloud, etc.). Users can also get an idea of the contents of a given
> dataset thanks to a visual summary of the statements it contains. LODAtlas
> also offers a REST API that gives programmatic access to most of the
> data that can be visualized [3].
> >
> > A talk about LODAtlas will be given at ISWC next week in Monterey, CA,
> USA [4a] (full paper available from [4b]).
> >
> > [1] http://lodatlas.lri.fr
> > [2] https://ilda.saclay.inria.fr
> > [3] http://lodatlas.lri.fr/api
> > [4a]
> http://iswc2018.semanticweb.org/sessions/browsing-linked-data-catalogs-with-lodatlas/
> > [4b] https://hal.inria.fr/hal-01827766/document
> >
> > Short description:
> >
> > LODAtlas takes as input CKAN [5] dataset descriptions. The LODAtlas
> instance at [1] gives access to the entire (old) DataHub catalogue [6],
> that of data.gov [7], and partial access to the EU data portal [8] (data
> processing is still ongoing).
> >
> > [5] https://docs.ckan.org/en/2.8/contents.html
> > [6] https://old.datahub.io
> > [7] https://www.data.gov
> > [8] https://www.europeandataportal.eu
> >
> > Data processing comprises the following main steps:
> > - Download the metadata describing linked datasets from the CKAN
> repository.
> > - Download the associated RDF dump files (when available).
> > - Process the dump files using LODStats [9] to extract classes,
> properties and vocabularies.
> > - Process dump files together with schema/ontology files using the RDF
> Quotients framework [10] to generate visual summaries of the dumps’
> contents.
> >
> > LODAtlas is developed as an open source project under GNU General Public
> License v3.0. The source code is hosted on GitLab [11].
> >
> > [9] https://github.com/AKSW/LODStats
> > [10] https://hal.inria.fr/hal-01325900
> > [11] https://gitlab.inria.fr/epietrig/LODAtlas
> >
> > LODAtlas instances can be set up by anyone, using CKAN-compliant linked
> dataset descriptions. LODAtlas is available as a Docker image, or can be
> compiled locally.
> >
> > Any feedback is appreciated!
> >
> > Best regards,
> > The LODAtlas team.
>
>
>

-- 
Jerven Bolleman
me@jerven.eu
Received on Thursday, 4 October 2018 07:34:05 UTC
