Re: Examples using schema.org/bioschemas for dataset landing pages from Martin Fenner on 2018-02-02 (public-bioschemas@w3.org from February 2018)

From: Martin Fenner <martin.fenner@datacite.org>
Date: Fri, 02 Feb 2018 11:12:04 +0000
To: "Rafael C. Jimenez" <rafael.jimenez@elixir-europe.org>
Cc: public-bioschemas@w3.org, Sarala Wimalaratne <sarala@ebi.ac.uk>
Message-ID: <CAG9q6AaYAbjDGUrD-7=X6gQD8jr8FV+8KQDW92+fBxKf0a__qw@mail.gmail.com>
Thanks a lot Rafa. Our current focus is on listing datasets, as we want to
build tools taking advantage of this information in the coming months. But
we of course should look at the bigger picture, and describing for example
catalogs makes a lot of sense. The CSV file is a first version, and I am
sure will have more columns over time. But please keep in mind that this
activity is not only in the life sciences, but data repositories in general
(e.g. you see examples from social sciences, geosciences, high-energy
physics).

Sarala and I should share with the bioschemas community what we are
planning to do in the FREYA project. The primary goal with this activity is
to fetch core metadata from repositories embedding schema.org, which can
for example be used to generate a citation.

Best,

Martin

On Fri, Feb 2, 2018 at 7:36 AM Rafael C. Jimenez <
rafael.jimenez@elixir-europe.org> wrote:

> Hi Martin,
>
> I am sorry for the late reply. I am meeting Sarala later. So I will catch
> up with her as you suggested. If it is useful for you I think we should add
> to your list the life sciences resources we know are exposing datasets
>  with schema.org. I think we should be able to contribute with few more
> resources.
>
> Related to this topic, in March last year in one of our Bioschemas
> meetings
> <https://docs.google.com/document/d/1z1YqqLVntc0wpYCwvHTs5JPPgvfGjUrGdCoVSAAtFE0/edit#heading=h.2rf1bkfns01y>, we
> discussed the idea of creating a registry to facilitate the discover of
> resources adopting bioschemas/schema.org types
> <https://drive.google.com/file/d/0Bw_p-HKWUjHoSlBrUHU1TWZnajQ/view?usp=sharing>.
> We did agree not to create a new registry but enable our existing
> registries (identifiers.org, biosharing.org, bio.tools, TeSS, biosamples,
> ...) to capture/expose how indexed resources support Bioschemas/Schema.org
> types. I imagine you might aim to do something similar in re3data?
>
> As far as I understand you are interested in resources describing just
> datasets with schema.org. We are interested in resources which are
> indexing more types than datasets (eg. tools, training materials, proteins,
> ...). So for us is also important to capture other details like the
> schema.org types and the bioschemas profiles supported.
>
> Until we have the registries enabled I think in Bioschemas we should
> capture this information in a simple list as you do. We just had a
> Bioschemas meeting in Italy in the last two days and we wanted to show
> which resources in the workshop adopted Bioschemas. We created this list
> <https://github.com/BioSchemas/specifications/blob/master/implementations.csv>.
> We are thinking to complete this list with all the resources that we know
> adopted bioschemas. We are happy to add those that are focused on datasets
> to you list as well.
>
> BTW, I see in your list you use a column specific for re3data. As you
> might know we have several catalogues (registries) indexing resources in
> life sciences. Would it make more sense to use IDs that we have in our
> existing registries besides the re3data or additional to re3data? using
> prefixes (eg "re3data:r3d100000044")?
>
> Regards,
> Rafa.
>
>
> On 19 January 2018 at 19:43, Martin Fenner <martin.fenner@datacite.org>
> wrote:
>
>> Dear list,
>>
>> identifiers.org (Sarala) and DataCite (me) have started to generate a
>> list of data resources that embed schema.org metadata in dataset landing
>> pages:
>>
>> https://github.com/datacite/freya/blob/master/schema_org.csv
>>
>> We want to use this list to build pilot applications using the schema.org
>> metadata, and we invite you to add resources we haven't listed yet (via
>> pull request or in GitHub issue
>> https://github.com/datacite/freya/issues/8). And feel free to use the
>> list of currently 19 resources, with unfortunately still only a few from
>> the life sciences.
>>
>> And please reach out to Sarala or me if you want to know more about what
>> we are planning to build.
>>
>> Thanks!
>>
>> Martin
>>
>>
>> --
>>
>> Martin Fenner
>> DataCite Technical Director
>> http://orcid.org/0000-0003-1419-2405
>>
>
>
>
> --
>
> *Rafael C Jimenez*
> ELIXIR Chief Technical Officer
> www.elixir-europe.org
>
> ELIXIR Hub, South Building
> Wellcome Genome Campus
> Hinxton, Cambridge, CB10 1SD, UK
> Tel: +44 (0) 1223 49 2574 <%2B44%20%280%29%201223%20492574>
> E-Mail: rafael.jimenez@elixir-europe.org [image: ELIXIR]
> <http://www.elixir-europe.org/>
>
> --

Martin Fenner
DataCite Technical Director
http://orcid.org/0000-0003-1419-2405
Received on Friday, 2 February 2018 11:12:40 UTC