- From: Laufer <laufer@globo.com>
- Date: Mon, 19 May 2014 12:22:48 -0300
- To: Ghislain Atemezing <auguste.atemezing@eurecom.fr>
- Cc: Bernadette Farias Loscio <bfl@cin.ufpe.br>, Carlos Iglesias <carlos.iglesias.moro@gmail.com>, Makx Dekkers <mail@makxdekkers.com>, DWBP Public List <public-dwbp-wg@w3.org>
- Message-ID: <CA+pXJij6JiFCgvqYGrh=YowJ8dZoTRE0oUH6+x69dhjjqWn2Mg@mail.gmail.com>
Hi, Ghislain, Sorry about misspelling your name. Cheers, Laufer 2014-05-19 12:03 GMT-03:00 Laufer <laufer@globo.com>: > Hi, Ghislam, > > > >> I could consider also metada "computed" based on some provenance data + > metrics. For e.g.: If a dataset is published by > > a "certified organization" and it is reused by many users/applications, > then it has higher quality. > Interesting. I think it would be necessary to have vocabularies to > describe these "computed" metadata. > > >Metadata Types for Use > >URI Design Principles > >Machine Access to Data > >API specification > >I am not sure to understand the above types. Could you give us an example > why "vocabularies" are not in this list, > >but "URI design principles" is here? One may think that there is no > principles in designing URIs for vocabs. > This is a starting list. We have to discuss it a lot. The name "URI Design > Principles" could not explain what I have thought. I was thinking about a > rule that a dataset could have, in a way that a Consumer could build an > appropriate URI for accessing a specific resource (the definition of > resource used in rdf). > > > >> What's the difference between "format spec" and "data format"? > I think that the terminology has to be well-defined and some terms are > difficult to be understood only by their names. > Some information that is related to the "Use" task could be of the > Consumer interest when choosing the dataset, during the "Search" task . The > two types are related: "Format Specification" and "Data Format". "Date > Format" could be for example, "CSV" and "Format Specification" could be the > semantics of the CSV file (defined by the CSV on the Web WG). But, again, > the names of these metada types have to be defined and described. > > > >>As others pointed out, we could define a small set of mandatory field > when providing the metadata. > +1. DCAT AP also did this kind of thing. > > Cheers, > Laufer > > 2014-05-16 4:26 GMT-03:00 Ghislain Atemezing <auguste.atemezing@eurecom.fr > >: > > Hi Laufer, all, >> Thanks for this great starting discussion. Find below my 2 cents ... >> >>> I created a page on the wiki, "Best Practices – Guidance on the >>> Provision of Metadata", where we can put the information about this >>> topic. I took the liberty to define a prefix in the subject of the >>> e-mails related to these discussions: [BP- MET]. >>> >>> I would like to expose some thoughts that I think are related to the >>> data on the web ecosystem. I see a kind of data architecture that has >>> three big roles: a data Publisher, a data Consumer and a data Broker. >>> The Broker is the one that has information that can be used by the >>> Consumer to find data published by the Publisher. >>> >>> As an example of Brokers we can think about implementations of CKAN, >>> used by data.gov <http://data.gov>, dados.gov.br <http://dados.gov.br>, >>> >>> etc. CKAN has metadata (provided by Publishers) that are useful for >>> Consumers to find data. CKAN is a registry and can also be a repository >>> for the data to be consumed. Almost all use cases of DWBP WG are >>> examples of Brokers. >>> >>> At the same time, data published in CKAN implementations can have >>> multiple formats, as CSV, for example. Once a Consumer chooses some data >>> to use from a Publisher, she needs another kind of metadata to >>> understand how to access the data and its semantics. >>> >>> I propose to create categories and types of metadata. I see two >>> categories: metadata for search and metadata for use. Each of these >>> categories would have types of metadata. For example: >>> >>> +1. I could consider also metada "computed" based on some provenance >> data + metrics. For e.g.: If a dataset is published by a "certified >> organization" and it is reused by many users/applications, then it has >> higher quality. > > > >> Metadata Types for Search >>> >>> Human Content Description (free text) >>> >> ..and categories/themes >> >> >>> Machine Content Description (vocabularies) >>> >>> Provenance >>> >>> License >>> >>> Revenue >>> >>> Credentials >>> >>> Quality / Metrics >>> >>> Release Schedule >>> >>> Data Format >>> >>> Data Access >>> >> +1 for all this first metadata types >> >> >>> Metadata Types for Use >>> >>> URI Design Principles >>> >>> Machine Access to Data >>> >>> API specification >>> >>> I am not sure to understand the above types. Could you give us an >> example why "vocabularies" are not in this list, but "URI design >> principles" is here? One may think that there is no principles in designing >> URIs for vocabs. >> >>> Format Specification >>> >>> What's the difference between "format spec" and "data format"? >> >> As others pointed out, we could define a small set of mandatory field >> when providing the metadata. >> >> Thanks again for taking care of this section. >> >> Cheers, >> Ghislain >> >> -- >> Ghislain Atemezing >> EURECOM, Multimedia Communications Department >> Campus SophiaTech >> 450, route des Chappes, 06410 Biot, France. >> e-mail: auguste.atemezing@eurecom.fr & ghislain.atemezing@gmail.com >> Tel: +33 (0)4 - 9300 8178 >> Fax: +33 (0)4 - 9000 8200 >> Web: http://www.eurecom.fr/~atemezin >> Google+:http://google.com/+GhislainATEMEZING >> Twitter:@gatemezing >> > > > > -- > . . . .. . . > . . . .. > . .. . > -- . . . .. . . . . . .. . .. .
Received on Monday, 19 May 2014 15:23:16 UTC