W3C home > Mailing lists > Public > public-lld@w3.org > June 2011

Review of side deliverable "LLD Vocabularies and Datasets"

From: Monica Duke <m.duke@ukoln.ac.uk>
Date: Tue, 21 Jun 2011 14:57:46 +0100
Message-Id: <A294A73F-830E-4856-B3A4-1797899E5CD1@ukoln.ac.uk>
To: public-lld@w3.org
Here is my review of http://www.w3.org/2005/Incubator/lld/wiki/Vocabulary_and_Dataset  Apologies it is one day late. Hope this is helpful, please do ask if anything is unclear.

General:
The title of the deliverable names vocabularies first, then datasets, the table of contents (and the contents) have this reversed: make title consistent with content (also title says Vocabulary instead of Vocabularies - in blue)
The deliverable is about 3 types of things, metadata element sets, vocabularies and datasets. Vocabs and datasets are grouped together in Section 2, and metadata elements sets get section 3.  It seems to me that metadata elements sets are at the same level as the other two and could/should be included in the title of the deliverable?
The order of the definitions, the title and the sections should all be consistent.
Section numbers alongside the section headings within the text (not just in contents) would be helpful.
Why do Datasets and value vocabularies not get a main section each (but grouped in section 2 instead)?
Replace all 'cases' with 'use cases' when used in to refer to the use cases (inc. in section title 3.3)
There are a number of TODOs that will need to be checked.

Introduction:
Need to explain the relation of this document to the W3C incubator group and its work (suggest adding a third sentence after 'refresher' and before 'As our incubator reminds in its its recommendations'.
'As our incubator reminds in its its recommendations,' replace with 'The report of the incubator group suggests that' - the link goes to the issues page not the recommendations page, is that correct?
'any domain indeed relies' - remove 'indeed'
'Far from it, the complexity and variety of library data resources, many of them already available as linked data at the time of writing this report, makes such an identification effort crucial.' replace with 'Such an identification effort is crucial given the complexity and variety of library data resources, many of them already available as linked data at the time of writing this report.'
Suggest adding a sentence (this or similar wording): We hope that this report will help those who undertake such a task. 
'which as shown later are non mutually exclusive' replace with 'which are non mutually exclusive (as shown later)'

Definitions:
Metadata element sets - I am sorry but this definition has really confused me, especially the use of the words entities and elements, and the distinction between them. Are we saying that metadata element sets sometimes define entities and sometimes define elements? The examples help, but until I got to the examples, the definition made no sense to me (and I am worried that the examples would not help anyone who was not already familiar with FRBR, DC etc).
suggest 'A metadata element set defines classes of entities and attributes (elements) of entities.' replace with 'A metadata element set defines classes of entities and attributes  of entities (elements).'
'such element sets are materialized' replace with 'made concrete' or 'instantiated'
'Usually a metadata element set does not define bibliographic entities' seems to be in contradiction with 'A metadata element set defines classes of entities' (or is it the word classes that is important there - use italics?).

Value Vocabularies - '(topics, art styles, authors)' do we mean '(instances of topics, art styles, authors)'?
'They are "building blocks" with which metadata records can be built.' - 'populated' instead of 'built'.
'Many libraries require specific value vocabularies as mandatory' replace with 'Many libraries mandate specific value vocabularies' 
'Resources that can be considered as' replace with 'Examples of'
'Note however that' - remove
In the Examples give an example in brackets (e.g. an actual topic value)
Art and Architecture - I don't understand what a.o. is
GeoNames - put name of a city instead of (e.g. cities)

Datasets
Add as second sentence (this, or similar wording): The equivalent of a dataset in the library world is a collection of Library records.
'grounded by the cases' - replace with 'grounded by the use cases'

'We do not aim here to draw a complete list of the various resources related to the (library) linked data "cloud". As said, this report is rather intended as an entry point for practitioners to find, understand and explore some exemplar resources. It is especially grounded by the cases our incubator group has gathered.' 
replace with 
'This report is intended as an entry point for practitioners to find, understand and explore some exemplar Metadata Element Sets, Value Vocabularies and Datasets. It is especially grounded by the use cases our incubator group has gathered. We do not aim here to draw a complete list of the various resources related to the (library) linked data "cloud". '

'We hope it will prove an inspirational complement to more complete listing tools such as Semantic Web search engines, like Sindice or Falcons, or registries such as the Metadata Registry or CKAN'  
replace with
'We hope it will prove an inspirational complement to more complete listing tools such as Semantic Web search engines (like Sindice or Falcons), or registries such as the Metadata Registry or CKAN'

Datasets and Value Vocabularies
'CKAN is a metadata registry for datasets.' - remove 'metadata' (because we are using metadata in one of the definitions. Also makes it clear we are using datasets in the sense we have defined it here)
'membership the wider' replace with 'membership of the wider'

Published Datasets
You will need one sentence to explain that this is a snapshot taken on <date> etc

Published value vocabularies
'Cases collected by the LLD XG are also listed under each entry, when they refer to the value vocabulary.' replace with 'Cases collected by the LLD XG that refer to the value vocabulary are also listed under each entry.'

Relevant LLD Metadata element sets - anno 2011
Remove 'Relevant LLD' - these are assumed from context (And also for consistency with other section titles) - possibly same can be said for anno 2011 (can be assumed)
Need one sentence in the intro to explain that this section also lists some that are not yet published as RDF in subsections (and why these ones?)

'These include some the most relevant' replace with 'These include some of the most relevant'
'emphasized on' remove 'on'
'as identified by the gathered by the LLD Incubator Group.' identified by OR gathered by; the link is broken.
'any kind of bibliographic things' replace with 'any kind of bibliographic thing'
'thousands coherently' replace with 'thousands of coherently'
'provides with the classes' ??
'fo describing music' replace with 'for describing music'
'It's work' replace with 'Its work'
The link to SPECTRUM did not work.

3.1, 3.2 section titles -  ontologies vs Semantic Web ontologies - make consistent
3.3., 3.4 - uses 'RDF version'  - does this need to be consistent with 3.1, 3.2 and use ontologies (or vice versa)
Some inconsistency in using 'published' in section titles vs 'create' to mean 'make available'
Received on Tuesday, 21 June 2011 13:58:24 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Tuesday, 21 June 2011 13:58:24 GMT