Minutes, DCMI/LLD meeting on application profiles, Oct 22

Dear all,

The minutes of the Joint meeting of the DCMI Architecture
Forum and the W3C Library Linked Data Incubator Group in
Pittsburgh on Friday, 22 October, about "Application Profiles
for Linked Data: models and requirements" are available at
[1] and attached below.

Follow-up discussion on application profiles in general will
continue on the public-lld mailing list; Mikael Nilsson
has posted some reactions which, I hope, will lead to a
productive discussion on identifying potentially different
types of application profiles for addressing different types
of requirements [2].

Follow-up discussion specifically about the future development
of DCMI specifications such as the DCMI Abstract Model and 
Description Set Profile constraint language will take place
on the mailing list of the DCMI Architecture Forum [3].

Tom




[1] http://www.w3.org/2005/Incubator/lld/minutes/2010/10/22-lld-minutes.html
[2] http://lists.w3.org/Archives/Public/public-lld/2010Oct/0098.html
[3] http://www.jiscmail.ac.uk/lists/dc-architecture.html

----------------------------------------------------------------------
   [1]W3C

      [1] http://www.w3.org/

  DCMI Architecture Forum and W3C Library Linked Data Incubator Group
  joint meeting on "Application Profiles for Linked Data: models and
                             requirements"

22 Oct 2010

   [2]Agenda

      [2] http://www.w3.org/2001/sw/wiki/JointMeeting2010

   See also: [3]IRC log

      [3] http://www.w3.org/2010/10/22-lld-irc

Attendees

   Present, Regrets
          see http://www.w3.org/2001/sw/wiki/JointMeeting2010 for
          Library Linked Data XG participants. Many DC attendees were
          participating

   Chairs
          Tom Baker, Antoine Isaac, Emmanuelle Bermes

   Scribe
          Mark van Assem, Kai Eckert, Ed Summers

Contents

     * [4]Topics
     * [5]Review of DCMI Abstract Model and possible options
     * [6]Application Profiles and OWL
     * [7]Application profile uses
     * [8]Brainstorming
     _________________________________________________________

   <TomB> Scribe: Mark van Assem

   Antoine: welcome joint meeting DC architecture group and W3C LLD
   ... issues with DCAM and APs built on them, explore options,
   consequences for future activities

   Antoine: TomB presents his work
   ... 2nd presentation Michael Panzer et al
   ... after coffee break exploration of reqs of APs in context of
   subject authority data
   ... then informal discussion

Review of DCMI Abstract Model and possible options

   Presentation from Tom:
   [9]http://www.w3.org/2001/sw/wiki/File:2010-10-22.dcam-joint-meeting
   -20101006.pptx.pdf

      [9] http://www.w3.org/2001/sw/wiki/File:2010-10-22.dcam-joint-meeting-20101006.pptx.pdf

   TomB: with pete johnston, walk through history of DCAM
   ... early 2000s: two mindsets: RDF and record format mindset
   ... interoperability among DC implementations problematic
   ... but RDF hard sell: researchy; perceived as flavor of XML

   TomB: role of DCAM: bridge between mindsets; tree struct vs. graphs
   ... DCAM future: descriptive patterns reflecting existing metadata
   practice
   ... notion of bounded records
   ... notion of constraints
   ... (shows diagram summarizing DCAM)

   TomB: DCAM can be expressed in diff syntaxes; RDF/XML, HTML
   ... common interface for operating across syntaxes
   ... allows diff applications to communicate
   ... DCAM family: DCAM, DSP, syntaxes (DC-XML, HTML, DC-Text, ...),
   user guidance (Singapore Framework, Guidelines ...)
   ... Description Set Profile Constraint Language: layer on top of
   DCAM
   ... example: book, creator. Template for instances of Book
   ... Statement template: slots: property, literal value, language,
   Syntax Encoding Scheme
   ... Statement template for creator, only use slot value string
   ... "cookie cutter" for creating descriptions; Book's title is a
   literal, creator with dcterms:creator
   ... wiki syntax for combining template representation and html
   presentation of template
   ... XML syntax for DSPs
   ... motivation: configure metadata editor; use template to generate
   form for entering metadata
   ... validating metadata
   ... create OWL expression of constraints
   ... (diagram of Singapore Framework)

   TomB: interoperability levels: informal; semantic; Description set
   syntactic interop; Description Set Profile interop
   ... ~ shared Natural languge, shared formal model, shared records,
   shared constraints
   ... future scenarios: (1) carry on as before (2) DCAM 2 spec, better
   aligned with RDF (3) deprecate, continue with RDF (4) nothing
   ... (1) interest? editors? review?

   (2a) simplified and better aligned with RDF; structural constraints
   of APs

   TomB: impact of DCAM 2 on DCAM family?
   ... (2b) goal: clarification; transitional, to be deprecated in
   favor of RDF
   ... (3) negative impact? existing specs status? change in message?
   basis for APs gone?
   ... (4) does DCMI stand behind it or not? reputation? credibility?
   ... DCAM abstract syntax vs. RDF
   ... Descritption (sets) ~ named graphs?
   ... VES ~ SKOS concept schemes?
   ... use of rdf:value continues or something else such as
   skos:prefLabel?
   ... Issue: APs
   ... syntax pattern checks; checking patterns in the graph? Use OWL
   with closed world assumption?
   ... split in Singapore Framework
   ... constraints in underlying vocabulary or patterns on the data?

   <edsu> what the room looks like (if you are interested)
   [10]http://www.flickr.com/photos/inkdroid/5105654040/

     [10] http://www.flickr.com/photos/inkdroid/5105654040/

   <mini> @edsu hey, thanks, I was just wondering

   <andypowe11> @edsu thanks

   Jon Phipps: continue developing DCAM only realistic option
   ...RDF no notion of record, DCAM provides that
   ... enormous value outside RDF world

   TomB: remote participants comments?
   ... additions to presentation?

   <andypowe11> nothing from me at this stage

   <andypowe11> i'm lost - is the floor open for discussing the
   options?

   <antoine> @andy : yes

   Akira Mijasawa: DCAM DCAM2 differences?

   TomB: DCAM2 mostly RDF except where RDF does not have constructs
   ... get rid of DC terminology that is mapped to RDF

   andypowe11: options 2b, 3 and 4: all work to RDF, which is where we
   want to get to
   ... which of these is better to get to that end game, wrt time
   available
   ... 4 seems not ideal, but less effort
   ... lean to 3; 2b has political value by taking along community; but
   3 better given time

   <edsu> loud and clear :-)

   Stu Weibel: frustrated; no productive outcomes all these years
   ...we should just adopt Web as the model
   ... nobody understands DCAM
   ... W3C published architecture document after actual implementation
   ... revive effort: develop reference software; easily drop in data,
   generate linked data

   andypowe11: support Stu
   ... DC efforts was trying to say Web is model; got confused

   TomB: gap: how to express constraints? Or not necessary?

   <jphipps> Just because the DCAM is poorly expressed and poorly
   understood, doesn't obliterate its value as a model

   <jphipps> The world is NOT rdf-centric and is not likely to be

   Michael Panzer: was puzzled by description sets; but it does make
   ontological commitment clear
   ... bundles of assertions have to make sense; requires way to
   communicate this
   ... RDF struggles with same issues
   ... DCMI should get involved with RDF development

   <jphipps> It's clear to me that even (maybe especially) the creators
   of the DCAM don't understand its value

   mikael nilsson: DC close to data and data production
   ... lots of RDF data being produced
   ... different position now: syntax not problem anymore
   ... RDF encounters problems that DC has too

   <petej> I agree w Andy that the RDF model is where we want to get
   to, and 3 seems to me the best option, tho I'm willing to be
   persuaded there is a value in 2b
   ... look at problems, solve collaboratively
   ... DCAM starting replicating stuff in RDF; RDF has broader base
   ... DC produces vocabulary that's used in RDF; produces set of terms
   not linked to RDF in natural language

   <kai> Scribe: Kai Eckert

Application Profiles and OWL

   presentation by Jeff Young:
   [11]http://lists.w3.org/Archives/Public/public-lld/2010Oct/0101.html

     [11] http://lists.w3.org/Archives/Public/public-lld/2010Oct/0101.html

   <andypowe11> i'm going to drop out at this point - can't see
   presentation or hear very well - sorry

   <edsu> andypowe11: thanks for voicing your opinion so clearly

   <andypowe11> bye all - enjoy rest of the conf :-)

   Jeff Young: Introduction to next presentation: Application Profiles
   in OWL
   ...I Want to make sure that I am not that familiar with DCAM and
   that I come from the Semantic Web world.
   ... shows picture of FRBR as a DCAP domain model
   ... from SWAP
   ... simple translation to OWL, classes, properties, ...
   ... domain and range restrictions are used in OWL

   Jeff Young: I want to name the things, so I introduced
   UnnamedAbstraction...

   <LarsG> UnnamedAbstraction is a name for the union of Work,
   Manifestation and Item

   <mini> yep

   scribe: comparison to UML diagram
   ... cardinalities in OWL does not prevent anyone from ignoring them

   <mini> thx...

   <JennRiley> TomB: unclear to me if there's widespread support for
   keeping the dev't of some kind of constraint language. (I think we
   need this but didn't have a chance to get up to say so.) So I think
   we should verify the degree of support for that. And if there is
   support, discuss whether to do it within DCMI or use resources to
   push this in core RDF

   Jeff Young: Example from Toms Presentation with DCAM usage
   ... DCMI Type Text is a Class but you can not be sure in the XML
   representation

   <mini> I would like to see this done with RDF community on board, in
   any case.

   <mini> Even the Topic Maps standard has a CL

   <mini> -- [12]http://www.isotopicmaps.org/tmcl/tmcl.html

     [12] http://www.isotopicmaps.org/tmcl/tmcl.html

   scribe: in RDF, a subject should be a concept
   ... everything else about the concept should be get by dereferencing
   ... I cashed it here to let it look more like a record. You want the
   additional data here. You want to cache.

   TomB: DCAM is historical and DC-RDF is also available.

   Jeff Young: This is an example how I progressed.

   <TomB> Jeff is showing the first page of XML output extracted by the
   wiki tool from SWAP

   Jeff Young: Now I try to convert that into OWL
   ... Types are already there, so let's look at the title

   <TomB> Jeff tripped up on fact that the first property cited in SWAP
   is dc:type (and he thinks in terms of rdf:type).

   <TomB> Michael Panzer is coming to the microphone, setting up.

   Switch to Michael Panzer

   <TomB> JennRiley - let's come back to this during the discussion in
   the second half

   Michael Panzer: Main difference between using OWL for DSP vs. DSP
   constraint language:
   ... DSP CL example
   ... Title has mincardinality of 1
   ... title has to be there
   ... a title with two types would not be valid
   ... with two types you could infer that both have to be the same

   <TomB> Pellet - an inference language for OWL 2. Has a dialect that
   treats OWL as a constraint language.

   Michael Panzer: test with pellet shows constraint violation
   ... with removed type it is valid

   <TomB> Mikael: Nice because you add the constraints to the class.

   Michael Panzer: People should remind that OWL approaches constraints
   in a different way

   Maja Zümer: Explains that Work, Expression, ... are no subclasses,
   so there was a reason to model it that way

   Akira Mijasawa: How can we incorporate management properties of the
   record? Who made it, ... provenance information, management
   properties

   <TomB> Akira: Description set constraints in OWL - how can we
   incorporate [metametadata]?

   Jeff: Named Graphs would be a possible solution, probably not the
   best, but possible. We can attach properties to graphs

   <TomB> Jeff: Create new entity, "record", attach property to that.
   Not clear how much overlap how much DCAP and how much OWL can
   express. Not clear to me.

   Jeff: Lot of further work to be done, it is not yet clear what OWL
   can do for us, what DCAM can do...

   Coffe Break

   <mini> i'll drop out here, thanks for an interesting discussion.

   <mini> I'll add a few lines of comments for the discussion later:

   <mini> 1. We need very concrete functional requirements, what kinds
   of constraints do we need? what precisely is "validation"? based on
   example records and profiles.

   <LarsG> Perhaps we should forget about records. We use to think in
   records because that's what we had, but now we have new
   possibilities. The metametadata problem is really the same as with
   provenance, and there's work underway with that, too.

   <mini> 2. We can test if OWL with constraint semantics can do it,
   and if DSPs can.

   <mini> 3. The critical question is: based on DCAM, or based on RDF.
   I certainly prefer the latter, but requires DCMI to adopt RDF.

   <mini> I personally see many advantages and potential use cases for
   an RDF CL that can specify "valid" graphs down to every last triple.

   <mini> Now I'm off, good luck!

   <petej> I'm leaving too. Thanks for discussions. I think Mikael's
   closing comments above summarise the key issues/questions very well

   Coffe break is over

   still scribing

   TomB: Repeats minis statements for the audience

Application profile uses

   Marcia Zeng: Presentation about Application Profiles (based on FRSAD
   model) for subject domains:
   [13]http://www.w3.org/2001/sw/wiki/File:FRSAD-AP.ppt.pdf
   ... Questions: 1. Why do we need APs for FRSAD?
   ... and two more...
   ... FRSAD conceptual model. Notion of thema: anything that can be a
   subject of a work
   ... different ways to group things
   ... examples: FRBR, SUMO
   ... even within one domain it is difficult to map thesauri.
   ... In general relationships between themas are hierarchical
   ... but there are others, ALA came up with 100s
   ... different types of KOS have different types to represent
   concepts: classifications, theauri, ...
   ... 2nd question: How formally can the AP be defined?
   ... communities have different domain models and usage guidelines
   ... FRSAD-AP Functional Requirements:
   ... in general vocabularies, but with specific different
   applications

     [13] http://www.w3.org/2001/sw/wiki/File:FRSAD-AP.ppt.pdf

   <TomB> FRSAD is a general model. Need more specific models for
   different types of vocabulary (classification versus thesauri),
   subject domains (medical vs consumer heatlh)...

   Marcia: FRSAD-AP domain model: a general model, needs more specific
   ones for different types of KOS

   <TomB> ...what are the characteristics of your subject vocabulary?

   KOS = Knowledge Organization System (Thesauri, Classifications, ...)

   Marcia: Triples have challenges, e.g. how to preserve order

   <TomB> ...specify the set of properties in a particular subject
   domain?

   Marcia: Nomen specifies different, general attributes
   ... Usage Guidlines for FRSAD-AP: Recommendations, e.g. SKOS, MADS,
   standards (BS, ISO)
   ... 3rd question: Difference between APs for subject domains and
   descriptive metadata
   ... serious sameAs issues: Is a concept from one KOS the same than
   the concept of another?

   <edsu> Scribe: Ed Summers

   Switch to Gordon Dunsire

   Gordon Dunsire: Classification/subject schemes presentation:
   [14]http://www.w3.org/2001/sw/wiki/File:ClassificationAP.pptx.pdf

     [14] http://www.w3.org/2001/sw/wiki/File:ClassificationAP.pptx.pdf

   Gordon: there are things in faceted classification schemes that need
   application profiles
   ... semifaceted sub-divisions also have issues that require AP: DDC,
   LCSH
   ... some subdivsions are mandatory in some schemes and optional in
   others
   ... also sequencing is important Law--Sociology, Sociology--Law
   ... something that APs need to address particulary for validation
   purposes

   Gordon Dunsire: FRBRer vs ISBD: OWL vs DCAP
   [15]http://www.w3.org/2001/sw/wiki/File:FRBRerISBD.pptx.pdf

     [15] http://www.w3.org/2001/sw/wiki/File:FRBRerISBD.pptx.pdf

   Gordon: I'm working with FRBR conceptual model: nothing mandatory,
   sequenced or encoded
   ... monolithic record split into 4 related parts, with some
   cardinality constraints
   ... seems to me the best way to model this is w/ OWL
   ... e.g Expression is a realization of *exactly* one Work
   ... not sure how to model that in AP
   ... contrasted w/ ISBD - which is a data model
   ... made up 9 separate sections or areas, sequencing is very
   important
   ... there is also 'manditory if applicable" which makes some things
   required depending on the resource being described
   ... seems to me the best way to model that is a DC application
   profile
   ... there are aggregations
   ... I'm wondering if there need to be 2 separate approaches, and how
   others would do it

Brainstorming

   TomB: any questions?
   ... I'd like to circle back to the OWL method, I understood from the
   discussion before the break that the idea was to model constraints
   with OWL, and to validate those constraints with closed world
   assumptions
   ... in pellet the owl is used to generate a sparql query to validate
   ... can someone confirm this?

   Michael Panzer: pellet is an owl2 reasoner, for doing inferencing
   ... but there is a project called pellet integrity constraint
   validator [16]http://clarkparsia.com/pellet/icv/
   ... it doesn't change anything in your owl, but it generates sparql
   queries from the owl ... the same owl is used for both the
   inferencing and the validation
   ... the integrity constraints wouldn't generate any inferences

     [16] http://clarkparsia.com/pellet/icv/

   Karen Smith-Yoshimura: i'm trying to separate what it is, from what
   you are doing with it

   scribe: sequencing (how things are presented) needs to vary on
   language context, and the application
   ... I'm not sure what happens with translations

   TomB: i wonder if jon or corey might have some thoughts
   ... do RDF and linked data need standard approaches to "application
   profiles"?

   Stu: do application profiles need to consider RDF/LInked data to be
   useful? Does Linked data/SemWeb need application profiles?

   Antoine: that's a valid question. in rdf there isn't so much
   guidance on how to reuse vocabularies. i think semweb community
   could benefit from this

   JonPhipps: an application profile at this point is documentation,
   too many organizations lack the documentation about their data,
   similar to what mike bergman talked about this morning

   TomB: are there only documentation requirements, or do we need to
   express constraints?

   JonPhipps: i'm deeply critical of people who think they have the
   answers in this space

   :-)

   TomB: not looking for answers, but suggestions

   JonPhipps: if you don't document what your data is, are you really
   communicating anything? It seems essential for trust.

   Gordon Dunsire: i think isbd would be a lot easier to understand as
   an AP. for communicating what this thing is

   Emmanuelle: i don't feel like i can say what's good for rdf, but the
   library community needs something that's like AP but for the linked
   data world

   TomB: i'm hearing a requirement to communicate the purpose and
   substance of a metadata model to a community for coherence of data
   and sharing an approach
   ... not hearing a clear requirement for standardizing an approach to
   modeling constraints for validation. does anyone want to argue for
   that?

   Gordon Dunsire: look at the FRBR model, if you convert legacy data
   to that model, having something you can validate aggregations of
   triples is quite important

   JennRiley: i agree, there are two reasons validation is important:
   it makes tool support easier ; it's also important for public
   relations, to constrain the world of linked data, and allows you to
   scope the web of data into manageable chunks (my wording)

   TomB: is the Description Set Profile language a good start at that?

   JennRiley: i don't have an opinion about whether it needs to be dcmi
   related

   Jeff Young: i think we should come up with some example use cases,
   it's hard to say -- we are grasping here

   TomB: can we identify different scenarios for different types of
   profiles?

   JonPhipps: there is creation metadata, there is the publishing
   metadata, and there is the consumption of the metadata
   ... there isn't a notion of constraints around publishing /
   consuming data for rdf ; those are areas that need to be covered by
   an AP

   antoine: there is agreement that some guidance should be provided
   when using vocabularies, but does this require a langauge?
   ... the fact that there was a formal language for the description
   set profile wasn't useful to me

   JonPhipps: i second that

   <emma> Markva asked wether Antoine's comment implied to stop effort
   on DCAM

   TomB: we have the singapore framework, if we ignore the DCAM is the
   rest valid?

   [Singapore Framework diagram on screen:
   [17]http://dublincore.org/documents/2008/01/14/singapore-framework/s
   ingapore-framework.png]

     [17] http://dublincore.org/documents/2008/01/14/singapore-framework/singapore-framework.png

   Diane Hillman: the idea that we will have to explain AP in terms of
   RDF...i've been through lots of phases of technical wonder. i'm
   worried that we are getting too far into thinking in one mode, need
   more general thinking than that

   JonPhipps: [gesturing at large parts of the Singampore Framework
   diagram and saying it is documentation related]

   TomB: what about data format?

   JonPhipps: that's a specification, perhaps somewhere else like SKOS

   (thumbs up from the modelers in the back)

   Michael Panzer: the abstract model is a meta model, and in this way
   in clashes with RDF
   ... how would you do some of the things in the DCAM with OWL? are
   you going to throw out some requirements?
   ... we could get involved in rdf next steps. but in the end dcam and
   rdf are at odds, and one must win

   Stu: Jon's assertion that we have confused syntax and semantics is a
   really strong point
   ... i wonder if someone is willing and able to explain what the
   abstract model means. we know how to describe items. i don't
   understand the singapore framework. we've got models that we don't
   believe. we haven't connected them with what we are trying to do.
   ... I'm not saying DCAM or RDF must win. if we were to sit down and
   write a document that would not allow us to use models, triples,
   domain models ... a plain natural langauge description of what we
   are trying to do...i tried to write about it in my blog and i got
   feedback that I didn't understand it.
   ... if you can't describe what the framwork is to practitioners then
   we can't move forward

   JonPhipps: the value of the upper two layers is that they allow us
   to document a domain model, in a way that is independent of the
   bottom layer (the implementation)
   ... it provides a valuable documentation model, there are bits that
   are too technical. it would help to have it rewritten in a way
   that's understandable.

   markva: could add some documents that explain it in very clear ways,
   like what the owl community has done
   ... could add some documents that explain how to go from the
   conceptual level to the implementation

   antoine: keeping the rdf reference you can do without a reference
   implementation guideline, that might not even express all the
   requirements. Just as informal guidance

   TomB: Michael do you think you can do without this bottom layer of
   RDF?

   Michael Panzer: the question is more where the wind is blowing
   ... why build it on RDF? do we do it because it's a good brand, or
   that it's useful? how important is that?
   ... the DC of working with metadata, will enough people find it
   useful without anchoring it to the RDF specs?

   JonPhipps: perhaps the bottom layer can be informative, and the
   middle layers would be normative

   TomB: adjourned
     _________________________________________________________


    Minutes formatted by David Booth's [18]scribe.perl version 1.135
    ([19]CVS log)
    $Date: 2010/11/02 16:29:05 $

     [18] http://dev.w3.org/cvsweb/~checkout~/2002/scribe/scribedoc.htm
     [19] http://dev.w3.org/cvsweb/2002/scribe/



-- 
Tom Baker <tbaker@tbaker.de>

Received on Thursday, 4 November 2010 17:15:38 UTC