[ALL,VM] Vocabulary Management Task Force Description - discussion draft

Discussion draft, 2004-06-12

SWBPD "Vocabulary Management" Task Force Description

NAME          
    Vocabulary Management

STATUS        
    Considered

COORDINATORS  
    Tom Baker and ?

MEMBERS
    Libby Miller
    Natasha Noy
    Dan Brickley 
    Alistair Miles ("al")?
    Alan Rector ("alan")?
    James Hendler ("jim")?
    Ralph Swick (maybe)

OBJECTIVES
    The goal of this Task Force is to describe best practice for
    declaring and managing terms and term sets (vocabularies)
    for use in a Semantic Web environment.  Specifically, the
    Task Force will describe the following:
    
    -- Identification of terms with URIs
       -- Term-related entities identified: eg, a term concept,
          a historical version of a term, a set of terms.
          Possibly: a set of terms declared elsewhere and 
          "reused" in an "application profile".
       -- Formation of URI strings: eg, "implied semantics", 
          "# versus /", version numbers in URI strings.
       -- Versioning of terms ("change management"): event-
          based model linking chains of "term versions" to a
          "term concept" [DCMI-VERSIONING].  Analogy to W3C
          practice for identifying document versions?
    
    -- Policies for term declaration and identification
       -- Term-identification policy: eg, "namespace policy" 
          [DCMI-NAMESPACE], expectations about persistence, 
          maintenance, institutional commitment, semantic
          stability.
       -- "Assertion etiquette" ("good neighbor" policies):
          eg, if DCMI and Library of Congress mutually assert
          a subPropertyOf relationship between MARC Relator
          codes with respect to dc:contributor.
       -- "If we want to declare a term but lack the
          institutional context appropriate to a persistent
          namespace policy, how can we do it?  Should I use an
          existing term, get DCMI to declare one, or declare
          my own?  How can I coin a URI?  Where would I put it?"
    
    -- Documenting terms
       -- What should term URIs "resolve to" (eg, TAG 
          recommendations with regard to RDF or RDDL)?
       -- What are minimum expectations with regard to 
          availability of HTML Web pages, RDF schemas,
          and the like?  Who can we point to that does it 
          right?
       -- Is there a notion of "canonical" versus "derived" 
          sources?
       -- Hints for work flow to maintain multiple 
          documentation forms in synch.

APPROACH
    The issues above have been discussed and documented in various
    vocabulary maintenance communities.  The Task Force deliverable
    should provide an overview of the issues involved in declaring
    and maintaining a vocabulary, pointing to available examples
    of good practice and summarizing their underlying principles.

SCOPE
    Beyond the areas outlined above (URIs, Policies,
    Documentation), additional issues are relevant but possibly
    beyond the scope of the Task Force deliverable:
    
    -- Describing terms: What attributes do terms have, and how
       important is it for interoperability to use existing
       attribute sets?
    
    -- Application profiles: Is it valid and useful to distinguish
       between a "term declaration" and a "term re-use" -- eg,
       in annotations or "application profiles"?
    
    -- Thesauri and ontologies: What types of "vocabulary"
       are there (eg, "metadata element sets", "thesauri", 
       "ontologies") and to which do these guidelines apply?
    
    These issues should perhaps be discussed within the Task
    Force deliverable itself in the form of a scope statement,
    when possible with pointers to other relevant documents.
    
DELIVERABLE
    A relatively concise technical note summarizing
    principles of good practice, with pointers to examples,
    about the identification of terms and term sets with URIs,
    related policies and etiquette, and expectations regarding
    documentation.
    
TARGET AUDIENCE
    -- Maintainers of terms and term sets (vocabularies) for use in a 
       Semantic Web environment.
    -- Anyone else wishing to declare terms reusably.
    
DEPENDENCIES
    -- THES - SWBP Thesaurus Task Force
       http://www.w3.org/2004/03/thes-tf/mission
    -- FOAF
       http://xmlns.com/foaf/0.1/
    -- SKOS - SWAD Europe
       http://www.w3.org/2001/sw/Europe/reports/thes/1.0/guide/
    -- Dublin Core - DCMI
       http://dublincore.org/documents/dcmi-namespace/
       http://dublincore.org/documents/dcmi-terms/
    -- Dublin Core - CEN MMI-DC Working Group
       http://www.bi.fhg.de/People/Thomas.Baker/Versioning-20040611.txt
       http://www.cenorm.be/isss/cwa14855/ 
    -- Image Annotation meeting in Madrid
       http://rdfig.xmlhack.com/2004/06/07/2004-06-07.html#1086615887.400193
    
REFERENCES
    [DCMI-NAMESPACE] http://dublincore.org/documents/dcmi-namespace/
    [DCMI-VERSIONING] http://www.bi.fhg.de/People/Thomas.Baker/Versioning-20040611.txt

-- 
Dr. Thomas Baker                        Thomas.Baker@izb.fraunhofer.de
Institutszentrum Schloss Birlinghoven         mobile +49-160-9664-2129
Fraunhofer-Gesellschaft                          work +49-30-8109-9027
53754 Sankt Augustin, Germany                    fax +49-2241-144-2352
Personal email: thbaker79@alumni.amherst.edu

Received on Saturday, 12 June 2004 05:10:46 UTC