The EUROVOC Thesaurus Ontology Schema

Technical Description and Usage Conventions

Date: 2009-10-26

Owner:

            the Publications Office of the European Union (http://publications.europa.eu/),

           © 2009 The Publications Office of the European Union
           Originating departments — Copyright
           2, rue Mercier
           L-2985 Luxembourg

Authors:

            Johan De Smedt (http://www.tenforce.com),

            Bernard Vatant (http://www.mondeca.com)

1.    The EUROVOC Thesaurus

The mission and use of the thesaurus is documented in English and in French.

The thesaurus is maintained by the OPOCE and is available via its portal.

A schema specific documentation was generated using OWLDoc via Protege 4.  It is available via EUROVOC OWLDoc.

1.1    Namespace and Identification

Namespace: http://eurovoc.europa.eu/schema#.

The ontology and this description are available from http://eurovoc.europa.eu/ontology/.

 

Reading convention: the following namespace abbreviations are used in this document:

1.2    Outline of the Modelling Approach.

The EUROVOC ontology schema is an extension of:

- SKOS Simple Knowledge Organization System

   Reference

   W3C Proposed Recommendation 15 June 2009

Including: Appendix B. SKOS eXtension for Labels (SKOS-XL)


The imported schemas are:


For technical documentation on SKOS please see the SKOS reference and the SKOS Primer.

1.2.1    Used SKOS Entities

The SKOS classes and properties listed at the end of this section are used when publishing the EUROVOC thesaurus.
SKOS inference may be applied (e.g. to calculate inverse properties). The specification and semantics of these classes and properties are available in the SKOS reference and the SKOS Primer.
Note: Most of these classes and properties are specialized with EUROVOC specific extensions.

1.2.2    Used Schema Annotations

Some Dublin Core properties are reused and re-declared without importing the complete DCMI RDF schemes.


EUROVOC extensions define artefacts that either are specified as top level classes and properties or as sub-classes and sub-properties of the SKOS and SKOS-XL artefacts.

When possible, EUROVOC specific extensions will entail SKOS artefacts.  The specifications of these inference rules are detailed using a EUROVOC schema annotation property:


The range of some EUROVOC properties is an XML literal value. For these properties, the OWL schema details the allowed XML structure using the annotation property:

2.    The EUROVOC Thesaurus Ontology Schema

2.1    Thesaurus Publication

2.1.1    Defined artefacts

eu:Export

eu:exportDate

eu:exportedThesaurus

eu:exportVersion

eu:intermediateRelease

eu:Language

eu:language

eu:supportedLanguage

eu:Thesaurus

2.1.2    Definitions

A EUROVOC thesaurus [eu:Thesaurus] is a SKOS Concept Scheme.  Subsequent publications of the thesaurus are exported from the back-office maintenance system.

The publication characteristics of the thesaurus are given by the back-office export [eu:Export] instance.


An eu:Thesaurus instance lists the languages for which its thesaurus concepts provide a preferred label (via skos:prefLabel or xl:literalForm).


The eu:Language class is a convenience class.  Each instance represents a language.  By convention, the URI of an eu:Language instance is the registered public subject indicator (see http://psi.oasis-open.org/iso/639/#).

2.2    Thesaurus Organisation

The overall structure decomposes the EUROVOC thesaurus into a set of Domains and a set of Micro-Thesauri.  The Domains form a mathematical partition of the micro-thesauri.  A micro-thesaurus is a concept scheme. All concepts of a micro-thesaurus are concepts of the complete EUROVOC thesaurus.

From these definitions, a domain is neither a skos:Collection nor a skos:ConceptScheme.

2.2.1    Defined Artefacts

eu:Domain

eu:domain

eu:hasPolyHierarchy

eu:inDomain

eu:microThesaurus

eu:MicroThesaurus

2.2.2    Definitions

2.3    Thesaurus Concepts

2.3.1    Defined Artefacts

eu:Country

eu:isoCountryCode

eu:ThesaurusConcept

2.3.2    Definitions

2.4    Thesaurus Notes and References

EUROVOC notes are SKOS notes with a particular usage convention to facilitate on-line publishing.

2.4.1    Defined Artefacts

eu:language

eu:noteLiteral

eu:reference

eu:relevantURL

dc:source

2.4.2    Definitions

Within EUROVOC the content model of a SKOS note is strictly modelled as an RDF resource (a blank node or a non blank node are allowed) holding 2 properties:


The other properties are:

2.4.3    Usage Note for Encoding eu:noteLiteral with XHTML+RDFa Mark-Up

<skos:definition rdf:parseType="Resource">

   <eu:noteLiteral rdf:parseType="Literal" xmlns="http://www.w3.org/1999/xhtml">

      <h3>Usage of eu:reference as a link in notes (Scope note, History note, definition, ...)</h3>

      <ul>

         <li>Notes can have XMLLiteral values with mark-up from the xhtml
          namespace (http://www.w3.org/1999/xhtml).

            The XHTML vocabulary must be compliant with XHTML version='XHTML+RDFa 1.0'.

            The content will NOT include the xhtml:body element, only its content (plain text, div, ...)

         </li>

         <li>In a note, an href reference may refer to a EUROVOC Concept or
          a EUROVOC term (i.e. a SKOS-XL Label). Example:

            <a rel="eu:reference" href="http://eurovoc/europe.eu.....">xyz</a>

            or <a rel="eu:reference" href="http://eurovoc.europa.eu/C2448">xyz</a>

         </li>

      </ul>

   </eu:noteLiteral>

   <eu:language rdf:datatype="http://www.w3.org/2001/XMLSchema#language">en</eu:language>

</skos:definition>

2.5    Labels and Label Relations

EUROVOC labels are either preferred terms or non preferred terms.  The non preferred terms are either simple (i.e. consisting of one component) or compound (i.e. consisting of 2 or more components).  Equivalence relationships are called advanced (or complex) relationships because 2 or more labels are involved.

2.5.1    Defined Artefacts

eu:acronym

eu:CompoundEquivalence

eu:CompoundNonPreferredTerm

eu:compoundNonPreferredTerm

eu:EquivalenceRelationship

eu:fullName

eu:permutedLiteralForm

eu:PreferredTerm

eu:preferredTermComponent

eu:qualifier

eu:shortName

eu:SimpleNonPreferredTerm

eu:ThesaurusTerm

eu:translationOf

eu:UF

eu:USE

eu:ufLabel

eu:ufPlusLabel

eu:useLabel

eu:usePlusLabel

2.5.2    Label Definitions

2.5.3    Simple Label Relation Definitions

2.5.4    Advanced Label Relation Definitions

Dedicated EUROVOC classes are used to distinguish USE/UF and USE+/UF+ relationships.

2.6    Versioning Properties

2.6.1    Defined Artefacts

eu:termReleasedWithVersion

eu:useInstead

2.6.2    Definitions

2.7    Maintenance Properties for Resources in an Intermediate Release

2.7.1    Defined Artefacts

eu:approvedForRelease

eu:status

eu:toBeTranslated

2.7.2    Definitions

2.8    General Auditing Properties

2.8.1    Defined Artefacts

dc:identifier

dcterms:created

dcterms:modified

2.8.2    Definitions

The Dublin Core defined metadata properties may be available on any instance to define