Previous Search Results Return to Search Next
homeadvisoryhomehomemy homepagesearchhelp
Research Note
Tactical Guidelines
04 June 1999
XML Schema Fuel Interenterprise Agreements on Content
B. Lheureux, R. Knox

Businesses increasingly use XML schema, in the form of DTDs, to establish agreements or IT-based contracts for structuring the content exchanged between applications. The XML Schema specification will impact this practice.

Core Topics

Corporate Publishing ~ Electronic Workplace

Web Middleware and Integration ~ Software Infrastructure


Key Issues

What middleware technologies and standards will be best suited for connecting Web browsers and Web servers to enterprise data and application systems?

What technologies will emerge to implement the IPA and its integration with related architectures?


Strategic Planning Assumption

XML Schema, a superset of DTD, will be a W3C recommendation in 1H00 (0.8 probability).


Tactical Guidelines

To address limitations in DTDs' expressive abilities, enterprises should consider using XML "comments" or other documentation in conjunction with custom application logic to establish thorough agreements on the structure of XML-defined content.

After XML Schema becomes a W3C recommendation, enterprises should consider using that instead of DTDs, particularly to take advantage of XML Schema's increased expressive abilities (e.g., for standard XML data types and value constraints).



XML is increasingly used by business partners in application-to-application (intraenterprise) and business-to-business (interenterprise) application integration projects to establish agreements on the structure of content (see Note 1). Part of the reason for this trend is XML's unique value as a portable data standard. XML-based agreements define business data such as purchase orders and securities transactions. The most common way to express such agreements in XML is by creating a DTD, Document Type Definition, which is part of the XML-based specifications (see Research Note T-07-9653, XML Specifications Proliferate — Hierarchically).


Note 1

Definition of XML Content

XML-based data exists in many forms, including documents, messages and database entries. The expression "XML content" refers to any of these, but also assumes that such data is well-formed (i.e., properly organized and tagged in the syntax defined by the W3C's base XML specification).


DTDs — The Upside. Driven by XML's adoption (see Research Note QA-08-0150, XML Will Ultimately Pay for Itself in Application Integration), DTDs are already widely used to express agreements formally on the format of XML-defined content (see Note 2). For example, the FIX technical committee, hosted by major securities traders (e.g., Fidelity, Goldman Sachs) has redefined its FIX protocol using XML (i.e., FIXML).


Note 2

DTD Heritage

DTDs are widely used for SGML-defined content in the automotive industry (SAE J2008) and for scientific, technical and medical publishing (ISO 12083). In the context of this Research Note, we are specifically referring to the more recently defined DTDs developed with XML.


Enterprises and industry groups use DTDs to express agreements on data structures for two key reasons: they are relatively easy to read, define and understand; and they are based on the increasingly accepted XML specifications. DTDs cleanly separate the agreement on structure from instances that conform to that structure. In the typical scenario, a DTD is created, posted on a Web site for general availability and referenced from within XML documents. XML parsers can obtain the DTD reference from within XML documents and access it to validate content. This technique enables any application — anywhere — to receive and process XML content and parse it against a common, shared DTD.

DTDs — The Downside. DTDs suffer from severe limitations in semantic expressive ability. DTDs have no standard way of expressing even primitive data types (e.g., text, numeric) or value constraints (e.g., "income less than $50,000"). These limitations are typically addressed via nonstandard XML extensions, external documentation or software libraries to enforce constraints. Thus, because XML parsers cannot do full data validation, and because of the negative impact of validation on performance, enterprises rarely use the XML parser validating feature in application integration projects. This also renders most DTDs incomplete and, in some cases, incompatible with likely future XML specifications (e.g., XML Schema). Despite their severe technical limitations, DTDs are the de facto standard method for using XML to express agreements on the structure of content.

XML Schema — A Better DTD. To address DTDs' limitations, the W3C XML Schema (see Note 3) working group has been considering a variety of proposals (e.g., XML-Data, DCD) that extend DTDs' expressive ability. W3C's release of a working draft of XML Schema is a significant step toward "closing the loophole" in DTDs' limited form of expression, and potentially enabling validation to occur at the parser level rather than in custom application logic. Moreover, there is now a movement within the W3C to merge the RDF Schema and XML Schema specifications — further consolidation. XML Schema, a superset of DTD, will be a W3C recommendation in 1H00 (0.8 probability).


Note 3

"XML Schema" vs. "XML Schema"

As if XML was not confusing enough, note that XML Schema, the proper noun, refers to the W3C working draft proposal for a new XML specification used to define XML schema, which are instances of industry- or domain-specific agreements on the structure of XML-defined content (e.g., MathML, OFX). The latter is also simply written as schema for both the singular and plural references to such instances.


Until the XML Schema is approved, enterprises should use DTDs. However, there is no standard way in DTDs to express data types and value constraints. To work around this, enterprises should use XML comments (see Note 4), external documentation or software library routines to enforce constraints. Enterprises should avoid using proprietary XML extensions to express such constraints since these will likely be incompatible with specifications such as XML Schema. After XML Schema becomes a W3C recommendation, enterprises should consider using that instead of DTDs, particularly to take advantage of XML Schema's increased expressive abilities (e.g., for standard XML data types and value constraints).


Note 4

Reusable Comments

For maximum leverage, comment fields in the XML instances could include data typing expressed in XML Schema to be reusable once XML Schema is released. Working drafts of the XML Schema specifications can be found at www.w3.org/TR/.

Acronym Key

DCD          Document Content Description

DTD          Document Type Definition

FIX          Financial Information Exchange

FIXML     Financial Information Exchange Markup Language

MathML     Mathematical Markup Language

OFX          Open Financial Exchange

RDF          Resource Description Framework

SAE          Society of Automotive Engineers

W3C          World Wide Web Consortium

XML          Extensible Markup Language


Bottom Line: Despite their severely limited expressive abilities, DTDs are the de facto standard for enterprises to use to establish agreements on the structure of XML-based content. To address limitations, DTDs should be used in conjunction with XML comments, external documentation or application logic. Once XML Schema is an approved specification, it should be used instead of DTDs.


This document has been published by:
Service Date Document #
Integrated Document & Output Management 4 June 1999 TG-08-3563
PRISM for Electronic Workplace 4 June 1999 TG-08-3563
Internet Age Publishing 4 June 1999 TG-08-3563
PRISM for Applications Development 4 June 1999 TG-08-3563
Application Integration and Middleware Strategies 4 June 1999 TG-08-3563
Web Implementations and Technologies 4 June 1999 TG-08-3563
Application and Web Integration 4 June 1999 TG-08-3563
Internet Strategies 4 June 1999 TG-08-3563

Entire contents (C) 1999 by Gartner Group, Inc. All rights reserved. Reproduction of this publication in any form without prior written permission is forbidden. The information contained herein has been obtained from sources believed to be reliable. Gartner Group disclaims all warranties as to the accuracy, completeness or adequacy of such information. Gartner Group shall have no liability for errors, omissions or inadequacies in the information contained herein or for interpretations thereof. The reader assumes sole responsibility for the selection of these materials to achieve its intended results. The opinions expressed herein are subject to change without notice.

| Gartner Group Home Page | My Home Page | Search |
| Site Map | Help | Feedback | Client Services | Log Off | IT Journal |


Previous Search Results Return to Search Next