W3C home > Mailing lists > Public > xmlschema-dev@w3.org > May 2012

Re: ANN: a tool that makes it easier to extract information out of XML Schemas

From: Costello, Roger L. <costello@mitre.org>
Date: Sun, 20 May 2012 11:54:40 +0000
To: "xmlschema-dev@w3.org" <xmlschema-dev@w3.org>
Message-ID: <B5FEE00B53CF054AA8439027E8FE1775182B3118@IMCMBX04.MITRE.ORG>
Hello Michael,

> Did you look at Saxon's SCM file format? 

Yes.

>  It seems to me that if you want to present schema information 
> in a processable form, the SCM is the right model to use.

Respectfully, I disagree.

A colleague conducted a bake-off between SCM and my tool and in every case it was easier to extract information out of XML Schemas using my tool.

Additionally, my tool is implemented in XSLT and can be used with any XSLT processor.

/Roger


On 19/05/2012 19:42, Costello, Roger L. wrote:
> Hi Folks,
>
> I created a tool that makes it easier for you to extract information
> out of XML Schemas.
>
> Here is the tool:
>
> http://www.xfront.com/XML-Schema-Tool-for-Easy-Information-Extraction/index.html
>
> Motivation for the tool:
>
> Here are a few examples of queries that I've needed to perform
> on schemas in the past:
>
> - What are all the elements and attributes that are declared
>    to be of type xs:QName (or xs:string, or xs:gYear, etc.)?
>
> - For simpleType A, what are its applicable facets? (Take
>    into account the facets in all its ancestor simpleTypes)
>
> - How many element declarations are in the schema? How many
>    complexType definitions? simpleTypes? attributes?
>
> - How many lines of schema code are there?
>
> With my tool it is easy to get answers to those questions.
>
> Without this tool, it can be difficult to get the info you desire
> from XML Schemas. Here are a few reasons for the difficulty:
>
> 1. The schema may be scattered over multiple files. So you have
>     to search through multiple files to find the info you want.
>
> 2. A simpleType may be part of a long chain of restrictions. And the
>     simpleTypes may be scattered over multiple files. That
>     makes it difficult to know exactly what is the net value space
>     for the simpleType.
>
> 3. Likewise a complexType may be part of a long chain of derive-by-
>     extensions and derive-by-restrictions. And the complexTypes
>     may be scattered over multiple files. That makes it difficult
>     to know exactly what is the final set of elements and attributes
>     in a complexType.
>
> 4. An element may be substituted. So, many different elements may
>     be possible at a certain point in a schema.
>
> 5. Consider an element declaration with a type attribute. The type
>     definition could be located in many places: in the document that
>     the element declaration is located in, in a document that it
>     includes or imports, or one that they include or import. It
>     could be in the document that included the document that
>     contains the element declaration. And many more places. Ouch!
>
> 6. The elements and attributes in a no-namespace schema are
>     part of one namespace when they are included by a schema with
>     targetNamespace A and another namespace when they are included
>     by a schema with targetNamespace B.
>
> /Roger
Received on Sunday, 20 May 2012 11:55:09 UTC

This archive was generated by hypermail 2.3.1 : Wednesday, 7 January 2015 14:56:20 UTC