- From: Martin J. Duerst <duerst@w3.org>
- Date: Wed, 31 May 2000 14:26:01 +0900
- To: www-xml-schema-comments@w3.org
Forwarded on request of C. M. Sperberg-McQueen3
>Date: Tue, 30 May 2000 18:11:01 +0900
>From: "Martin J. Duerst" <duerst@w3.org>
>Subject: I18N Last call comments on Schema Part 1
>
>Dear Schema WG,
>
>[This mail is crossposted to the I18N IG to allow for further discussion.
>Please feel free to forward these comments to another list, including
>a public list, but please make sure that you don't reveal the mail
>addresses of the various groups.]
>
>This are the last call comments on XML Schema Part 1: Structures
>from the I18N WG/IG.
>
>The comments are numbered by [n], but their order does not
>reflect their importance.
>
>
>[1] The spec repeatedly contains language such as
> "the string composed of the [character code] of each of the
> element information item's character information item [children]
> in order"
> This is overly complex and confusing. First, a string is composed
> of characters, not of character codes (which are numbers). This has
> to be corrected. Second, the phrase is used so often and the concept
> behind it so obvious that it would help a lot to define a term for
> it once.
> [Similar phrases are also found in Part 2, this comment should also
> be refelected there; it is made here only once for both parts.]
>
>[2] Section 3.12 says: 'In the case of {user information}, indication
> may be given as to the identity of the (human) language used in
> the contents, using the xml:lang attribute.'
> Please change 'may' to 'should'. Also see points [3], [4], [5].
>
>[3] Please indicate how annotations in multiple languages are done.
> Being able to make annotations in multiple languages in a
> clearly defined and interoperable way is important.
>
>[4] Section 5.9, in point 2, says that the value of xml:lang must
> conform to the req's set out in XML 1.0. There are two problems
> here:
> - The exact details of the requirements in XML 1.0 are being
> dealt with as errata (see also point [31] of our comments to
> Part 2).
> - There should be no need to spell this out; this should be
> clear by the fact that XML Schemas is about XML.
>
>[5] It should be made clear that <documentation> can contain
> additional markup. As neither <annotation> nor <documentation>
> is defined in App. A, this isn't clear.
>
>[6] It should be clear that for all references of URIs/URI References,
> this is to be understood as including the provisions of relevant
> section of the W3C Character Model (http://www.w3.org/TR/charmod/#URIs).
> Please see point [30] of our comments to Part 2.
>
>[7] In http://lists.w3.org/Archives/Member/w3c-xml-schema-wg/1999Nov/0007.html
> we have made a detailed request to make sure that XML Schemas can
> address the problems of i18n-related markup. This detailed request
> was listed as issue 209 but summarily abandoned.
> http://www.w3.org/XML/Group/xmlschema-current/issues.html#easyAddIns
> We have not received any response that would allow us to determine
> that these issues are addressed satisfactorily in the current spec.
> We herewith resubmit the abovementioned mail as part of this last
> call comment, and request the XML Schema WG to provide a detailled
> answer as part of the resolution process so that we can decide whether
> our requirements are met. Apart from this general answer with followup,
> we mention a few specific points below [8].
>
>[8] The mail mentioned in [7] mentions addition of elements and attributes
> in general, but one particular and particularly frequent case is the
> addition of child elements to elements that do not have any child
> elements defined yet. In the current draft, such elements can be
> defined in two ways, either as 'mixed' without any elements specified
> or as 'string'.
> [There may be a third one, 'textOnly', as guessable
> from 4.3.3. However, the spec seems not consistent on this. For example,
> there is:
> {base type definition}
> The type definition resolved to by the value of the base
> [attribute],
> if present, otherwise the simple ur-type definition if the content
> [attribute] is textOnly, otherwise the complex ur-type definition.
> but earlear, there is only one ur-type, so this is confusing.]
>
> In order to make extensions easy, the 'mixed' type without child elements
> and the string type (as long as not restricted by a facet, and see point
> [9]) should be merged. In terms of functionality, this should not
> provide any problems at all, because it is just a question of deferring
> decisions until they really are necessary.
>
> It may be claimed that instead of merging 'mixed' and 'string' as above,
> it would suffice to always use 'mixed' in cases further addition of
> elements is desired. However, we feel that this is not sufficient,
> 'string' is too easy to use and will be used in too many instances.
>
>[9] As explained in item [35]/[36] of our comments to part 2, it will often
> be necessary to include character repertoire constraints in XML Schema.
> Such constraints should also be applicable to character children even
> if an element also has element children. This can easily be done by
> allowing a pattern facet even on complex types provided that this
> pattern facet only consists of a character class expression.
> This does not pose any problems with respect to the interleaving
> order of characters conforming to the pattern and elements conforming
> to the content model.
>
>[10] The verbal complexity of the XML Schema specs, in particular part 1,
> is extremely high. We have serious doubts regarding understandability
> by non-native speakers as well as translatability. We ask the XML
> Schema WG and the editors to undertake every effort to use clear and
> simple language.
>
>
>Regards, Martin.
Received on Wednesday, 31 May 2000 01:53:51 UTC