- From: Martin J. Duerst <duerst@w3.org>
- Date: Tue, 08 Feb 2000 16:01:07 +0900
- To: www-xml-canonicalization-comments@w3.org
Here are some general last call comments: - At the end of section 1, there should be a short overview over the spec, about one sentence per section. Most of this can be done by just saying that point 1 is discussed in section2, and so on. - At the start of sec. 2, after '[Infoset]', it should say whether canonical XML includes all the required info items, and required properties thereoff, or where there are differences. There are some details about this later for individual items, but it would be good to have a general summary of this relationship. - 2.3, and others: each subsection should fully list the info items that are included, to increase readability and presentational uniformity of the spec. - 2.4: 'For those which': what does 'those' refer to? Information items? Processing instructions? - 2.5: 'could' -> 'can'. - Both in the intro and in 2.6, there should be some text explaining how whitespace is treated. - There is a spurious empty paragraph at the start of sec. 4p - Sec. 4: 'suppose' ... 'then if': Rewrite, e.g. as 'Given a file... and the following XML document'... - Sec. 5: The syntax rules at the start of this section are very dense. This should be explained much better. - As n-tilde is used before, I guess it would be better to use that in place of c-cedillia as an example for a decomposable character. - 5.2: 'Where an element contains two lines are...' ^that - 5.2: In the first list, the second bullet only mentions " ", but not "
". The last bullet only mentions "	", but not "	". There may be other, similar problems here. Also, the case of the document containing a single #xD or a single #xA should be discussed. - 5.2: Codepoint U+000A (#xA) is Line Feed (LF) in IETF/ISO/Unicode, not NL. Please do not invent new things here. Also, CRLF is usually written without hyphen. - 5.6: Maybe it's worth mentionning that the binary ordering of UTF-8 is identical to the ordering of Unicode code points for a string. - 5.6: [5.9 Namespaces]: This is not a reference, and therefore should not be formated as a reference. Please write: 'is described in Section 5.9, Namespaces'. - 5.9: saying that this approach was choosen so that canonicalization is context-independent is a very good start to explain why namespace canonicalization is that space-wasting. However, it is by way not enough. That the canonical form of an element is the same independent of where it occurs in a document cannot be a goal by itself. Please provide a better explanation. - 5.9: While having each namespace newly declared anew on each element might make some sense in some scenarios (depending on the explanation you give for the last point), repeating one and the same namespace declaration on the same start tag is not justified by the note in the current draft, nor can I see any other reason for doing this. - Acknowledgements: W3C Liaison -> W3C Staff Contact - Acknowledgements: U. Ill: please expand. At least people outside the US won't guess that. - Acknowledgements: Please don't acknowledge yourself, or only yourself. To list the members of the WG, a title such as in the XML spec is more appropriate. Regards, Martin. #-#-# Martin J. Du"rst, World Wide Web Consortium #-#-# mailto:duerst@w3.org http://www.w3.org
Received on Tuesday, 8 February 2000 01:58:38 UTC